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lation is a constant different from zero. Thus, for example, one might 
attempt to hedge an inventory of cash good hy purchasing futures for a 
good which is a complement in the sense that ctoi < 0- 

The generalization to multiple futures contracts says that the pro¬ 
portion of output that should be hedged in each contract is given by 
the coefficient of the theoretical multiple regression of cash prices on 
the n futures prices (XnSio is the vector of these multiple regression 
coefficients). This result suggests that the partial (given remaining 
futures prices) correlation coefficient between the cash price and a 
fuiures contract is a better indicator of the usefulness of that contract 
for hedging purposes than is the simple correlation coefficient, hhus 
it is conceivable that the optimal hedge against a long cash position 
involves a long futures position in a contract even when the (simple) 
(orrelation between the cash and that futures price is positive. 

Finally, it is worthwhile pointing out that the pure hedge term, 
ySii'Sio, is etjual to the risk-minimizing futures position corresponding 
to a predeteimmed cash position y. A hedger thus uses the possibilities 
offered hy futures markets to minimize his risk, then he speculates. 
This interpretation should not, however, conceal that futures and 
cash positions are jointly determined and that the risk-minimizing 
futures position is generally suboptimal since it ignores the oppor¬ 
tunities for speculative return and because it is decided conditionally 
on a cash position that will generally Ije suboptimal. It should be clear 
from the previous analysis that the larger | o-qi/o-,, |, the more effec¬ 
tive in reducing risk is a single cross hedge. This criterion could be 
used for selecting which delivery month, which exchange, or even 
which underlying commodity would offer the best risk-reducing op¬ 
portunity. 

IV. The Optimal Cash Position 

F.quation (.')) characterizes the optimal futures position, given y, in¬ 
serting (5) into (IF) gives the necessary condition for y given that / is 
adjusted ojjtiinally asy varies. After simplification, this condition is 

p\ “F ~ p\) ~ yx(^oo “■ ^oi“i!^io) (y) “ 0- il) 

Equation (7) is nothing else than an equalization of marginal reve¬ 
nue and marginal cost. The latter is the sum of the two last terms: c'(y) 
is the marginal cost of production; xyC^oo “ ^oi^u^io) 's the marginal 
risk premium. The risk premium can be rewritten as x)'0’oo(l 
where is the theoretical multiple correlation coefficient squaied. 
Thus the risk premium depends on the degree of risk aversion, the 
size of the cash position, the variability of cash prices, and the propoi - 
tion of that variability which cannot be eliminated through the opti 
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mal hedge in the n futures contracts. For a given cash price variance, 
the risk premium decreases as the multiple correlation coefhcient 
increases: consequently, the higher tlie/f^, the larger the cash position 
(in absolute value). 

The two first terms of (7) can be viewed as a "planning price,” used 
as a substitute for the as yet unknown cash price. The planning price 
is a weighted sum of the expected cash price and all current and 
next-period futures prices. The weight given each futures price de¬ 
pends on its coefficient in the multiple regression and thus corre¬ 
sponds to the proportion of the cash position hedged on that particu¬ 
lar market. Again it is enlightening to look at the simple cross-hedge 
case (n = 1); the planning price can then be written 

(1 -/3)^ + y3(K-«), (8) 

where /3 = croi/o-,i and R = is the expected value of what is 

known as the basis, the difference between futures and cash price. 
The appropriate allocation signal for the decision maker is now a 
linear combination of the expected cash price and the currently 
quoted futures prices "scaled” tor the expected difference between 
futures and cash prices. The intere.st of this formulation is in the fact 
that people in the ttade often asset t that bases (i.e., price differences) 
are more easily predictable than price levels. Fquation (8) shows that 
the closer the relationship between cash and futures, the more the 
hedger can rely on futures price and basis expectation rather than 
price expectation for "pricing” his output. 

From this analysis, it is quite clear that an increase in the absolute 
value of the covariance between futures and cash pi ices, | cro, |, may 
have conflicting effects on real production. On the one hand, it 
decreases the riskiness as.sociatcd with the "rt'al” activiiv of the ()io- 
ducer; hence the proportion of marginal revenue .illocated to risk 
remuneration decreases and the cash position can be made larger m 
absolute value. On the other hand, the heavier reliance on the futuies 
price in making production decisions ma\ entail .1 smallei cash posi¬ 
tion depending on its level relative to the expected c.tsli price 

In the general case (n > 1), vve can also mtei pret the planning pi ice 
in terms of the vector of bases, ot the lorm - p,. For the hi st two 
terms of (7) can be written as 

(1 -/ii)/>, -t-j8(/4 - «i). 

where /3 = Soi^n . i is the unit vector, and = //, - (the vector 
of the expected bases).^ Thus observed futures prices and b.isis ex- 


^ We could also conceive ol ihe IoHovmii); .lUein.iuve ot ti.isiv toi die 

case of n > 1 . la't the generali/eil basis Ih‘ Hg - i.„|i - ,''i. ibis is die dit lei eiu e ot 
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I he (.ISC foi posilis c lime |)ic(<‘i ciK c IS ahsoUiU-h (iiiii|)cllirif», mi less 
llicic IS tin iiiliiiiu nine hoii/on wiili the espe* t.iiirm ol iiiiciidiii^ 
U (Imiiloj^K ;il :i(lv.111(1' 1 (iiiihiiicil with wli.ii we (,i|| ' ili .isi» ,ilK tii- 
luinisliiiiis iii.ii^m.il iiiilin ' tins limliii).; hulils hoih in ilic iiusiinc 
.111(1 11(11 111 .Ill\c senses. \ (ol(ill;ir\ IS ih.it s.i\nij^s ;ne iineiesi el.istu 


■Most wiiK'ts oil tnpitnl iheoiv ha\o taken it toi ^i.iiitetl tliat inosi 
peoitle s\ slem.itK .ilh piefei pieseiit to (tiltiie i onstiiiipt ion. I tiis 
posiiue rate o( iinie pteletetue ts iisii.ilK asttthed lo a "laiilt\ tele- 
stopit (.iiitlu,'' to "tti\o|)t,i," ot to "itiip,itieti<.e," e\t‘n tlioitjili thts 
itiiplies .til it lationalttN oi (oiitiiiiiiiit' peiiepttial Iti.ts th.it is iittoiisis- 
leni with the Ijeli.o loi al .issiniipiiotis eioiioinists usii.tlK use. lo oiti 
knowledge no one It.is e\t'i provided loiit nuin^ e\ ident e ili.it tlieie is 
III 1 .111 noi iiialls piisiliM' tune prelen in e, ot e\ en spec ilied an ein|iiri- 
lal lest (.t])able ol detei niiiiin}; vshethei there is oi not .XppareniK 
die lieliel in nnopi.i often rests on stil))eiti\e jnd^'inents altoiit the 
relaiise ntilits lesels ol oihet individii.ils .n loss ilillereni nine pet lods 
ih.ii h.tse sell liniited siieniifn value. In oihet i .tses .i prelereiiie lot 
inesenl i)\er ftititre c onstnnption is ertoneonsls snpporied with e\ i- 
deiue ill.It, .IS we sh.tll aigtie, need lelieit nothing mote ih.in .i 
1 at ton.il adajil.ition to |)ositi\ e iniet est i aies ^enei ated h\ tlie lai t ih.ii 
< .ipilal goods 1,111 he nseftil in ptodtu lion In still other insi.niies time 
preleii iue is siip|)osed to loliow direith liotn the l.u t th.u there is 

c .lie iiuteliii'll lo C..ti\ Ilec ket. ( .hi istoplici (.l.i^iie. llon.tU) Dcwci RoIh'II 
Ooiliii.iii. I i.illiiij; Koopin.iiis, Pcici Miiiiell. (.e(>i)>c SiikIci. P.miI Womm.hoii .hi.I 
Howell /,(■(■ loi lielpliil . oiiiiiiriiis .iiul sii);)>esiioii\ P.iil ol the woi k on ihis .iriii li ((.is 
siippoiied Ik 1(1,1110 Iroiii ilie t iiviioiinu-nl.il I'loteOion .'\i>en(\. ilie N.iiioimI S, km. (. 
)oiiiid.iiioii. .111(1 Ke.MxiioK fill i)ie f'tuiirc 

I/""frui/ (,/ /’n/Kini/ /-turioin), 84. m> l| 

' l'*HI |>\ I 111 \ ,i| ( hu.ig.) WlIH.'HI 'id 
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;ilwa\s tiiKcilaiiii) ahruit ihe limnt', so the consumer cannot be sure 
uhcihei .in\ assets iliat he ai(|uires by savinjf will retain their value or 
e\eii that he will he alive to enjoy any fruits of his saving. In fact, 
lUKeriaints alioiii (iiinie economic conditions and the duration of 
ones life (.in iiKie.ise as well as decrease the incentive to save. 

1 Ins papei develops an approach to intertemporal choice that 
makes ii possible to deiennme Ironi actual household choices in 
(cit.im ( IK nmstames vvheiher oi not there is positive time prefer- 
enie Ihe enipnical lesis on ordinary saving and consumption he- 
ha( loi ih.il glow onl ol this approac h suggest that a decidedly positive 
Kite o| lime prelereiKe is ivpical il not almost universal, 'fhere is an 
.ilieinaiue iiiierpielat ton ol the evidence that permits doubts in cases 
whi le the pi.inning hoii/on is infmite, and this we label "drastically 
dnnmisinng maigmal nlililx." 'fills intriguing phenomenon could in 
pimii[)le be wides|)iead. In long-run contexts it is so similai to time 
pieleieiue m Us iinplu aiioiis that the two are difhcult to distinguish. 
\onelheless. espei i.ilb when short- as well as long-nin contexts are 
(onsideied. (he case dial (heie is often lime (nefeience becomes 
I ompellmg. e\en il diasiicalh diminishing maiginal utility should also 
be pieseiil 

We shall also show dial those who advoiaie leiiain enviromnental 
,nid lesoniie policies on die gioimd dial the interests ol all fiiiuie 
(e.nsand geiieiaiions ought to be wc'ighif.’d t'tjiially withoiii (oiuein 
loi niiliii in die pieseni aie (oniiadic led l>v iheii own Itehavioi. We 
will siimlai K show ih.ii it is logii.ills iiuoiisisteiit to argue th.it theie is 
posiiue lime )iie(eieiue and dial household savings do not generally 
lespond to ch.inges in mieiesi rales 

I. Positive Time Preference and Diminishing Marginal Utility 

Ofieii till idea ol tune pieleieiiie appeals in an impiessionistic lorm 
widiinii an w oi kable dehnii ions (see, e g.. Pigou IW)();Jevons lOb.a), 
ihough iheie aie also a niimbei ol (jiiite iiselul elloris to obtain a 
pieiise undei standing ol die iiiaitei.' Inieicsiinglv enough, the 
ilearesi loncejiiion of |)osiii\e lime pielerenie that we have been able 
lo find w.is in Kolitii-Haweek's original account. 

guilds hiii’r m suh/xln'i' thtm /utKX' 

Ifiii/I iiitrtwi'ilnilr) oj irjual i/uantily and /jiialily ■ ■ ■ 

lint pi me i])al c ause capable ol prodiic ing a difference in 

'See ilie liel|<liil elwi i/ssioii in Maiglin (ItMi't). telilsli-in (I'Ki-t). .mil Itii sfileifei 
(I'MIl Iln-Te .lie .liso ini|»iii.ini .iiiiiles iliac .cpjxMi cil siiiiicw li.il f.irliei. iiotalciv 
S.muiels(.ii (l'K7), Siml/ (l').'.IJ). Deljieii (l't.5'1), .mcl KiMipiilaiis ()<«)()), 
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value between present and future goods is ... if a person 
suffers in the present from apprec iable lack of certain goods, 
or of goods in general, but has reason to hope to be more 
generously provided for at a future time, then that person 
will always place a higher value on a given quantity of imme¬ 
diately available goods than on the .same quantity of future 
goods. This situation (xcurs with very great frequent y in t)ur 
economic life. . . . 

We must now consider a second phenomenon of human 
experience—one that is heavily fraught witfi tonsequence 
That is the fact that we feel less concerned about future 
sensations of joy and sorrow simply because they do lie in the 
future, and the lessening of our concern is in proportion to 
the remoteness of that future, (ainsequenilv we actoid it) 
goods which are intended to serve luture entls a value which 
falls short of the true intensity of their luture marginal 
utility. llO.'if), pp. 26.3-73] 

This (juotation suggests two distinct elements, each of which might 
influence intertemporal choice: (I) diminishing marginal uiilin of 
consumption at any given time, where utility is an unchanging func¬ 
tion of the iimount of consumption at the time, and (2) discounting of 
future versus present titility. I his distinction is so futidameiiial that 
we believe there can be no adet|uaie explatiation of inierteiiipoial 
choice W’ltliout it, yet it is too often ignored Hath of these two ele¬ 
ments, moreover, naturalh suggests a testable lupothesis that can be 
proven false. Biuley, Olson, anef Wonnacoti have shown elsewhere 
that " The Maiginal Utility of Income Does Not IiKiease" (1980) and 
in at least a great majority of cases is aclualK decreasing. In this papei 
we accorclinglv accept Bohm-Bawerk’s assumpiion that there is cfi- 
minishmg marginai utility to consiiniption in eat h |x*riod and go on to 
explore whether the available evidence* is consistent with the h\ pothe- 
sis that utility in the future is valued less than uiilitc in the present. In 
oi rlei to do this, and to ofitain the full range of hvpiotheses that can Ije 
drawn with Bohm-Bawerk's distinction, we shall first develop a more 
formal statement of the distinction. 

following a suggestion by Hirshic’ifer (1970),'* we represent utiliiv 
at monient I by a function i'(f-c); the consumption is time sub¬ 
scripted, but the function is time invariant, depending onlv on the 
level or quantity of regardless of t. Then with time measured in 
terms of discrete periods of ec|ual length, we write the consumer's 

” . he ttirshk-itci dnalysis in lum lus pic-n-dc-iiis in ihr .nticli-v h\ .Sanuielson (I'.CtTl. 
•Stiolz (I9.5t)), I'lehrcu (19,59), and KcHipmans (1909) ou-d in ihe prc-ceding tiHiiiioir 
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fuiuiioii to i)C ina\iini/c(l a( time zero, subject to a wealth 
budi^ct ( onsli ;iiii(, as 


r 


-(' + vY' 


( 1 ) 


ulu u' Ilu' (Oiisimu I S Iiiiif hon/on lor planning is 7', which nct-cl nol 
Ik- (mile, and uheie t) is a conslanl, the tale per period ol positive 
lime preleieiue We assume that ihe liisi clerivaiiveT’' is positive and 
(hei.iiise oi the finding mentioned helore) tli.it the second is negative, 

T-" ■ d. 

\ou (onsidei ,i household choosing .nnong alternative toinbin.i- 
IKills d I,, b,) ol toiisimipiion .it / ^ 0 .nid / ^ 1. holding its consiiiiip- 
iioii paiiei ii loi .ill l.itei pel lods lived. II we lake the dilleieniial ol 
I 1 1 , selling./(', ^ d loi .ill / -, we h.ive 
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• md seltmg rl( - d to lepiesent holding the household mdillc-ient. 


.•'(b„l,/r„ 


e'(( .,V/ b', 

T-+ r,~' 
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dr, 
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I he household will demand <m inc le.ise in r,, gi c-aler than llu- i eciiu - 

(ion III r,,|, (h,il IS, 

I 1 

i i/C ,, I 

lo hold nlilits (Oust.nil, li eithc-i ol two things is It tie 

r I • (.„ with rj - 0 . (A) 

ol 


Tj '=• (I wiihr,, == r;,„ (.^l 

(loiiditioiis lA) ,iiid l5) Slice incih st.iic- the- two c.iuses ol interesl in the 
(|noied pass.ige of Bohiii-B.iwei k. 

1 ne(|naliic (4) follows heeaiise e" 0, Diinmishmg iiiaiginal niiliiv 
of iiKoine miiilies that, if all else is <-<|nal, .i household with gre.itei 
eonsniiiption potential in the latei pei lod will he willing lo boriow at 
a |)osiii\e inieresi late in oidei to make eonsnnifnion in the two 
petiods iiioie nearh ecpial I his is the “fust cause” of the prefeience 
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(or jircseiit over tulurc (.oiisurnpiioii meiiiionecl in the quotation 
from Bohni-Bawcrk. 

"I he- liy[)otbcsi/t‘(l spcond ciiuse is ifprcsontcd tiy llif (onslant t) in 
the inierteniporal utility function ( 1 ). When it is positive lalhei than 
zero as in ( 5 ), the householfl would Ik- willing to pay some |)ositive 
interest rate to inirease consumjition in the initial period at the 
expense ol ioiisuniptioii in the latei period, even with equal endow¬ 
ments 111 (he two peiiods. 

Bv tiacitig out ihe comhmalions of and (., to vshich the house¬ 
hold is iiidillereiit, using (1) or (‘S), assuming rdf;,) and tj are known, 
one can oblain an indilleience (m ve helween preseni and nexl- 
]ieriod consumption. That is. e(|ualion (I) is an implu it Imu tion ol 
and f.'i, when all othei f,, are held (onstant and when I' is also helfl 
coMsi.int, ihis (muiion, 

I'll = I’K'ti) + ^ const., 

1 + T) 

IS the e(|iiaiion ol an mdillereiue (Ui\e in (f.,,. (■.',) s]).i(e. Siuh an 
nidi I lei eiue cm ve / is shown in figure I. .Xiong the 4 . 5 " line (■„ = f.',. 
and on this line anv tangent to an indilleience cni\e, sm h as.l/l/>, has 
<1 slope of -- (I + T)). I Ins can Ik- seen l)\ setting f,',, = (., m equation 
(5), Ol h\ noiing that when consumption is the same m both |K-iiods. 
.m\ jneleieiue loi piesent oxei Intuu- (oiisumption must l.)e due to 
V 

I hough ihe marginal rale ol snhsiiiniioii hetyceen lonsninption in 
.nn two peiiods toi .i household in eqnilihinnn is lieqiieiiily (.died 
ihe I ale ol ''lime pi elerenc e," m hit h is ol i oiii se eipi.il lo ihe iniei esi 
lale, (his casn.il ns.tge has caused a lot ol (oninsion We iherelotc 
slioiigK snggesi lh.it "posinye time pielc-ieiice" he delined lo e\( hide 
ihe elleci ol .i difleieiue in maigni.il inihl\ due lo .nu lower le\el ol 
consiimplion in the pieseni .nid, iheielore. lo iiHlnde onl\ .in\ piel- 
(lenteloi pieseiil ovei I nl in e < onsnm|>lion due lo ol hei i.inses. In 
oni loi mnl.ilion, lime preteieiue is ihen delmed .is .i posilne c.ilne ol 
the (onst.nil tj, i .nhei ih.ni in lei ms ol the eiilne expi ession ( 5 ), w hit h 
gives ihe slope ol ihe indilleience inrce. Ihe r.nio ol maigm.il 
niiliiies, depends noi on nine hiii on le\els ol con- 

snmpiioii III the Ivso ])eiiods; .ni\ preleiente foi ])ieseni o\ei Inline 
nlihly. liy contr.isl. t.in he (on\enienll\ lepresenled h\ the snh|ecli\t‘ 
disc onni i <ite tj. 

11. Separable Utility 

Beloie wc can properly deiive oni posinve resnlis, wt- need lo con- 
sidei whethei the vievy ol mleriempoi.il choice set oni m e(|n.ition (1) 
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IS .ippi opt laic. In [laiiiiulai, wc iR'od lo <oiisidcr wlicihci il is noces- 
sai \ or t'xpcdiciH lo vvi ilc uiiliiv as a f uiu non ol coii.snniptioii in fvery 
|)<.'i lod. 

r = ..c,). (ft) 

I Ins toi nnilaiion, unlike dial in equation (I). would iorinally I'liioin- 
p.iss am ( oinpk'iiK niai ii\ Ih-Iwcimi die levels oT ionsurnptioii in dii- 
li-reni penods liuk'ed, one lould ^o luidier and write utility as a 
luniiion ol die ainoiinl of ea< li sepaiaie good consiiiiied in each 
peiiod—as uoukl be iieiessari. lor example, ii the typical liouseliold 
consuined a lillle < leam today lo go vvilli the coflee il will consume a 
week tioiii Monday. In olliei words, the utililv derived from a given 
level and panel ii of consumption m a particular period could con¬ 
ceivably depend noi only on the level but also on the composition of 
tonsunipiion in imoi |)ei locks I here are examples ok interdepen¬ 
dence ol consninpiion on diflereni dates, such as a preference to have 
ditleriMil dirinei menus on successive days, and it could be necessary 
for some special pui[)oses lo take the.se into account. However, as 
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Samuelson (1937) has argued, functions that allow unlimited inter¬ 
relationships become so general as to be almost vacuous. If a utility 
function rules nothing out, it will be consistent with almost any possi¬ 
ble type of behavior and thus generate no potentially falsifiable 
hypotheses of interest. In addition, it has been shown elsewhere 
(Bailey and Olson 1977; Bailey et al. 1980) that the assumption of 
separability is more realistic than is usually supposed. Finally, it is 
hard to imagine how any plausible type of temporal complementarity 
could generate results that are qualitatively different from those in 
this article, with one obvious exception we can handle now. 

An individual cannot, of course, derive utility from ttmsumpiion in 
any period unless he has survived prior periods and also preserved 
the health needed to enjoy consumption. This could be accommo¬ 
dated formally by treating the essentials needed to preserve life and 
health as crucial investments in human capital. This would not really 
solve the piohlem, though, since the temporal interdependence 
would still have to be taken into account in investment decisions. 
.Similarly, if a family is to have grandehilchen or more distant descen¬ 
dants, then each intermediate generation must have at least enough 
resources to survive long enough to raise children. I'lnis we em- 
phasi/.e that our analysis must Ik* modified before it can be applied in 
anv situations where levels of consumption too low for healthful 
viability or reproduction are at issue. Abstractly, we deal with this 
starvation level of consumption by supposing that v'iC) —* x as C —* 
and that v{C) is undelined foi C.' < (liven this ipialihcation. we 
can safely cemtinue to use iumiion (1). 

III. Erroneous Inference of Myopic Behavior 

In Section 1 we distinguished lime preference from the effects of 
diniiiiishing marginal utility o( consumption in eai h period. With the 
aid ol this distinction we can clear up one elementarv yet surprisingh 
common erroi. Since Bofim-Bawerk's nine, if not Ix’lore. some 
c-conoinists have assumed that positive iiileiesi rates indic.iie that most 
people have positive time preference. Fhough we sh.ill later provide 
evidence that time preference is in fact positive, this finding bv no 
means follows from the existence of inieiesi. 

1 his is obvious once one considers the implications ol the tael that 
cajntal goods can be useful in production. The existence of profitable 
investment opportunities implies that reduction of consumption bv a 
doll; ir now will ;illow more than a doihu's increase in consumption in 
the future. A glance al equation (1) reminds us that even if time 
preference (tj) w'ere /.ero, this fact can generate (Kisitive interest rates, 
be cause diminishing marginal utility of consumption in each period 
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cm.iils tli.il (Diisiiiiu'is uill a lowci level of eoiisuniption in one 

period liian .uiodiei onK if tlie\ ate leivardeci lor doin^ so. Obvi- 
n(isl\, p<)siii\e iiileiesi rale.s anti the exte.ss t>l inaiginal ulilily of 
pieseni oeei dial oi future i onsninpiion that lliey induce' doe.s not by 
iisell |usiil\ till' (ontliision that there is any pref'etence for present 
o\ei luliiie uiilin, unit h less peisisleni mvo))ia or irrationality. 

V\ hen one looks at holli the demand and supply sides, it becomes 
( le.u dial eat h of the following toinhtnations of conditions is 
sullitieni lo genet.lie die (nstoinaiA posilice interest lates; (1) di- 
ininislnng in.iigm.il unlitv plus profitable investment o]iportunities 
with nondec re.ismg mtoiiie (.iiid /eio time pief'erente); ( 2 ) di- 
niimsiimg m.ugin.ii utilitv with iiit leasing endowment income (and 
/eio time pieleieritet. (.'() posilne lime pieference with const,int 
intoiiie oi widi pi olit.ible iin esiiiieiil oppoi tiinilies (and nomnereas- 
nig m.iigin.il iiiiliu) llieieloie. die l.iti di.il iiueiesl lales aie posi- 
ii\t‘ does noi l)\ iisell lell us wliitli t.iiises are operative. OnK evideiue 
ol ,1 dilleieni kind could tlemonstiate tli.it tlieie is positive lime 
pi elei enc e 

I lie I iidinieiii.n \ t onsidei .iiioiis just mentioned .ne, liowevei. 
sallii leni to lell us someiliing .iboiil die i (‘l.ilioiiship ol i). wlielhcn it 
lit' posiiite. neg.ilive. oi /eio. .iiid the piev.iiling inleiesl i.ue (ion- 
side i mil i.ilK ,1 in,inn.i-l I oiii lie.iven woi Id vvilli no |)i odiu ers. .Sui li .i 
aoi Id t oiild li.ii e .1 ( oiisuitiptioii-lo.iils m.irkel iv illl .i detet min.He t .He 
ol inieiesi III long-iun etpnlibi amt Suite ue li.ive not s pet i I it'd the 
nines al ulm li in.inn.i kills lioni lie.iveu. oi \ el olleied ,uiv evidenie 
di.ii would eidiei eiii.iil oi lull' out posHive time pieleiente. llie 
inieiesi I, He in tills t onsumjilion-lo.uis m.irkel lonltl be .mvllimg— 
t veil /ei o Ol iieg.Hive 11 die tii.uin.i kills eveiiK ovei lime, the mteiesi 
i.ik III tins < onsiiniptlon-lo.uis m.u ket is bound lo be e(|u.il lo r/. 

bill now siijipost ili.il .itlv.iiil.tgeoiis piotiutlioii jjlot esses ,ne dis- 
I o' eit d III " lilt li jii odut ed t .ipii.il IS .ill .11 gumeiil in the pi od in 1 ion 
I lint I ion loi ( onsum|)l ion gootls Sint e this tlist o\ei \ will int i e.ise the 
dem.ind loi boiiowings but not shift tile siijijiK luiulioii loi lo.ms, 
lilt inieiesi i.He imisi list, file result is tb.H m etjuililirium i ■ t/ 

1' \ < epi loi ,1 leu tM I eme I .isc’s. die i e.i! uorld u ill be one in u hie li i - • 
r/. bei .nise in .ilniosi .ill i e.il-u oi Id t .ises t ,i jiil.il gootls h.iv e a jitisilive 
iiel in.ngin.il piodiiil 1 he m.uginal i.ile ol siibslilulion belueen 
pitseiil .111(1 luiuie (oiisumplioii iii long-iun eijiuhbi luiii will ol 

I ]■( (lilif.uliics iin..)\rf( III iii\ lihiti in mu iifi.iiriliigiioiis frie.isdit ol Hit 
. .i|u. .il .1. K k .1 IK 1 I lie esiriKIt <1 I (inn o' cl Ml s I li.it li.i't gi o" ti out oi I his sliouill mil. ol 
tonisi olistun ill. t.Ki Hi.n (,i{iit.il .it t iiiiiiil.itioii (e(|iiiit-s jiosl |ioii(-(iieii( oi (oiisiiiiiji 
no'I holnii-ti.mt it ' It Ml mgs (t<'s|iiie Hit ii ucll tiio" ii slioi It oiiiiiigs, ,ii e siiHit iciii lo 
silo" Hiisisct iloiiiii.iii i'l'aiti, l(|"i'l/t) 1 liiis itie |ii eseiit e ol “i (‘s" lit liiiig,*'c g . (lots 

Mol uiKicrmiiit out .iii.il'sis hfic 
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coiiise always equal the rale of'inlerest, but 17 is not in Kt-iu-ral equal to 
this marginal rate of subsiitutioii, even if 17 is positive. 


IV. Is Time Preference Observable? 


ft is inluitivclv obvious from ibe discussion of figure I and from what 
lias )usi been said that we can ascertain an mdividuars late of lime 
pieference bv observing the interest rale at which he chooses a level 
consumption stream. This means that we have- a hi m and simple basis 
for using .1 levealeci preference- appioach to obl.iin solid scientific 
evicleiKC- about lime preference and bave no need to apiieal to any- 
ihmg resembling introspection 01 iiilei personal conqiai isons of tilil- 
itv .■* 

I he cone lusioii that the household chooses a level consumiition 
stream if the 1 ate of interest eciuals the iale of pine lime ]iiefc-ieiue 
cleailv holds lor the multiperiod as well as ioi the two-petiotl cases 
(see S<inuielson UK-l?) 1 his finding is iisell enough to make time 
jiieleience ohseivable. U'e iioic-d eailiet tliai, unle-ss mconies ate 
declmtiig, the existence of piocluciive investment opjioi tunnies im¬ 
plies that the e(|uilil)riuin late ol iiiteiest will be grc-atei than the rale 
of time pieleiencc-. aticl this is true bowevei high the rale ol nine 
pielerente might he 1 hc-ieforc- we would predict that in most cases 
households would have- positive levels of savings that geneiaie a using 
level ol c onsu nipt ton ovei time I'liis iheorc-iic al pi edit non appeal s to 
be lelativflv lealisiit 

I heie ate also oiliei wavs to measure time [ireference from le- 
vealecl piefeieiKes. If the household niaxmii/es ntilitv il) sub)ett to 
the InicIgei t oiistrami 


V > .1 - 

rr;, (1 +')' 


- 0 . 


| 7 ) 


varvmg each to obt.un this maximum, the result implies 

!''(<-,) ^ (I T?)' 
e’ff.i,) (1 + T)' 


(HI 


'In si'ckiM|» 1 t’VcMifd (-IUf we .lie. of i iiuise. onic loiiiiiuiin^ ,i line ol iiii.]iiiM 

sl.iiieilln oilieis In 1 n;f 7 S.iiiuielsoii piofsiseil .1 In}>oltieii( .il iiii'iIkhI loi obt.iiiiiin-the 
iiiilin /IIIKIIOII In iiilejri.Kinn. 11 , In iii.i|)|iiii^ olnerveit i lion es iiilo ,1 fiiiiHioii.il 
lepieseiii.itioii, in elfeci iliis w.n .ipiilMii^ die 1 e\e.ileil-|itelei elu e .ippnsuli to oui 
jnohliiii .Siiiiil.il Ic. Deliieu (lil.-itt) showed, wiih ,1 iigoioiis [iniof, lli.it .1 'pie- 
oideiiiij;,' 01 ,1 I oiisisu-nl. Ii.iiisiiive sei of prefeieiues. iinplic‘s .1 nlilitv fiiniiionili.il is 
mil, (lie np III ,1 liiie.ii 11 .insloi in.iiion Kiisi h s t l‘l,’)<l) .in.di sis .ilso pim eeded in .1 siniil.n 
spu n 1 lieu- ,iie .ilso otheis who h.ive iii.ide signifii .nil t onii ihiiiions lo this w.n of 
lese.irching ilio piohleni. such as .Stroi/ fllt.'ifi) and Hirshleiler (I 97 ll). whose loniri. 
huiioiis will Ih- discussed laiei 
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with ;i (onsiaiu iiiicicsi laic- > for all I. Evidently, as we noted above, C, 
^ when I = 1 ), so that when we liiid the rate of’ inieresl at which 
(he household chooses a level consumption stream (i.e., zero savings 
iiom I it), iheii T/ = )fl, and the value of r) has been revealed. 

Similai K . if ' ''1). as it usually is, f,', > C'o- If one has independent 
inlorniatioM on how lapidlv declines with increasing C (.see 

Baile\ el al I OHO), one can estimate r) simpiv by examining the chosen 
growih p.iih ol consumption and finding the r) that reconciles it with 
cijiiaiion (8) 

V. Uncertainty 

I limit’ll iheic <nc some notable extepiions. such as Ramsey (1928), 
•ind nioie icccnib Siiglci .md Betkei (1977), most economists appear 

10 lake ii loi gained ihai lime preferente is positive. .Sometimes this 
(OIK lusion seems to test not so ninth on the erioneoiis inferences we 
desdihed in Set I ion III as on the notion that the individual is uncer- 
lain alioui the d.iie ol Ins own death and will therefore discount 
piospedive ntihn because ol the possibilitt he will die earlier than 
e\pe(ted I fiis aigument is (|uiie unsatisfacior\ for at least two rea¬ 
sons 

fiisi, uiueriamu ahoul ifte date ol one's death (.m increase as well 
.IS (let lease an mdnidual's provision lot the luture. because il can 
(le.ue ,111 inteiitnc lo save inoie to provide iindiininished consump- 
iion III the eveiii ol .ill unexpededK long life. If income could 
( osi lessh he ( onveried inlo an .iiinuilv. I his would not he ihe case, bul 
ideal .iniiuiiies al e not available, and m less developed economies they 
often t.ninot he [mu based at all. It is an einiiiiital (|ue.siion whether 

11 IK ei I anil \ .ifioul die lengih ol one’s hie int i eases oi deci eases |)rovi- 
sioii loi the Inline. 

set ond ic.istm whv lint ertainiv .ihout ihe dale of one's death does 
not enlail positive lime prelereiite is evitleni Irom the laci that there 
is noi m.illv posiiive s.iv mg in the sot lelv as .i w hole. Even if there were 
negative nine |)iefeicute over the lile tvtle. that is, a preferente for 
uiihiv III laiei .is against earhei ye.ns ol lile, a sot ielv with no popula- 
iion gioKih Ol let hnologital .idvaiue would have no sot ial saving 
whatevei il eveivoiie tonsuined his entire oul|nil ovei the tourse ol 
lile: The s.ivmgs in one set ol vears would just et|ual tlissaving in 
anofhei. in lad, the developed .sotielies have saved enough to 
provide lot a tonsideiable long-term giowih ol tangible capital pet 
workers as well ,is a vast mt rease m the amount of education and 
human capital. I his substantial social saving had to be due in large 
pad to a desiie lo he(|ue.ith intiea.sed tonsumption opfxntunities to 
descendants. 1 he tlesiie to lx(|uealh assets lo de.scendants provides 
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another reason why the argument that uncertainty about the date of 
one’s death implies positive time preference is inadequate. 

There is also uncertainty about whether the assets obtained 
through savings will generate the promised yield or even preserve 
their value. Such risks lower certainty-equivalent rates of return, and 
so arc different and distinct from time preference. Risks may Ire 
reduced through diversification of |>ortlolios, but transactions costs 
limit the extent to which risk can be eliminated in this way for the 
household of average means, and there are in addition great prob¬ 
lems for such a household in diversifying against the risks to properly 
in any single country (or single set of countries that might have a 
common destiny). Thus the risk that investments will prove unre¬ 
warding would appear to reduce saving, ju.sl as a positive rate of pure 
lime preference would. 

Uncertainty nevenheless also has Iwcj im|>oiiani countervailing ef¬ 
fects that have been neglected, if noticed at all, in the literature. I'he 
first of these ari.ses from risk aversion and from the absence cjf futures 
markets atui itisurance markets for most goods The absence of fu¬ 
tures tiiaikets makes it impossible for a tvpical household to hedge 
against a loss of clematid for whatever speciali/ed skill tt has to olfei. 
or against the loss in value of whatever other assets it |x>ssesses. or 
agaitist a deprcssioti. Similarly, a typical household cannot insure 
against the birth ol a chtld that will never lie capable of suppoititig 
ttscif or eveti fitlly insuie against a loss of the skill or health needed to 
genertite tncoitie. These utihedgeable and uninsurable risks to the 
total iticottie-eaitiitig capacitv of a household, in combination vviih 
risk aversion, entail an incieased incentive to save. In other words, 
even uncertainty about the securiiv of assets tan work to intrea.se as 
well as to detiea.se savings, because in unfavor.ible toiiimgeiu les 
assets contribute a large iiicieineni to prospective tiiiliiv. This iinpli- 
latioii IS corroborated even by evervdav language, .is when [leople 
speak of "saving lor a lainv day." Second, the greater the amount 
saved, the smaller in general w ill Ik- ific- raliool the li ansae lions costs 
of diversifying the jjorfolio or protcxtiiig it iii other veavs-' to the value 
ol the portfolio. In summarv. iincerlaintv niav reduce the expected 
.ipparent rale of reiurii to saving, yet at the same lime m the absence 
ol univeisal f utuies and insurance markets it tan increase the “need" 
or incentive lor saving, that is, its pios|K-tlive ulililv. Because, in 
addition, uncertainty may lavoi accumulation of enough wealth to 

Iti itie < a.sf (ft heicstauilcU u'ho.se .csceix jic .tcc|uireft tn illegal (»i iiri^Hipiil.ir inraiiv. 
Ill . hull Iksoiik* so laigt’ llu’v geiuTau* noionc-iy. a<lcliiion.il saving niav if anvihing 
incicasf ihc- (xissibililv ot proseciilion. cxlnriioii. oi adverse ^xiliiieal deeismiis N'ei 
cOIKealinent and diversificalion van redme these risk.s also, perhaps iinrre cavils ,is 
Hcallh glows largei. 



JOURNAl, OF POLITICAl. ECONOMY 


1 a 

make diversilk ation oi other forms of portfolio protection practical, it 
nceci not lowei even the apparent rate of return. 

I he tiiicsiion o( v\hether those uncertainties that imply the same 
behavior as positive time preference are ciuantitatively more 
signifit am ilian iho.se that imply the same behavior as a negative time 
pieferente is an empirical tpiestion. Yet to the best of our knowledge 
il has noi even been addressed. Nor are we aware of any persuasive 
empirical liieiatine on whether other possible influences on saving 
behasior. such as ii latioiiality or perceptual bias, combine with the 
two opposing effects of nnccilainty to create an appearance of time 
]jiefeience that is on balance positive, oi zero, or negative. 


VI. Is Time Preference Positive? 


Ihe aigmneiii in pi ioi sections helps us address this question now. 
V^■t' sliall see in the ne\i section tli.it there are rea.sons lor assuming 
dial some households can have ;m indehnitelv long or infinite time 
hon/on. and th.ii die conclusion we shall draw with this assumption 
holds a foi iion when there are shorter planning horizons. 11 17 = 0 , 
liom equation ( 8 ) we ha\e 


n'd'd 1 

n'ff.,.) (1 + >y 


( 9 ) 


11 we then extend the time lioi i/on to latei and later t tuid tipply (9) to 
the "last " period in the future, we obtain 


lim = 0 . 


( 10 ) 


fills lesnlt inqilies that with the continuing ]iositive interest rates that 
tioi mally [irevtiil and an 17 of/eio, the tioiisehold would lower to 
the despetation level, whete f'lfo) = d the ('.j attainable under the 
budget constraint (7) is below the saltation level, at which — 0- 

Inasmuth as we normally do not ob.serve eithei people who cut 
consum|)tion down to utterly ab)ect levels in ot tler to provide for the 
future ot people wlio have because of past saving reached consump¬ 
tion levels at which theii taste lot goods is .satiated, there appears to be 
a positive rate of time iireference, that is, in our formulation an tj > 0 . 
(We discuss latei a possible escape from this conclusion if there is both 
an inhnite time horizon and what we call “drastically diminishing 
matgitial utility.") 

W’e have shown eatlier that r is normally greater than 17 becau.se 
capital is u.sefiil in production, and the pre.sent argument suggests 
that if ) exceeded 17 by a very large amount that would also generate 
implausibly low levels of present consumption. This suggests that, at 
least in those societies where real interest rates are very high and 
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expected to remain so, rj must exceed zero by more than a miniscule 
amount." 

Before turning to finite planning horizons, we should examine a 
familiar policy conclusion which assumes an infinite time horizon. 
This is the prescription of those who claim or imply that, at least when 
ecological systems or depletable re.sources are at issue, the only right 
and moral attitude is, in effect, a zero rate of time preferenc e and an 
infinite time horizon—the interests even of distant generations 
should have the same weight as one’s own interest in current-year 
consumption. One need not have a what-has-posterity-ever-done- 
tor-me attitude or a shortsighted view to oppose this supposedly 
moral command. It implies, as a glance at (10) will remind us, that at 
every positive interest rate the household should reduce its consump¬ 
tion to the level at which the marginal utility of current consumption 
is infinite, in order to give all to the future. Of course, that applies to 
evety future year, and to every future generation, until either income 
1 ises so higfi that the marginal utility c>t consumption (when con¬ 
sumption ec|uals income) has fallen to zero, or alternatively there is so 
ntuch citpital arouttd that the interest rate is no longer positive. VV'e 
dottht that the proponents of a zero time preference and an infinite 
time Itori/on have imderstood the implications of their argument and 
coiiclucle that their argument sureh cannot be a proper basis for 
n.tlional policy.^ 

'1 he existence ol positive time preference also has immediate impli- 
catioiis (or the responsiveness of s;i\mgs to changes m interest rates. 
We earlier defined tj as zeio lot a household it. when it had ecpial 
endowments in each period and faced an interest rate of zeio. it chose 
a stable level of c<)tistim|)iioii. A positive tj m these conditions, or an\ 
rj greater than the iiileiest rate, implies a decreasing level of con¬ 
sumption through borrowing or clis.saving, whereas an r > r) implies 
net saving. If 17 = r at .ins level, there is a stable level ol consumption, 
.Starting f rom suc h a situation and looking at ecjuation ( 8 ). we ask what 
would happen il the interest rate fell to zc’io, I he analogs with the 
lesults 111 e<|uations (fl) and ( 10 ) iinmc-diateK reminds us that this 
would give us 


n'lfn) _ (1 + y)' 

r'(C„) 1 

As / gels intlefmileiy long, this would give 

Inn —= *■ 

/-» V ((.„) 


( 11 ) 


(12) 


" VVc aie ilunktiil u> I’elfi Murifll lor calling thcs point 10 our .itu-iilion 
’ Foi a iclaled tiiuliiig, .sec llu- KiHipnians (ItRiO) articlr diMiisM'cl txrlocc. 
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Wc see tliat either f-o increases until v'(Co) = 0, or v'(Ct) decreases 
until T’'(Cr) That is, the ratio CJCi will become huge: The 

lamily will dissave on a large scale, unless it somehow suffers a per¬ 
manent plight so severe that v'{C,) goes rapidly boundless with a small 
reduction in C,, or so near satiation of the capacity to enjoy goods that 
0 quickly. This shows that it is logically contradictory to hold 
that the representative household is myopic or for any other reason 
has positive time preference, and also to hold that savings do not 
lespond to changes in the interest rate and would normally remain 
posiri\e at a zero interest rate.'* (In fact, saving would be sensitive to 
the rate of interest even if time preference were negative and, since 
the marginal utility of consumption is normally diminishing, also if it 
weic zero in ati\ environment in which endowments vary over time.) 
We think this implication may surprise many readers. It will be clear 
from the next two sections that this conclusion, like the others, also 
holds true with finite time horizons. 

VII. Truncated Time Horizons 

.Most significantlv, the case that there is positive time preference— 
that IS, something hevoiid Bdhtn-Bawerk’s “first cause"—gets 
stiongei as the planning horizon gets shorter. The reason for this is 
utterlv obvious once staled: A finite planning horizon of T years 
implies that whaievei happens after 7' years is not only discounted but 
given a value of zeio. Were there not usually markets in which one 
can trade with tfiose who.se time horizons extend farther into the 
future, ati individual wemid place no value at all on an asset, such as a 
bond, that could be redeemed or .sold onlv after the end of his 
planning hoiizon. Truncating a decision maker’s time horizon can 
therefore onb strengthen the evidence for a positive time preference, 
since it merely involves attributing a zero value, rather than a dis¬ 
counted but positive value, to any consumption after a certain date. 

The fact that truncated time horizons lead so directly and simply to 
positive time preterence makes one ask how common and how long 
truncated time hot izons might be. Are observed savings rates, which 
as we have shown ate fat lower than a zero time preference and an 
infinite lime horizon would imply, as low as they are simply because 
the behavior we actually ob.serve is driven by time horizons on average 
about a lifetime or .so in length? 

1 he notion that time horizons might average about a lifetime in 


" VVe- have in mind a (i)in(K'nsaied saving schedule, along which household real 
income i,s conslant, so (hai ihe elicci of a change in the interest rate is a purr 
sulistitution effect. 



positive time preference 


•5 

length has some appeal at first glance. It certainly must be an overes¬ 
timate for older people without heirs or other survivors about whom 
they are concerned. Probably it is about right for some others too, 
who for one reason or another may feel that the cares of one lifetime 
are enough. The number with concern for heirs is probably larger, 
but it appears that mainly this concern extends only a generation or 
two into the future. Even in dynastic families, such as that of the 
Capetian kings of France, we may wonder whether they thought 
ahead much beyond producing-a male heir and preserving his birth¬ 
right. Taking the different types of cases together, it might seem 
that truncated time horizons might average out to jjcrhaps a lifetime, 
and that this is the main explanation of the positive tj that we inferred 
from the small percentage of incomes that are saved. 

'I'his interpretation is nonetheless unsatisfactory, as we realize from 
our earlier reference to the fact that lifetime horizons imply no saving 
by sot ieties, unless the age distribution for a time contains dispropor¬ 
tionate numbers of people in the “savings” stages of life. There is 
.substantial social saving, especially when we include education and 
human capital as we must, that is not explainable by growth, so the 
typical time horizon must be more than a generation in lcngth.“ 

This consideration in turn means that there is no special need to 
know the rate of time preference within planning horizons of a 
lifetime or less in letigih. Though individuals with time horizons as 
short as this presumably exist, the evidence adduced above suggests 
that they are exceptional. It is conceivable that these exceptional 
individuals could have a zero or negative time preference over the 
coui.se of their planning horizons. If .so. there is a remarkable discon¬ 
tinuity, for we know from the definition of a finite planning horizon 
that nothing that occurs after it is even worth taking into account. The 
dominant fact is that the time preference for consumption within the 
planning horizon is infinite compared to anything that happens after 
it. Whether one is interpreting aggregate time series on saving and 
accumulation or analyzing social policy alroui them, this infinite time 
pieference is the dominant fact, besides which the presence or ab¬ 
sence of time preference within the life span pales to insignificance. 

It is indeed fortunate that time preferentc within planning hori¬ 
zons of a lifetime or less in length is not the key issue, for (contrary to 
what we once believed) it is almost impossible to find out what time 
preference over the life cycle might be. Family size and optimal 

C.iowth due lu uxhiuiloj^ical advatice iii what wciuld otherwise be a steads state alsti 
mi|jlie5 some net social saving, Ix-cause young savers have higher lifetime incomes than 
have older people, so that their total saving exceeds the dissaving ol' retired people. 
How'cvci, White (1978) and KotlikoU and Summers tin press) find that total saving far 
exceeds the amount that ran be attnhiited to life-cycle saving in a growing economy. 
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periods for major iiivesiments in human capital change over the 
(ourse of life, and though it is easy to incorporate these considerations 
into our theoretit a) framework, the changes in spending and savings 
levels ihev hring about are difficult to disentangle from any short-run 
time preferent c. Kven the individual’s capacity to enjoy consumption 
mav vary vvith age, as is suggested by the cliche that one should not 
postpone high levels of consumption until one is “loo old to enjoy it,” 
In most households the wage rate also changes substantially with age, 
ami this :igam affects the level of savings in each period, fhere are 
even problems arising bet aiisc of changes in the value of time due to 
these changes in wage rates. The final outputs desired by consumers 
are characteristicalh pioduced through household production func¬ 
tions in which time and market goods are inputs. As Becker and Ghez 
(197")) have sfiown, dianges m wages ovet lime can affect savings 
levels in ;it letist two significant ways. First, the higher the vvage rate 
the gieater die imentive to substitute purchased goods for lime in the 
dotiiestii ptoduction of final outputs, as may mcur wlien a family 
increases its leliance on valercTs, convenience foods, and restaurants 
when vvage rates of family members are unusually high. Second, the 
anticipation of a dif ferent opportunity cost of time in the future can 
(hange the lime profile of final consumption and savings, so that 
there is more <otisuntption in periods when a lower t ost of time makes 
the price of final output (heaper, ;is could occur when a family 
[lostpones the spending involved in distant tourism till reiiremenl. 
(iiven tlie unknovsn and someliines tontrarv effects of these and 
othcT influeiKes, it is not immediately deal what levels of savings 
would sujipvirt oi ielute the hypothesis that lime preleience is posi¬ 
tive within lime periods of a liletime oi le.ss. 


VIII. Bequest.s 

Let us now i elm n to the more rejiresentaiive case of a household vvith 
lieiis loi w'hidi it h.is someioncein What a loiiple during the couise 
of life gives to its diildien ,ind what it betjueaihs to them has a ca))iial 
value etjual to the disioimtetl future value of the consumption (;md 
later bequests) it is exjieiied to jirovide. In the minds of the givers this 
lx-£|uesi mu.st have a prosfx-ciive value oi utility at least etjual to the 
utility forgone bv not consuming it tfiemselves. 'file extent of any 
w illingness of tlie liousehold to reduce the heads’ cut rent cotisump- 
iion to provide for their children’s endowments is the central param¬ 
eter revealing the pre.sencc or absence of the “myopia” discus.sed in 
debates about the determinants of .saving. 
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A household with the faith that it will have a continuing line of 
descendants with the same regard for their children as it has for its 
own will act as if it had an indefinitely large or infinite time horizon, 
even if it gives such a long time horizon no explicit attention. Supjjose 
that the household spends that amount on educatitm, gifts, and be¬ 
quests that strikes its preferred balance between its own desire to 
consume and its provision for the children's future, and leaves it to 
the children to allocate their resources in turn. When the problem is 
understood in this way it becomes clear ihat there is an extensive 
iheorelical literature that is direttly applicable, notably that of 
dynamic programming. One of the basic findings in this literature is 
that the optimal program for several periods is the same as that 
reached by optimizing each period in turn, including setting optimal 
end-of-period conditions each time for the beginning of the next 
period (see, e.g.. Bellman and Kalaba 1965, chap. 2): the result is the 
same whichever way one looks at the problem. I hat this notion 
applies precisely to intergenerational lesouicc allocation is shown by 
Barro (1974). It can accordingly be appiopriate to view the choice of 
the size of the bequest as if the household were maximizing a utility 
luiiction of the form ( 1 ) with ati indefinitely large, or infinite, time 
horizon. 'I'he /' represents not how far into the future people plan 
(onsutiifition, but the length of a dynamic program implied by a 
continuing series of intergenerational tratisfers. 

l.et us now suppose that the conditions that make it appropriate to 
assume an infinite lime horizon do not hold, so that finite lime 
horizons such as we used to develop ecpialioiis (l)-( 8 ) are appropri¬ 
ate. If the present heads of household are utierK unconcerned aixmt 
am descendants beyond, say, theit giaiidthildren. this lack of con- 
leni will manliest itself in the form of a smaller beijuesi than other¬ 
wise. LhiceriaiiUy about whether a subsecpieni gc-neraiion will have 
offsjn ing, or fail to make a bequest to its children, will also make tor a 
smaller bequest. I'he effect of .i lack of concern for more distant 
geneiaiions, or of uiueriainty alMiiit whether the Ix-cjiiest will in fact 
serve the givers' purposes, is approximately the same as if the house¬ 
hold applies a high rate of time preference 17 in maximizing utility ( 1 ) 
with a large or infinite T. We can see this in taking etpiation (1) and 
making n(fo) a declining I unction ol time, for example. with i’, < 
0 . I’his is in effect e(|uivaleni to keeping ( 1 ) as it is and raising the 
value of 17 . This makes it clear in another wav that less-lhan-infinite 
time horizons entail positive time preference. Thus the decision to let 
7 ill equation ( 1 ) be finite at some points, and to let it approach 
infinity in Section VI, was governed largely by analytical convenience 
and had no substantive effect on the conclusions. 
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IX. Drastically Diminishing Marginal Utility 

With infinite time horizons and continued positive interest rates the 
returns to sav ings ultimately become so colossal that, if there were not 
(wsitive lime preference, current consumption would be drawn down 
to survival levels. There are, however, two phenomena in addition to 
positive time preference that conceivably could explain the high levels 
of consumption that actually occur. One is that such large increases in 
endowment income are expected that saving for still more future 
consum])tion is not worthwhile however much interest would ac¬ 
crue."’ The other possibility is that the marginal utility from con¬ 
sumption in any period diminishes so drastically as consumption rises 
that even essentially unlimited interest income would not yield 
enough f uiiirc utility to justify much saving. There is no reason for 
most households to assume increases in endowments so great that 
they could justify the low observed savings rates. It is, on the other 
hand, just possible tftat drastically diminishing marginal utility in 
combination with plausible expectations about increasing endowment 
incomes, pet haps due to technological advance, could explain the 
observed levels of saving. 

.Sufjposc-, fur example, that the hou.seluild expects an exponentially 
t ising real income due to technological advance, even if its saving and 
aggiegate savings ate zero. Equation (8) implies a chosen rate of 
growth ol consumiiiion that is independent of the horizon 7'; this 
growlh will be exponential if marginal utility is a constant-elasticity 
lunction, ihai is, if utility has the form 

cfT) +5 (13) 

lor O' < 0, .-f < 0, and B > 0, or. for the unit elastic case, the form 

T'ff;) = d InC +/f (14) 

with A > 0 (see White 1978). For Uarrod-neutral technological ad¬ 
vance, there is some Iraction ol iiuonie saved that is consistent with 
having consuinpiioii grow exponentially at the same rate p as income 
does. If so we tan have v'(C.r) —» 0 as 7" —» because of this growth. 

for example, suppose that utility is given by the Bernoulli function 
(14). I hen the oiiiiinurn consumption pattern implied by (8) is 




(1 + 0 ' .. 
( 1 + 7 ,)' 


(Ih) 


for eveiy I. If it should happen that 1 + p = (1 + f)/(\ + tj), or, 
approximately. 


Wc are giaietul to Chnsioplier Clague fcit calling cjui attciiiiuii to this pos.sibility. 
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(16) 


consumption will grow at the same rate as income and will remain a 
constant fraction of income, as is actually observed. This pattern can 
be sustained indefinitely and can plausibly be reconciled with a con¬ 
stant rate of interest (see Solow 1970, chap. 5). 

If the rate of time preference were zero, equation (15) would imply 
that consumption rises indefinitely at the rate r. When 7 > p, as has 
been true of the United States, this high rate of growth of consump¬ 
tion can be mainuined for large T only by pushing C'o down toward 
zero, pushing the fraction of income saved toward unity. 

This set of results and considerations makes a compelling case for 
the existence of a positive rate of time preference, unless one sup¬ 
poses that the utility function v(C) is inelastic and that unlimited 
exponential growth of income is expected even with zero saving. In 
the past, output per unit of input has grown at about 1 percent a year, 
and the U.S. birthrate has converged on zero population growth. If 
the real rate of interest of about 5 percent (which has prevailed 
during most of the past several decades) is what households expected 
to continue into the indefinite future, these figures tan be shown to 
reconcile with zero time preference only if the elasticity of the utility 
function (13) is minus 5 corresponding to a = -4. This function has 
what we shall label “drastically diminishing marginal utility." We do 
not argue that utility functions of this character are the norm but 
know of no data that rule them out and think it is not without interest 
that (when exogenous increases in endowments are expected and 
rime horizons are also infinite) the facts of saving force us to choose 
between positive time preference and drastically diminishing margi¬ 
nal utility of income. 

X. The Relationship to Prior Findings 

Though many of the prior findings on time preference contradict one 
another, and there has been no generally accepted analysis of the 
matter, many of the.se writings nonetheless have considerable value. 
With the aid of the findings here we arc in a better position to 
understand or appreciate some prior contributions. 

One of the best-known contributions is that of Stroiz. (1956). The 
extreme simplicity of Strotz's insight and its relationship to our argu¬ 
ment can be made immediately apparent. Suppose that an individual 
has a positive time preference, in that he values consumption in the 
future less highly than consumption now, but it is only the distance 
into tne future, rather than the specific date, that determines the 
weight given to the utility. Suppose for the sake of a simple example 
that the marginal utility of consumption in any one period is the 
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sanif at all k vt ls of consumption, however high or low, so that we can 
igtioie the climiiiishing marginal utility of consumption and focus 
exclusively on Sirotz's insight. Let us .suppose that our individual must 
obtain all consumption out of an initial endowment of 25 and that his 
time preferences are such that he ch<} 0 .ses initially to consume 20 
imiis, or foul -fif ths of his .stock, in the present period, three units in 
the second period, and two units in the third. When the first period is 
over, however, the individual will abandon his initial plan and con¬ 
sume four-fifths of his remaining endowment of five, or four, sitice by 
stijmlaiion he weighs the utililv in the present so highly he consumes 
foin-fifths of his stock, and period 2 is now the present. This is of 
coui se inconsistent with the initial plan for consumption in the second 
and third pericjcls; ihe initial plan has not been followed, even though 
all of the individual's expectations about his situations and tastes have 
piovecl collect, Ihe point is that no individual with the pattern of 
time piefeieiice descrihed will voluntarih adhere to the plans he 
makes in advance', it is impo.ssihle for him to maintain his i>aitern of 
rime prelerencc' and also adhere to the intertemporal plans he makes 
at the fteginnmg. It becomes obvious on leflection that the only 
patrei ns ol positive time piefeience that individu<tl.s can have and still 
make consisient cn rational inteitempoial choices are those which 
involve a eonstant exponential rate of discounting; a constant r) such 
as we have' all along assumed. If an individual plans to save the same 
jiei c eiiiage ot his period 2 stoc k as of his period 1 stoc k. he will in fat t 
cai'M tliroiigh vcitti ihai plan it his tastes do not change. Of course, no 
positive time pielereuce at all (t; = 0) is consisient with rational 
beliav ioi; it would have led oin livpoiheiital individual lo consume ;i 
thud of ills stock m each of the peiiods and to have adhered to that 
)jl,m. 1 hus we see that it is not eonsistent lo assume both rational 
bebavioi and po.siiive time prelt'rente vviihoul also assuming that 17 is 
an exponeiu ol constant value. 

I'liloi luii.itelv, Siiot/ .goes hevoncl the in.siglit just de.sciiheci lo 
delme "prudent" 01 “ihiiftv’' hefiavioi as anv behavior which meets 
the standard ot intertemporal c onsislenc v : An individual is arhilrarily 
dehnecl 10 be "ibrifty " if he discounts the Ininie al a constant expo¬ 
nential rate, however ftigb that rale niav be. In keeping with his 
pteoecupalion with perceptual error, iiuctnsisient choices, and the 
psyc hoiogv of mieriem]X)ral c lioices, Siioi/, cone hides that "consumer 
sovereignty has no meaning in the- context ol the dynamic decision¬ 
making problems" (111.56, p. 179). 

Though Sircvtz's distuihing conclusion is consistent with the rest of 
his argumcTtt, we can now readily see that it is not in fac I justified. If a 
positive time preference, which he calls "myopia,” were solely the 
result of it rationality and perceptual errors, then consumer 
sovereignty would indeed have little or no meaning for intertemporal 
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choice. But a.s we have seen, the absence of a positive time preference 
implies extreme behavior, or else drastically diminishing marginal 
utility. Short planning horizons can tKcur and need not imply any 
irrationality, and the very act of ignoring any return beyond some 
limited planning horizon as we know entails a positive time prefer¬ 
ence. fhus, though the intransitive and irrationally impatient behav¬ 
ior Strotz illuminated can of course al.so occur, his nihilistic general 
conclusion about the inapplicability of consumer sovereignty to in¬ 
tertemporal choices is incorrect. 

Our analysis is in closer atrord with writings of Koopmans. Dia- 
inond, and Williamson (1964) and Hirshleifer (1970). These writers 
also conclude that there is positive time prelerente, whit h ihev dehne 
somewhat differently. They do ntit, however, offer an\ “empirical 
test” of the sort we used, nor go into shorter time horizons and 
clrastitalK diminishing marginal utility. Koopmans el al. (1964) prove 
that various sets of formal postulates entail the existence of positive 
time jireference. I he (ormal po.stulates are naturallv chtisen in pan 
betaiise of their tractability, and it is not always immediateK clear 
wlietliet these postulates are very restrictive or succeed in capturing 
the essence of the matter. It is not immediateK dear, lor example, 
whether Diamond’s (196.5) assumption that each period’s utility i'( •) is 
bounded is limiting, or even material to the jiroof, or whether it 
matters that these authors exclude the possibilitv that = x or 

that 7' {(>t) — 9, which W't' allow. I he fact that more* work nt*eds to be 
done belore the generality and appluabihtv of the Koopmans (1960) 
and Diamond (196.5) pioofs tan be delunleK established is perhaps 
best demonstrated by quoting Koopmans’s briefest summarv of the 
le.ison for positive time preference m Ins article " . , if theie is m all 


(U( uinsi.uues a preference (or posipoinng salistailion—or even 
neulialily toward liming—then there is not enough room in the set of 
re.ii numfiers to actommodate and label numericalK all the tliffeieni 


saiistaclion levels that mav occur in relation to consumption levels for 
an mlinite future" (1960, p, 288). Hirshleifer (1970. p. 96). using a 
fine of rea.sonmg much closei to ours, concludes in an im[ 3 ortant if 
<i\piit passage that zero time preference wcmld imph infinite defer- 
lal of all consumption. I hat is closer to the mark but overlcMiks some 
of the possihilities. In general, if the argument in our paper is correc t. 
It suggests that the contributions we have just cited deserve more 
.mention and perhaps further extension as well. 


More recenllv, Siigler and Bee kei (1977) have pul forth conclusions 
that diiecilv coniradici those of .Strotz (19.56), KcHipmans el al. (1964). 
and f.nshleifer (1970). Stigler and Becker argue that main Iv|h-s of 
H havioi Ihat ccononiisls have naditioiianv characiiTizcd as sj.>ecial 
patterns of taste are in fact explicable in terms of dif ferent pric es and 
incomes. One of the ad hcK a.ssumpiions alvout tastes to which Stigler 
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and Bt( kcr espec ially object is the assumption, going back as we know 
to Bbhni-Bawerk, that people ‘“systematically undervalue . . . future 
wants.’ rhe taste for consumption in say 1984 is alleged to continue to 
shift upward as 1984 gets clo.ser to the present. In spite of the impor¬ 
tance freciuently attached to time preference, we do not know of any 
significant behavior that has been illuminated by this assumption. 
Indeed, given additional space, we would argue that the assumption 
of time preference impedes the explanation of life cycle variations in 
the allocation of resources, the secular growth in real incomes, and 
othet phenomena” (Stigler and Becker 1977, p. 89). 

It might seem that the <iuoied passage by Stigler and Becker con- 
tiailicis oin own findings no le.ss than those of the authors we have 
cited, since we also claim to have demonstrated the existence of 
positive lime pteference. 'I'liis is not in fact the case; one could 
consistently accept Stigler and Becker’s conclusion and our own as 
well. We have defined positive time preference in a slightly broader 
and more readily testable fashion than Stigler and Becker have, and 
w<- neithei assumed nor tilled out irrationality or perceptual error, 
.\c( ()rdmgl\, it is, as we argued, an empirical question whether there 
is time preference, and the revealed preference approach made it 
possible to assess the evidence. It would of course be possible to 
discuss the c|uestion of why bequests are as small as they are, or why 
some people have only finite planning horizons, with difiereni ter¬ 
minology, thougfi excluding the phrase “time preference” and 
sMionvms for it would in our judgment involve circumlocution.” 

XI. Conclusions 

Our [irincipal findings are the following: 

1. A finite time hori/.on implies that only consumption inside the 
hoi izon has value and signifies an absolute unwillingness to transfer 
goods to children or other destendants who could benefit from them 
after a si>ecilied future date. Thus a finite planning horizon is both 
theoretically and practically ecjuivalent to a positive rale of pure time 
jiitference. Planning ftori/ttns that do not reach beyond the lifetimes 
of those who make tfte plans can exist, and (though this implies a 
remarkable di.scontinuity) it is logically possible that within such 
lifetime horizons time prefereiit e could he zero or even negative. But 
this possibility can have little significance in comparistm with the 
high rate of time preference implied by the short horizon, and in any 
event it does not seem feasible to di.sentangle time preference during 

" Siiglei and B«< kcr'v foulnotc assertion that "a consistent application of the as¬ 
sumption of stable preferences implies , the absence of lime preference" (1977, p. 78, 
n, 4) Ls cither mistaken or dependent upon definitions of "stable preferenees" or "time 
preference” that ap|>eai in be untypical. 



POSITIVE TIME PREFERENCE 23 

the course of a lifetime from other changes over the life cycle. Time 
horizons which imply no bequests for descendants also cannot be the 
norm, since they would imply lower savings than are observed and no 
social saving whatever in a society in a stationary state. We have also 
shown that finite time horizons and positive time preference are 
functionally equivalent, so the most interesting question is whether 
there is positive time preference within an infinite time horizon. If 
there is, there is no need to go into the length of planning horizons, 
and the analytically convenient assumption of an infinite time horizon 
can be used without significant loss of generality. 

2. An infinite time horizon and a zero rate of time preference 
suggest that (if the foreseeable rate of economic growth is less than 
the expected long-term rate of interest) current consumption would 
be no higher than the subsistence level. Over an indefinitely long time 
span continued positive interest rates imply such colossal gains in 
future consumption from reductions in current consumption that all 
income beyond that needed for survival would be saved. 

3. There is, however, the alternative possibility that even vastly 
higher consumption is expected to bring so little increase in utility 
that hou.seholds are unwilling to forgo consumption now, even 
thotigh this can generate essentially limitless increases in consumption 
later. The possibility that there will be absolute satiation at these 
future consumption levels can be rejected on the classical grounds 
that wants are limitless, but there remains the intriguing possibility 
that there is what we here labeled “drastically diminishing marginal 
utility": Even boundless increases in future consumption W'ould then 
bi ing such modest increases in utility that the low levels of saving we 
observe could be rational even with a zero rate of time preference. 
The observed levels of saving imply pcjsitive time preference, or 
drastically diminishing marginal utility, or both of these interesting 
phenomena. In an age that has seen secularly rising incomes, it might 
seem that drastically diminishing marginal utility is a type of positive 
time preference, since an unwillingness to bequeath much to descen¬ 
dants, on the grounds that they will be better off than we are and will 
not get much .satisfaction from still higher consumption, provides a 
good rationale for preferring current consumption and leads to the 
same observed savings levels. This is not in fact correct. If there were 

, a sufficient technological regression or other losses in endowments, 
yet a positive interest rate, zero time preference would imply that 
- current cotisumpiion would be reduced to subsistence levels, but 
, drastically diminishing marginal utility would not. Similarly, if for 
shifts of goods within their cjwn lifetimes people behave as if they had 
: Bernoulli utility functions, intertemporal consistency would require 
the use of positive time preference to reconcile this behavior with 
; their gifts and bequests. Also, whether drastically diminishing utility is 
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pti'scMii oi not, finite planning horizons entail positive time prefer¬ 
ence. Drasticalh dimini.shing marginal utility also may be needed to 
explain oilier phenomena, siieh as a willingness to pay a high price to 
insure against even modest losses or to reduce fluctuations in con¬ 
sumption levels, and savings levels in particular cases can be traced to 
drasiiialli diminishing marginal utility according as its other implica¬ 
tions are present oi absent. 

4. rncertainty about the future has double-edged effects. Of 
(ourse, if the tvpiral household thinks that there is a greater 
likelihood that part of its l)et|ue.st will merely be destroyed in a 
future catastrophe th.ni that the full becpiest adds to the probability 
dial descendants will survive well in that catastrophe, the expected 
rale of lelurii lo saving is thereby lowered in their niiiitls. 'fhai 
expectation, mslead of time preleience, could in principle account for 
high consumpiioii and love saciiig. However, each type of uncertainty 
vee can think of offers possible reasons for saving moi e as well as 
leasoiis lot s.iMiig less. .'Vs in the other cases, it is hard to see a simple 
wav to distinguish it empiric ally fiom pure time preleience. 'rhus for 
|)uiposc's of this in<|uii\ ue htne niiicle the simplest assumption; that 
uiueiiaintv about whether anv in.irginal savings will be lost is on 
aveiage about ollsi't i)\ the incentives foi additional saving that un- 
(eit.iiiitv <ibout the fuiuie cieates. To make a belter sjK-’cilicatiori 
|)()ssil)le m anv lurihei studies, we have made clear above that the 
jirevalence of futures ;mcl insui.nice niatkets, the degree of risk 
.ivetsion, the* degiee ofConlidence in predictions about the futuie, 
.mil the costs of cliveisific alion or pioiection of'poi ilolios c'ac h has 
distinct effects on savings r.iies. and these might eveiiluiilly be esti¬ 
mated. 

In sumimiM. the case lor positive- tune |)releience is abscilulely 
(oinpellmg. except m I tie inieresiing case where there is drasiicallv 
(iiimnislung m.iiginal utihtv I'liliiy functions of this sort htive in- 
tc-resimg iiii plications of then own which need to be tested septirately. 
lime ptefereiice is accorthngh by no means a .suh)ect that deserves 
only casual references to allegc-d peiceptiial eiiois or erronc-ous in- 
leiences from the existence of positive interest rales. It is a ])henonie- 
non of great ptactical and theoretical iiri|xHtaiic e that is fotlumitely 
c|nile aineii.ible lo .nialvsis. 
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Temporary Income Taxes and Consumer 
Spending 


Alan S. Blinder 

Vnnmjit ii/iil Sntional Hmrau «/ Krunimiir Rneaich 


Bolli ccoDoniK theory ;m<l casual eiupinta! observation of the U.S. 
econoinv suggest that s[x-nding projtensities from temf)orary tax 
(.h.inges are stnaller tliaii those fiom permanent ones, but neither 
provKies tnnch guidance aliout the magnitiicle of this difference, 
f fits pa(7er offers neic eiiipincal oiirnaics of this cliffeience and 
finds It to lie c|uiie substantial. The analysis is based on an amend¬ 
ment of the sianrJard distributed lag version of the permanent in¬ 
come hvpoihesis that distinguishes temporary taxes from other in¬ 
come on the giouiids that the forinei are "more transitory." I his 
amendineiii. vvliuli is hioadly consistent tviih rational expectations, 
leads (o a iionlitiear consumption fimciioii. I'hough the standard 
eiroi IS unavoidatilv laige, the point estimate suggests that a tempo¬ 
rary i.ix change is treated .as a 50-.'>0 blend of a normal income tax 
change ;ind a pure vcmdfall. Over a 1-ycai planning hori/.oii, a 
lemporai y tax c hange is csumaicd to have only a little mote than half 
the iinpaci ol a pennaiiem tax change of e()nal magnitude, and a 
lehaie is estimated to hace only alxuii ‘ifi percent of the impact. 


In 1968, laced with a classic case of demand inllation, Ciongress 
enacted a temporary iiicicase in personal income lax payments to 

1 fits paper has fx'iicfued from sciiiiiiar piesenlalions at Princeton, Hebrew Univer- 
sit> ol Jeiiisalciii, 7el-.^nv Ibiiversily. and the Ifiiiversiiy ol Pennsylvania, and from 
lielpfiil suggestions fioni Walter Dolcle, Mar)oric Flavin, Roger Gorefon, Levis Kuchin, 
Rofien Solow, (anies f rnsseli. and a referee of this /ouniaf. Most of the laborious 
iDinpulations were done expeitly by Suzanne Heller, additional research asscstanec was 
provided by Uavicl C.ucl, Robin Liiicfsey, and William Newton. Stephen Goldfeld and 
Richard Qiiaiidi gciierou.sly guided this novice through the intricacies ol ihcir non¬ 
linear opuinizaiioii routines. Financial supfxirt from tlie N'aUonal Science F'oundalion 
and from the Insiilule (or Advanced .Studies in Jerusalem, where pan u( this work was 
done, IS gratefnih acknowledged. This research is part ol ihe National Bureau of 
Kconciinic Research's program in economic fUinuations, but any opinions expressed 
herein are those of the aiitfior, nc* of the NBER. 

tfnurruii ijf PnUtuaf f fiimrmy, 1481. vol, 89, nif 1| 

C6, 1981 b> Ihe- t'liivcrsiu cif Cliua(jD (K«2.Sfl08«lrtl90taK8IS$0l 50 
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curb aggregate demand. In 1975, near the trough of our worst post¬ 
war recession. Congress enacted a ux rebate and other temporary 
decreases in taxes and increases in transfer payments designed to 
stimulate aggregate demand. Questions have been raised about the 
effectiveness of both measures. 

The questions have both theoretical and empirical roots. On 
theoretical grounds, the permanent income-life cycle hypothesis 
seems to argue that temporary income tax changes should have little 
ef fect on consumer spending in principle. On empirical grounds, data 
on consumer behavior seem to suggest that the impacts of the two 
temporary taxes on spending were also small in practice. In 1968, the 
savings rate fell from 7.5 percent in the quarter immediately preced¬ 
ing the tax surcharge (1968:2) to only 5.6 percent in the first quarter 
of the surtax, suggesting that consumers kept sptending despite the 
tax. In 1975, the savings rale ballooned from 6.4 percent just prior to 
;the rebate to a stunning 9.7 percent in the quarter of the rebate 
I (1975:2), suggesting that little of the rebate was spent. 

I The purpose of this paper is to study these two lem|>orary tax 
ichanges in some detail. Precisely what prediction does economic 
I theory make about the relative effectiveness of temporary versus 
; pertnanent tax changes? And what conclusions can be reached from 
U.S. time-series data? The fact that the Caiier administration asked 
for (but did not get) a repeal performance of the rebate in 1977 
suggests that there is more than academic interest in the answei s to 
these questions. 

Section I outlines the theoretical issues, beginning with an idealized 
life-cycle model and pnx'eeditig to introduce some important “real 
world” considerations. Since the discussion shows quite clearly that 
the i.s.sue i.s an empirical one. Section II reviews previt^us empirical 
work on the subject very briefly. Section III explains the underlying 
basis of the empirical model of this paper, relating it to recent litera¬ 
ture on rational expectations and the permanent income hvptoihcsis 
(PIH), and then Sections IV and V show how this basic conceptual 
framework was converted into an operational empirical model. The 
estimates are presented and analyz.ed in Section VI, and Section VII 
summarizes the main conclusions. 


I- The Implications of Theory: Pure and Impure 

I. The Pure Permanent Income-Life Cycle Theory 

As Eisner (1969) pointed out some time ago, the PIH casts doubt on 
the effectiveness of income tax changes that are labeled as temporary 
because such measures have only minor effects on permanent income. 
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I'o develop a theoretical benchmark for the marginal propen- 
sit) to consume (MPC) that the PIH suggests might apply to a tempo¬ 
rary tax, consider a rarefied world in which consumers with exoge¬ 
nous earnings streams select consumption paths to maximize lifetime 
utihtt. II capital markets are perfect, only the discounted present 
values of the earnings streams matter, so suppose all households earn 
a umstant incomey per year. Assume further that households differ 
onh in age, «; that the real rate of interest is zero; and that the 
suhietti\'e rate of time discounting is also zero.' d'he question is, If 
me ome taxes aie raised by ; pel capita for the period from / = 0 to t = 
^i, how much less will lonsumers s|)end over this interval? 

In answciing this question, there are three population groups to 
keep tiack of. People who are “alive" (in the economic sense) at t = 0 
and w'fio live pasi die expiiation f>f the tax suffer an inciime loss cjf/,i 
ovci the period. If 7 denotes the length of life, then each such person 
of ageu consumes a fiaction/,/(/' - <•/) of this lossdtiring the years in 
wine li the temporal \ tax is in effect. Thus the change in consumption 
fier Ccipita is Af.', = c/f/('/ - <vl. 

Old people who are alive at / = 0, hut die before / = /,, lose onlv (7‘ 

/i)z 111 income. However, since they htive MP(a of unity during the 
suit.IX period .is .i whole, their change in consumption per capita is 

At,2 = (7 - 11)1. 

I'iiiallv, we imisi worry al>out people who .ire horn between / = 0 
and / = /,. 11 a, a negative mimhei hetwc'en 0 and - /,. denotes the age 
of such .1 person, and tie lives foi /, -i- c/ < /, ye.irs during the period 0 
s / /|. tiis income loss is (/, + c/);. .Since lie spends .t fiaciion (/, 4- 

Cl)//' of this income during the period 0 -e / /,, the change in his 

ccmsiimption is At,';, = [it/, + c;)-|/7'. 

1 o del ive the aggreg.ite cfiange in per capita consumption, weight 
these groups tiv the age clistiihutioii, considering the ages —/, s. « «= 
7 111 the simplest case of a iimfortn ,ige clisti ihutioii,/(u) = 1/7', the 

ctiange is 


At: = 


rr-'i 

J n 


At:, 


fla 4 - 




At:,, 


fUt. 


VV'cirking out the mtegtals atid dividing hy the total income that is 
taxed awav duriiig the period (/,; per capita), we obtain 


MPC: . ^(log 7 - log ,,) + 11 _ ^ I ^ IT-- 

where the thiee terms show the contributions of the three different 
population gioiips. 


this is csscmuillv the iiickIcI iiiiicKluced by Mcicligli.iiii and BiiinitK-r); (It).'j4), 
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To take a concrete example, suppose the typical lifetime of a 
household head as a household head is T = 50 years, I hen, according 
to this formula, the MFC for a 1-year tax ((, = 1) is .09, while that fora 
2-yeai tax (t, = 2) is .15.* The MPC = .09 for a 1-year temporary tax is 
only a rough benchmark representing the pure PIH, and there are a 
number of reasons why the theory probably systematically under¬ 
states the responses of consumers to temporary taxes (see below), so 
we should not lake this number too seriously. Still, there are two 
lessons worth drawing from this simple exercise—lessons that have 
often been forgotten in the temporary-tax debate. 

(v) Income gains and losses from tcmporar)' taxes w'ill cventualK be 
spent just like anj other increment or decrement to lifetime re- 
souites; if less is spent at fust (because/, < 7 ), then more will be sfjent 
later. 1 hits if we want to inquire al>oui the "effectiveness" of tempo¬ 
rary la.xcs, we must specify a time hori/on. Ovei a long eiirtugh run, 
they must lie just as "effective" as permanent ones. 

/>) 1 he so-called zeio effect view—that consumers igiune the stirtax 
and consume as if it never happened—doers not represent the PIH at 
till. Itistead. that rheoiy .says tfial consunieis should spe-nd precise!) 
what they would oti receipt of a windfall gain (or loss) of /,:. In the 
illiisii alive ciilcultition, tins turns out to be the "9 percent ef fect" \ tew. 

2. Caveats aiul Imperfetturns 

I'bere are .several reasons wh\ surtaxes ina\ ailed spending more 
siionglv that) indicateci In pine theorv. First, tax-induced income 
cltanges that ;tre not consumed must l>e s.ived. If windfall gains are 
used to puichase durable goods, consumer spending mas rise much 
more si rough than consumjttion; the conseise mas ha[)jH*n svhen 
theie are windfall losses. 1 he magnitude of the marginal projxmsits 
to spend windfalls on durable goods is, of ccnirsc*. an empirical ejues- 
tion •* 

.Second, some honscdiolds mas be snbjc’cl 10 licjuidits constraints 
lhal are iisiialh ignored b\ the PIH. If we stas svithin the certaiiitv 
context, these constrained households svill read strongh to esen tt*in- 
poraiv income changes.'* Thus the aggregate MFC, for a lemporars 
lax is a weighted .tvc*rage of the low MPf'.s of unconstrained hemse- 
holds and the high MP(^s of constrained ones, .\gain. the imyxirtance 
of this phenomenon is an empirical cpiestion. 

■' H> wiu ot (oiiipansoii. as (,-•/. ihc MPC^ —» 5/(i 

■ Sff f.jf . Dai l,y (1972) 

' .See Blinder (I97t)) ,iii<l Dolctc 11978) I-iii .1 look ,11 one pailicul.ii (\|x’ ot itncrr- 
I ,nnty. sec* toU^v and Hc'tlsvig (197.5), X’liicii shows that ihis rcssiili nias noi t arcs throii|cti 
to the niuc'il.iinty rase. 
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Third, as (>k.uii (1971) pointed out. consumer behavior depends on 
what people believe rather than on what the government announces. 
If consumers disbelieve the government when it tells them that a tax 
hike is only lemporary, then the spending response will be greater 
than that suggested by a naive application of the PIH.^ Since the 
perceived duration of the surtax, not the declared duration, is rele¬ 
vant from the standpoint of the PIH, this too raises an empirical issue. 

Final!), we must recognize the possibility that households may not 
do the kind of rational long-term planning envisioned by Modigliani 
and Brumberg (1954) and Friedman (1957) or, what amounts to the 
same thing, have very high subjective discount rates. If they are very 
shortsighted, then temporary fluctuations in disposable income may 
ha\e substantial effects on spending. 

II. Previous Empirical Work 

Okun's (1971) study ofxnied the empirical debate on this issue. Using 
the consumption ecjuations of lour econometric models, he compared 
the “full effect" \iew that the 1968 .surtax was just as effective as a 
pet inanent tax increase to the “zero effect” view that consumers totally 
Ignored the sunax. While he ccmcluded that the full effect view fit the 
data bettei, an iiuerinediate “5(1 percent effect" view actually does 
heller than either extreme.* Springer (1975) criticized Okun's 
ctonoinetric piocedures and then performed a similai experiment 
with a consumption function based on the PIH. He concluded that 
the zero effect view performed Ireiier. 

justei (1977). using a series of savings et|uaiions based on the 
Houihakker-1 ayloi (1966) model, reached conclusions about the 
1975 rebate simil.ir to those ofOkun for the 1968 surtax. But Mcxlig- 
liani and Steindel (1977) found that the rebate had very little impact 
ovci a horizon of 1 or 2 tjuarters, I'heir modihed version of the 
life-cycle model implied, however, a virtually full effect view over a 
6-tjuaiter hoiizon. Modigliani and Steindel assumed that the nonre¬ 
bale portions of the 1975 lax cuts weie treated like permanent taxes 
and handled the 1968 surtax with dummy variables. 

Fhe existing empirical literature thus offers little consensus. The 
issue seems (juite oiren. 


III. The Distributed Lag Model of Consumption 

fhe basic vehicle for investigating the effectiveness of lemporary 
income taxes in this paper is the di.stributed lag version of the PIH. 

■’ 1 his (XHMi has rather less oigcncy with respect 10 lax cuts; but here too cfinsumers 
may believe them to fie more permanent than the government announces. 

‘On this, see Blinder and .Solow (1974, pp. 107-9) 
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While this is the standard way of implementing the PIH empirically, 
the recent literature on rational expectations has seemed to raise 
doubts about its validity.^ This section shows how the PIH and 
rational expectations together lead to an estimating equation very 
much like the one I use in this paper. 

As Muth (1960) pointed out, the PIH is basically a Joru>ard-limhing 
model of consumer behavior. It states that consumers, in deciding on 
their current spending, weigh their current asset holdings, their cur¬ 
rent income from labor, and their expected future income from 
labor. Specifically, permanent income at time I is defined as 


Y‘,‘ =A,+ 


V ,Y.^. 

(1 +r)' ’ 


( 1 ) 


where .-1, is the stock of t eal assets at the beginning of period I, Y, is 
noninterest income in period I, and ,Y,^. is the mathematical (i.e., 
rational) expectation of I',+, that is formed at time I. (By convention, 
lY, — Y,.) However, if the stcKhastic process generating income tan be 
described by a time-series model such as 

1’, = + ■ . . + , -t- 6,, (2) 

where e, is a white noise error term, then the resulting empirical 
model of consumption will huikxt’tird looking. For example, il the 
llieoreiical cotisumption function is 

f'c = S + III. (3) 

ilien the empirical consumption function will Ik* 

c, = 5 + kA, + //„}■, + f>„y, „ -t- (4) 

I o see this, it is only ticcessarv to note that the (rational) ex|^x-cta- 
tion of income in pet iod I + \ can, in view of (2), be based onb on the 
infoi mation set {!,, I, ,, 1,2. • • • }. Thus, tor example, 

,>■/+, = a,)'l + lljY, , + «;,}■,-2 + . . . + 

, ) ,,-2 = c; 1,1 ,,, + CJ 2 I , + rt;,I , -I- . . . -f + I 

= («? + «2)I', + (fl,«2 + + . . . , 

and so on. Substituting all such expressions into the definition (1) and 
then into the consumption function (3), it is clear that (4) is derived. 
As pointed out by Sargent (1978) and others, the coefficients in 
e(|uation (4) will be complicated functions of the coef hcients n, in (2). 

Suppose theti that, as suggested by Dolde (1976), we can distinguish 
.imoiig two or more sources of income whose generating functions (2) 
may differ. The PIH in conjunction with rational expectations then 


’See Lucas (1976) aiicf csp. lUH (1978). 
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itn[)lies ihai ihe h\ in (4) should follow a diflcrenl pattern for each 
income source, A simple example will illii,siraie this point and a!.so 
give us some feeling (or possible magnitudes. Consider several in¬ 
tome souices, each of which is generated by a hrsi-order autoregres¬ 
sive: 

lit = + €»• (5) 

Wtu king out the expectations and plugging into (1) gives us a simple 
expression for the permanent imorne attributable to each .source; 

> R = - J — if p < 1 + r. (6) 

1 + » - p, 

Notice that, despite the long time horiztin contemplated by the PIH, 
consumption depends otily on current income. In general, con.sump- 
tion will depetid on jtast income only up to lag n, where ti -t 1 is the 
longexst lag con.sidered in equation (2). 

Now compare isso income sources, one of which is etitirely perma- 
tient (p = 1) and the other of which is entiiely transitory (p = 0). A 
$1.00 increase in the permanent component will, according to (6), 
raise permattent income by $(1 + r)/t and thus raise consumption by 
$A(1 + /■)/>. I hi.s aiiiy imply a very large immediate spending re¬ 
sponse." By contrast, a $1.00 fluctuation in the purely transitory 
component will, agitin according to (6), raise permanent income by 
only $1.00 and thus raise consumption by only $k. The les.son, of 
course, generali/.es and applies far beyond the coniines of hrsi-order 
autotegressives: iricomr sonnes dt'emed to hr more permanent m/l rltrtt 
promptet spending responses than income sonries denned to he more temporary. 
The application of this principle to iHrmanenl versus temporary 
changes in taxes is apparent and immediate and was ehicidaleci clearly 
by Lucas (1970). It is the basic notion underlying the empirical mcidel 
to be developed in the next section. 

However, lest confusion arise, 1 should stress that there is no sense 
in which the ratiofiality of expectations is either assumed or irnposeci 
in the consumption functions estimated Itere. My point is only that the 
distributed lag formulation of the PIH is consistent with rational 
expectations. As Sargent (1978) has emphasized, rational expectations 
delivers a set of restrictions acros.s equations (2) and (4) that can be 
imposed in estimating the two jointly. I have made no attempt to 
impose these restrictions here because my interest was in getting the 
best possible consumption-function estimates, not in testing 
rationality. Furthermore, it is well known that quite different models 

*sSuf>p<)se the ra(e of jiubjetlive time discounting is equal to the rate of interest, so 
thai a constant consumption stream is optimal, and thatB is the lifetime propensity to 
consumc(i.e, I — is the propensity to bequeath). Then A will he fir/( 1 + r), so thaiA(l 
+ r)/r will be B, which is close to unity. 
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of tonsumpiion behavior (e.g., habit peisislencc) can lead (o an es¬ 
timating equation very much like (4). It is not my purpose lo discrimi¬ 
nate among alternative ways of arriving at (4). 


IV. Derivation of an Estimating Equation 

The preceding discussion makes it clear that different distributed lag 
coefficients might be associated with different sources of intome. 
While the actual empirical analysis considered four types of income, 
the model is mo.st readily explained if I suppo.se there are only two: 
incctme (positive or tiegative) attributable to temporary tax measures, 
which I denette as .S, (“special income”); and all other disposable 
income (“regular income"), which I denote as/?,. 'I'he/?, shcjuld not be 
confused with permanent income, since it has both permanent and 
transitory components. The basic idea underlying the estimating 
equation is that S, is identifiably “less permanent” than /?,. 

Suppose consumption responds to R, according to a set of distrib¬ 
uted lag weights: rcj = dC,ldR,^j,) = 0, 1, . . . , n. Since the xcj depend 
oti the stochastic process generating/?,, it is worth reporting that the 
deviations of/?, from a logarithmic time trend are well described by 
the following second-order autoregressive:’* 

V, = l.‘/8y,-, - .35y,,j, /?'•' = .91, D-W = 1.86. 

(.09) (.09) 

When income follows a second-order autoregressive, permanent in¬ 
come as dehtied in (1) is: 


Y'l = /I, + + 


(1 + rf _ 

(1 r)(l + r - «,) - • 


+_ ^'2(1 + 0 _^ 

(1 -I- r)(l -H r - a,) - tJ 2 " 

if til + « 2 /(l -E r) < 1 + r, where /v, is the present value of the trend 
component of labor income and y, and y,-, are current and lagged 
deviations from trend. Given the estimates ofand aj above, and for 
r = .0074 (a 3 percent annual real interest rate), the implied 
coefficients are Vf = A, -h K, + 13.5y, — 4.7y,_,. This leads us to expect 
a very large value of iVg, followed by swiftly declining u)’s—possibly 
even turning negative, 'fhe empirical results bear this out. 

As Lttcas (1976) has argued, income changes that are clearly “more 
temporary” than regular income should get different spending 

Standard errors are in parentheses. Lxrnger auloregressives. however, give slightly 
belter hts. E.g., if ( - 11 is the longest lag allowed to enter the regression, significant 
(oefheients are obtained at lags I, 2, 3, 4, and 11 An F-lest for the zero restrictions 
implied by the second-order model, however, yields an f-ratio of only 1.77, which is 
well below the critical 5 percent point of the jd distribution. 
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coef ficients. I n develop a model of the distributed lag response ol'C, 
to St, first break down S, into its components: 

S, =Sf +S? + . . Sr, (7) 

where .S' indicates the income gain or loss in quarter I from the ilh 
tcmpoiary tax. (In the empirical work, m = 3.) It will help clarify the 
treatment of the .S| if I define a hypothetical set of lag coefficients jSj as 
the effect on C, of a $ 1.00 pure windfall gain received in quarter / — j. 

rhe treatment of S' depends on whether or not the t th temporary 
tax is still in effect. If it is, I assume that S} is treated as a weighted 
average of regulai atid windfall income, st) that it gels rhe distributed 
lag weights 

-y, = = Xte, + (I - M/3,, 7 = 0. (8) 

0«A.^1. 

If the tempotat y tax is no longeron the books, 1 assume thatconsum- 
ets look upon in reiros{X‘ct as if it had beeti a pure windfall and so 
apply the distributed lag coefficients /3j. By introducing a dummy 
variable dehned as; 

Dl — 1 if the /th temporary tax remains in force in quarter /, 

= 0 otherwise, 

it is possible to cotnbine these two hypitheses into a single expression: 

yj(/) = /;f|At/y + (1 - + (1 - D{)/3j, (9) 

where the tioiaitt)n now indicates that they weights depend both on 
calendar time and on the specific lax under consideration (because of 
the dutniny variable). 

An interesting point ari.ses heie. Standard pre-rational-expec- 
tations approaches to consumption-function estimation would sug¬ 
gest that S/S, and iy, lie constrained to etjual apparently 

meaning that the “long-run MPt"’ out of any type of income is 
identical. However, the FIH-cum-rational-expectations approach 
suggests no such adding-up constraint. lo see this, follow Sargent 
(1978, pp. (j8l-S2) in rewriting (2) in the form X, = HXt^^ -I- tj* 
(Sargent’s eq. 8), where 
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and Y, = dX„ where d = (1. 0 .0). As Sargent notes (his eq. 9), 

rational expectations implies ,Xi+, = H‘X,, whence = dH’Xt. 
Substituting this into (I) gives the following expression for permanent 
income: 


y? = /I, + 



dH‘ 

(I + r)' 



which is of the form (4) with coefficients 


ft 


(fto» ^1* • 



dH’ 

(1 + ry ■ 


The sum of the ftj has no obvious interpretation. Thus, in a model 
with several sources of income, there is no particular rea.son why the 
various sets of distributed lag coefficients should have a common sum. 

Where, then, does the lifetime budget constraint enter? The answer 
is that (4) implies a unitary lifetime MFC for any values of A and the ftj. 
The proof involves some straightforward but tedious algebraic ma¬ 
nipulations of the difference equations (4) and 

/!,+, = (1 + r)A,-f y, - C, ( 10 ) 

and hence is relegated to Appendix A. 

With ihe.se preliminaries out of the way, it is easy to explain the 
estimating equation. If there were no special taxes to worry about, the 
basic empirical model of consumer behavior would be as follows: 


Cjf = Aq + 

i-o 

+ AjflT, — At) -t- ^A 3 ^.jA|_, + u,. 


( 11 ) 


where r, is the rate of interest, W, is consumer net worth at the 
eginning of period t, and A, is the market value of slock market 
wealth at the beginning of period 1. The specific way in which assets 
are entered into the consumption function, including the constraint 

^ ^ suggested by the MfT-Penn-SSRC 

{MYS) model and is unimportant to what follows. 

Now consider the separation of disposable income into its two 
components: 

Y, = R, + S,. (12) 

1 he vray I have defined the -y’s means that (11) is expanded to: 

C,=A„ + k,r,Y, + AdIT, - A,) + 




where X', ^ /J/.S'-j + . . . + = 0.ri.This is not the actual 

estimating e(|uati<>n because additional income sources were distin¬ 
guished, because the distributed lag coefficients were constrained in 
sevetal wtiys, and because corrections were made for both hetero- 
scedasticity and serial correlation in the error term. Details are spelled 
out in Appendix B. Nonetheless (14) is the most u.seful form for 
iiueipieting the estimated parameters. The model is nonlinear be- 
tau.se ol the parameter X—the crucial parameter of this study. 

As noted above, theory does not imply that StCj = X/3j. To investi¬ 
gate this further, this adding-up constraint was imposed in (14) and its 
validity tested as follows.'" Let ( denote the likelihood ratio. Then, 
under the as,sumptioii of normality, —2 log ( — T log (SSR^/SSR) is 
distributed as a witfi t degrees of freedom, where; T = number of 
observations (= 100 throughout this paper), SSR = minimi/ed value 
ol the sum of squared residuals in the unconstrained regression (eq. 
114] in this ca.se), SSRr = minimized value of the sum of squared 
residuals in the constrained regression (obtained by imposing 2/3 = 
Sic), and r = number of restrictions (= 1 in this case). As reported in 
table 1, row 1, the constraint was rejected at the 10 percent level but 
not at the 5 percent level. There being no persuasive theoretical 
rationale for it, the constraint was dropped. 

Tacitly, however, (14) embodies a number of other constraints that 
are etjually lacking in theoretical justification—constraints that each 
type of regular income is subject to the same set of distributed lag 
coefficients. These constraints were tested by a series of likelihood 
ratio tests, which are described in the balance of this section. 


See Goldfeld and Quandt <1972, p. 74). 
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t/iicoristiatned 

Model 

C^iiisfraint 

Tested 


df 

Test 

Statistic 

1 E<p (14) 

Xw = xp 


1 

3.67 

2. K,q (16) 

Vj =■ wj for all 

) 

3 

12.02 

3 Eq. (16) 

Xt'i = XiOj 


I 

Approximately 0* 

4. Eq. (18) with Iv = Sat 

<^j ~ Wj for all 

1 

8 

19.02 

Eq, (18) wiih ia = iie 

X<f)] = Xwj 


I 

12,.52 

6 E(| (18) with Sii = Si/' 

X = () 


1 

1..32 

7 Eq. (18) with h' = hi' 

X = 1 


1 

2.84 

8. Eq (18) with Ea = Xw 

II 

o 


1 

17.81 

Snn —<;rinral IrvrU lor the x* arr. 




(If 

\m Point 

Point 

1% Point 

1 

2 71 

3.84 


fi 03 


6.2'’* 

7HI 


11 34 

* Due (•> roiiiKiiiiK cni n the a< tual < 

oTupuu-d irM KaiiAiK was 

sligitilv ntfgutivr When this consirdini w<is tested m 

lilt (tintcxi of t*(| ((H), ii prodtucti a 

tc'st s(a(i5(H of f 32 




First, “regular" disixisable income 

was 

disaggregated into its two 

main components — personal income 

and 

“regular” personal taxes; 


R, = p,-r,." (15) 


Fcrsoiial income is assumed to be s|ieiu according to the lag weights 
7Cj, while tegulai taxes ate assumed to be subject to a different set of 
lag weights Vy Special taxes are treated as previously explained, ex¬ 
cept that the v\ replace theie’s in etjuations (8) and (9). That is, while 
the tax is on the books, a special tax is treated as a weighted average of 
a permanent tax and a windfall. I hus the basic consumption lunction 
becomes: 




k„ + k,i,Y, + k-iiW, - A,) + , 

J=0 

+ , - '^Vj(r,.j - kXj) 


« m 


-I- u,. 


(16) 


Since (as explained below) the distributed lag coefficients are con¬ 
st rained to follow a third-degree jxrlynomial with a zero end-point 

In iiiakijig tins .sepaidiinn, 1 departed a bit Irani natianjl intome actounririi^ 
iDiivemions by iru biding the employer’s share of social insurance toniributions in both 
P .md 7 
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constraint, the null hypothesis that the vj are equal to the Wj imposes 
three constraints on equation (16). Row 2 of table 1 shows that these 
constraints were resoundingly rejected by the data (x* = 12). How¬ 
ever, it turned out that the sum of the Vj was estimated to be almost 
e.xactly equal to the sum of the Wj (see row 3 of table 1), so this 
adding-up constraint was impmsed in subsequent estimates. 

'rhe final generalization considered was to disaggregate personal 
income into its two main components—factor income and transfer 
payntents:'^ 

P, = F,+ V,. (17) 


This made the estimating equation: 


C, = ku + -I- k^iWi - A,) + ^k^+jAt.j 

j=0 

n n n 

J f-J ~ ‘~J ~ 


(18) 


+ ^/3i(I - \)X^, + £(1 -/)|).S'j.j 


-f Uf. 


Once again, the null hypothesis that the </>j (spending coefficients for 
iratislers) are in fact t*t]ual to the wj (spending coefficients for factor 
income) was tested by a And once again it was resoundingly 

I ejected (xa = 19. see tow 4 of table 1). This time, however, the data 
also rejected the adding-up constraint = ^v'j, which therefore was 
not impo.scd (table 1, row 5). In fact, mo.st of the difference between 
the <t)j and the Wj was in their sums; the time patterns were remarkably 
similar. 

l o sunnnarize these tests, we are left with a model that assigns 
distributed lag weights Wj to factor income, <t>j to transfers, Wj lo 
regular taxes, and fij to windfalls. I'he St'j and Siaj are apparently 
ec|ual, but the other sums arc not. 


V. Issues in Estimation 

1. Dahl 

Following the suggestion of Darby (1975), I used consumer ex¬ 
penditures, rather than pure consumption, as C,. This seems most 
appropriate where the focus is on the evaluation of stabilization pol¬ 
icy, as it is here, rather than on testing the PIH. Furthermore, it 

For this purpose, Ixnh employer coniribution.s and business transfers are con¬ 
sidered to Ire iactor income, and the aspects of tt>e 197.5-76 tax cuts that are cla.ssiried as 
transfer payments in the national income accounts are grouped with temporary taxes. 



TEMPORARY INCOME TAXES 


39 


TABLE 2 


Effects on Disposable Income of 1975-76 

Tax Cots and Transfers* 

Quarter 

Taxes! 


TRAN.SFERst 

Total 

Tax Rate Cuts 

Rebate 

Social Security 

Earned 

Income 

197.5:2 

8.5 

31,2 

6.7 

0 

46.4 

1975:3 

12.3 

0 

0 

0 

12.3 

1975:4 

11.9 

0 

0 

0 

11.9 

1976:1 

H.2 

0 

0 

1 9 

16.1 

1976:2 

14.0 

0 

0 

I 6 

15.6 


* In htllion^ of current dolbra, at annual rate^ 

t from C S Burciiii of Eronfffnir AiuJysis ffebruary J976, March I977> 

t Kindly siipidied lo ihc ainhiir by Ji«tph C Wakefield, of the Bureau of Economic Analyaii, in converajiion 


avoids many complicated issues of definition (e.g.. Which goods are 
durables? How fast do they depreciate? etc., etc.). The cost of this 
•shortcut is that the theoretical interpretation of some of the param¬ 
eters is lost. For example, k, includes the effects of r, on both pure 
consumption (which may be positive or negative) and spending on 
durables (which should be negative). Similarly, the lag weights wj 
might be expected to be less smooth than the lag of pure consumption 
behifid income because of the lumpy nature of durables.’® 

Data for the 1968 surcharge were uken from Okun (1971) and 
converted to 1972 dollars by the deflator for personal consumer 
expenditures; these comprise S). Data for the various 1975-76 tax 
cuts are shown in table 2; they were similarly deflated and segregated 
into two time series. The Sf series is defined as the explicitly one-shot 
measures: the tax rebates and the .social security bonuses (hereafter 
referred to as “the rebate”). The rest is considered as Sf. Since the 
1975 cuts were extended several times and are now a permanent 
eature of the lax code, an arbitrary decision had to be made as to 
when they became “permanent.”’" I decided to cut off Sf after 
1976:2, because by then the cuts had already been extended once in 
the Revenue Adjustment Act of 1975 and a second time in the Tax 
Reform Act of 1976. 


»p«ial feaiurcs of expendifiires on consumer durables several 
addiuonal variables were ined in some eariier regressions. Neither the stcKrof dura 
f. relauve price of durables, nor the unemployment rate succeed^ in 

gmhcantly lowering the sum of squared residuals. In several cases the signs of the 

hnai-cia^ fotnTlh^ relevant to the choice between saving in the form of durables vs in 

tem,^ ^ '^'^“rch, 1 experimented with a learning model in which a 

long^’^tndTonl'^t^ considered permanent as it remained on the books 

longer and longer. This experiment was unsuccessful. 


fOVKNAl- OF POUTJCAl. ECOSOMY 


^ . anj Its breakdown into stock 

Data on tonsuiner net wo ' ’ _ 4 s tomponeiUs. were taken 

/nmi the d.u.i . .. .,„,rrfs. the most important 


nf which if. the How of funds. ... 

Heciiusc of the recent hnding.s of Boskin (197H), 1 thought it im- 
poi rani to use a real afier-lax iiuerest rale. Since the construction of 
thi.s series was a fairly involved af fair, 1 explain it only in Appendix C. 


2. Dotribuled Lag EstimiUion 

'I’lie many distrihiitecl lags in equation (18) were estimated by an 
adap(atif>n of the Almon (1965) lag technique, as a method of con¬ 
serving on paranieiers. ('renerally, a third-degree polynomial with a 
zero constraint at the far end was used. Preliminary tests suggested 
that these end-point constraints (ii',i,i = 0, etc.) could not be rejected. 
An unconsiraitied version of one specification, run as an experiment, 
showed that the polynomial constraint had very little ef fect on the w, 
coefficients hut did tnffuence the coefficients. 

The distributed lag ef fects of assets were estimated as follows. In 
some preliniinary regressions, the two components were combitted 
and a cubit distributed lag over ri quarters was estimated. Theti the 
two components were di.saggregatecl. 'I'fiese preliminary tests showed 
c|uite clearh («) ifiai the coef ficients were not the same and {b) that the 
lag was much sfiorler tfian n (|uarters. (As explained just below, n was 
cho.sen to he 7.) In tlie case of non-stock market wealth, an estimated 
disiiihiited lag over 4 (|uartcrs attached virtually all of the weight t(t 
the current (siait ol period) value, so all lagged values were omitted. 
In the case of sio< k inaiket wealth, when a cubic was estimated over/ 
= 0, 1, . . ,7, the coefficients turned out to be almost linear and tc) be 

virtually zero after / = 2 (a small positive value for; = 3 and small 

negative values for/ = 4.7). 'I'hus a linear distributed lag over/ = 

0.3 was adopted as the final specification. 

1 he length of the distributed lag, 11 , was .selected by running a 
preliminary version of the regression for alternative values of’ n 
ranging fiom 6 to 10. A very clear minimum in the sum of squared 
residuals was found around a = 7 or ri = 8, with the former having a 
slightly better fit and slightly better coefficients. On this basis, n = 7 
was selected for all subsequent work. 

3. Treatment of the 1975 Rebate 

As has been noted already, the model divides the 1975-76 tax cuts 
into two parts: 57 includes the rebate, while 5? includes all the rest. 
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This makes a strong (and questionable) assumption about how con¬ 
sumers treated the rebate. In particular, it assumes that they treated it 
just like the 1968 surcharge and the other 1975 reductions; essentially 
as if a fraction A of it was a regular increase in income, while a fraction 
/ — A was a pure windfall. Given the nature of the rebate, this is 
questionable, to say the least. 

An alternative assumption—equally strong as the first —is that con¬ 
sumers treated the rebate as a pure windfall right from the start.'* 
While it is not obvious that this is true, since consumers might have 
anticipated a repeat performance with some reasonable probability, 
it does seem a plausible working hypothesis. Fortunately, it is not 
difficult to modify any of the three models to accommodate this 
alternative hypothesis; all that is necessary is that the lag weights ft be 
applied to .S? starting immediately in !975;2. When this was done in 
one version of the model, the resulting equation had virtually an 
identical SSR, almost the same estimated X, and very similar implica- 
lions about spending patterns out of the rebate. Thus the conclusions 
of this study seem insensitive to the treatment of the rebate. 


VI. Empirical Results 

1. Parameter Eitimates and Interpretation 

Estimation was done by the numerical optimization package deveH 
oped by S. M. Goldf'eld and R. E. Quandt. The results from estimat¬ 
ing equation (18) on quarterly data covering 19.53; 1-1977:4 are 
presented in table 3. The number in parentheses next to each esti¬ 
mated coefficient is its asymptotic standard error (or rather a numeri¬ 
cal estimate thereof). 

In interpreting the .standard error of the regression, it should be 
mentioned that, as explained in Appendix B, the equation was ac¬ 
tually transformed so that the left-hand variable was the average 
piopensity to consume (APC), C,/F,, rather than consumer spending. 
Thus, the standard error of .00.38 is relative to a typical value for the 
APC of about .90. This represents an excellent fit."’ At 1977 income 
levels, it translates to a standard error of about $3.5 billion in pre¬ 
dicting consumer spending. Of course, obtaining a good fit with a 
consumption function is hardly a notable achievement, and the 
equation-—like .most consumption functions—does suffer from some 
autocorrelation (p = .54). 


Accession t^Urnber 
. 

Bate .^5. ^ 


" hence treated as a windfall in any 

case, only 1975:2 is at issue here. 

'* errors of a comparable equation in Modigliani and Sieindel (.1977) ; 

are .0056 with an autocorrelation correction and .0065 without. ■■ * 






42 


JOURNAL OF POLITICAL ECONOMY 


1 ABLE 3 

Nonuneak Consumption FifNcrnoN Parameter Estimates* 


kn AI ki A p 


30 1 0002 .021 ..50 .54 

(7 1) (.0007) (.005) (.32) (.11) 


Dis I BIBKTED Lao CoEfTICIENrst 


/ 

U'j 

<i>) 



^a+j 

Yj 

Rebate 

0 

m (05) 

.50 (.16) 

.34 (.09) 

-.03 (.26) 

.009 (.002) 

.16 

.16 

1 

.16 (.02) 

- 02 ( 09) 

14 (.(M) 

-.03 (.11) 

.006 (.001) 

.06 

-.03 

2 

06 (.o;s) 

- 20 (.10) 

.04 (.05) 

-.01 (.09) 

.004 (.001) 

.02 

-.01 

3 

12(02) 

- 14 (.08) 

.01 (.05) 

03 (09) 

.002 (0005) 

02 

.03 

4 

08 (.01) 

.04 (.06) 

02 (.03) 

.08 (.07) 


.05 

.08 

5 

.02 (01) 

26 (.07) 

.05 (.03) 

11 (.06) 


.08 

.11 

6 

.10 (.02) 

.39 (.09) 

08 ( 04) 

.12 (.08) 


.10 

12 

7 

1 1 ( 02) 

34 ( 08) 

07 (.04) 

.09 ( 07) 


.08 

.09 

Sum 

.74 

1.17 

.74 

.36 

.021 (.005) 

.55 

.55 

Ntm —*<, - 

loiist.iiii, ki 

UM'Uit-icrH of miercii r^tc.At 

^ ctiefiitit-m <»( nt>n'StfK.k niarkci wealth. X • 

» weight on 

rt-Ktildt ihconK 

p ~ aufix tin 

efddon <oc(fku‘ni Sumof^u 

uired re^idiiaK ~ 

,(K)i-47. SE= ()058.H. SF .)! 

unddiusted 


<”r»<»n (wnh«ju< <i a»riocii<>/i Tor .^ufoc4»^rtf^«{w»n) » 004W. A' 4>l>9orvatioit« - l()0, 
* Asvinj^oiu Sts .ut III jwrcnihoscA 
^ (-otn^Kiiirrics riMV not add «i oit.ils due to roiindinjt 


1 uniin^f to the coefficient estimates, the most critical parameter for 
purposes of this study is X, the weight attached to regular income in 
ec|uation (8). The point estimate of .50 suggests that temporary taxes 
that are still on the books are treated like 50-50 blends of windfalls 
and regular taxes. However, the standard error is regrettably large; 
there are, alter all, pitifully few observations that ran be used to 
estimate A. The null hypotheses A = 0 or A = I can nonetheless be 
tested by likelihood ratio tests. When these tests were run with equa¬ 
tion (18) as the unconstrained regression (see table 1, rows 6 and 7), 
the null hypothesis that A = 0 (temporary taxes are regarded as pure 
windfalls) could not be rejected (xf = 1.32). But the null hypothesis 
that A = 1 (temporary taxes are regarded as regular income) could be 
rejected if we were not too fussy about significance levels (xf = 2.84). 

I turn next to the distributed lag coefficients of the various income 
terms. The te’s for factor income are very large and positive at first, 
then turn small and negative, and finally become positive again at the 
end. This general shape accords well with our expectations.” The <f>j 
coefficients for transfer payments follow a similar shape but are much 
more erratic and less well pinned down econometrically. A notable 

’’ Because of the estimating forin, it is unlikely that simultaneity has much to do with 
the large esiimaic for See Appendix B. 
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feature is that their sum is nearly 1.2, indicating “overspending” 
during the first 2 years after receipt of a transfer payment. 

The Vi coefficients for regular taxes also exhibit a characteristic 
U-shape, but in much more muted fashion. As compared with factor 
income, spending in the first year after a regular tax cut is apparently 
substantially less, after which it catches up. This was surprising at first, 
since Dolde (1976) and Modigliani and Steindel (1977) had suggested 
that regular tax changes are “more permanent" than regular in¬ 
come.'* However, it turns out that the following second-order au- 
toregressives describe the deviations from trend of personal income 
(P,) and regular taxes (T,): 

P, = 1.32P,-, - .38P,_j, 

(.09) (.09) 

7',= 1.1 ir,., - .21T,_,. 

(.09) (.10) 

Using the formulas derived earlier for permanent income, these 
time-series models imply that “permanent personal income" and 
“permanent taxes" are given byPf = 14.9P, - 5.7P,_, and Jf = 9.37', 
- 1.9so that, contrary to Dolde and Modigliani and Steindel, we 
should actually expect a stronger short-run response of consumption 
to fluctuations in P, than to fluctuations in T,, which is exactly what 1 
find. 

The next column, the /3/s, are in some sense oul-of-samplc ex¬ 
trapolations since there are no “pure windfalls" recorded in the 
data.'" Their only use is to form the weighted average (1 - \)/3j, 

which is reported in the column marked “yj." These are the ex¬ 
penditure coefficients for income from a temporary tax that remains 
on the books for the entire 2-year horizon. To illustrate the opposite 
extreme, the column marked “Rebate” shows the spending 
coefficients for a temporary tax that lasts only 1 quarter. These two 
columns differ in details but are quite similar. There is a moderate 
spending response in the initial quarter, followed by very little 
spending over the next 3 or 4 quarters. Most of the spending out of a 
temporary tax cut, according to these estimates, comes 5 or more 
quarters after the cut, 

I he coefficient of assets (.02) is comparable to what others have 
estimated, though a bit on the low side. A likelihood ratio test of the 
null hypothesis ^2 = 0 (which, in this constrained form, al.so implies Jk„ 

“But see Dolde (1979), where the transitory nature of allegedly permanent ux 
changes is stressed 

1 made an attempt, in some early regressions, to treat the Nauonal Service Ufe 
Insurance Dividends of 1950 in this way, but I was not successful. 
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1 ABLE 6 

F.siimaieu Efeuvis of the 1975-76 Temporary 
Tax Cuts on C-onsumi.r Expenditures* 


Qiiarttr 


Estimated Spending Effect 


1975:2 5.9 

1975 5 1.6 

1975 4 2.7 

1976 I 4.8 

1976,2 6.7 

1976:3 6 7 

1976 4 7.1 

1977.1 6.6 

1977-2 7.2 

1977 3 7.5 

1977 4 7.3 


* hi hilliiMis >>l ]*)72 clolldis 


perceiil—ihe lowest Hgiii-e thal had then been recorded since the 
Korean War.) 

According to the estimaies presenled in this paper, the temporary 
tax cuts of 1975-'76 contributed to both phenomena. Using the 
spending coefhcients presented in table 4, table 6 shows the estimated 
diieci (excltiding multiplier) ef fects on consumer spending of the tax 
cuts of 1975:2 through 1976:2, inclu.sive.*' It appears that (1) very 
little of the rebate was spent in 1975:2, (2) rather little of the disposa¬ 
ble income attributable to the temporary tax cut package was spent 
dtiring the remaindei of 1975, and (3) more spending out of the 
temporary tax cuts was done in 1976 and yet more in 1977. Both the 
low APC of 1975: 2 an<l the high APCsof late 1976 and early 1977 are 
tracked very well by the model. One observation which virtually 
jumps from table 6 is how very small these estimated spending im¬ 
pacts are relative to the size of the economy they were meant to 
stimulate {real ONP in the neighborhood of $1,250 billion). 

Finally, there is one more question. If, instead of the 1975:2- 
1976:2 package of temporary measures, the government had cut 
taxes ‘‘permanently" in 1975:2 and then restored them to their origi¬ 
nal level starting in 1976:3, how large a tax cut would have achieved 
the same average effect on aggregate demand?** 

Table 7 summarizes the model’s answers to this question for three 
different choices of the horizon over which the “average effect on 
aggregate demand” might he defined. The first column gives the 
average direct impact on sfiending attributed by the model to the 

/ he reader is reminded that only these H tjuarler.s are considered temporary cuts. 

For this calculation 1 assume thal consumers were .successfully fooled into thinking 
that I he 5'quarier lax rut would Ik* permanent. 
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TABLE 7 

Equivalent Pekmaneni Taxes*-! 


Average Impact on 
Spending of Actual 
HoriAon 1975-76 Cuts 

4 quarters 3 7 

6 quarters 4.7 

8 quarters 3-3 

* Srr icxi fnr <lefinition 
t 111 billions 111 1472 lliillars, al annual ral« 


Cumulative Revenue Loss over 
a Quarters of Permanent Tax Cut 
with Equal Average Impact on Spending 

9.5 

12.4 

15.9 


1975-76 tax cuts. The next column shows how much total tax reve¬ 
nue the government would have had to relinquish during the same 
5-quarier period (1975:2-1976:2) in order to achieve the same direct 
impact on spending through a permanent tax cut. Since the total 
5-quarter revenue loss from the 1975-76 package was |20 billion, 
these numbers mean, for example, that a permanent tax cut about 
half as large ($9.5 billion vs. $20 billion) would have had the same 
first-year ef fect on aggregate demand. Over a 2-year horizon, how¬ 
ever, the 1975-76 package had about 80 percent as much “bang for 
the buck" as a permanent tax cut. 

VII. Summary 

Both economic theory and casual empirical observation of the U.S. 
economy suggest that short-run spending propensities from lempo- 
rary tax changes are smaller than those from permanent ones, but 
neither provides much guidance about the magnitude of this differ¬ 
ence. This paper offers new empirical estiinatesof this difference and 
finds it to be quite substantial. 

1 he analysis is based on an amendment of the standard distributed 
lag version of the PIH that distinguishes temporary taxes from other 
income on the grounds that the latter is "more transitory." This 
amendment, which is broadly consistent with rational expectations, 
leads to a nonlinear consumption function. 

I hough the standard error is unavoidably large, the point estimate 
suggests that a temporary tax change is treated as a 50-50 blend of a 
noimal income tax change and a pure windfall. Over a 1-year plan¬ 
ning horizon, a temporary tax change is estimated to have only a little 
more than half as much impact as a permanent tax change of equal 
magnitude, and a rebate is estimated to have only about 38 percent as 
rntich impact. The model tracks both the extraordinarily high savings 
rate of 1975: 2 and the extraordinarily low savings rates of late 1976 
and early 1977 very well and attributes part of both phenomena to the 
temporary tax measures of 1975-76. Finally, it is estimated that a 
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permanent tax cut of about $9.5 billion (in 1972 dollars) would have 
had the same impart on aggregate demand over the first 4 quarters as 
the $20 billion of 1975-76 tax ruts. 


Appendix A 

The Lifetime Budget Constraint 


1 his Appendix demonsirates a icsiilt that, 10 niy knowledge, is not very well 
known; thai die long-run MI’C lorrespoiuling to a runsurnption (iinction of 
the form 

ft 

c, ^ U, + (Al) 

J-O 

IS um(\ for any value of h (greater than the real rate of interest) and for any 
h\ 

Pnki/: Write e<|tiation (10) in the text as 

II - (1 + = K, , - C, ,, (A2) 

and wme (.41) as 

r;, = Lf, + f>(Z.))„ (A3) 


where L is the lag operator and />(/.) = h„ -I- /),/. -I- ... 4 h„I“. Applying the 
operaioi 1 - (I 4 1 )/. to (.4,3)and tisiiig(.42) yieltls 11 — (I 4i ~ i)L](.'/ = {W. 
4 [1 (1 4 1 )/,]/;(/.))>',, which can he written 


where 


C,-=«(/.) 1„ 

fit/ t = AL 4 [1 - (I 4 >)LI/AL) 
1 - (I 4 1 - A)L 


(A4) 


(A.-i) 


I o ohl.nn die hleiitne spending generated h\ a $! .00 1111 pulse in ),, we must 
(ompnie the dis<(iniited sntn of coefficients: 


Ml't; - 2] 

»»0 


(I + r)' ■ 


(AG) 


1 o simplify die noiaiioii, lei f) » 1 4 1 — k. Assninmg lhal 0 < 1 (i.e., that k > 
r). (A.4) can fie w 111 ten: /f (/.) = -{ A’/, 4 | I — (1 4 r)L |A(Z,)}( 1 + HI. 4 9^/.'^ + 
fins IS ol die lot III /If/,) = /!„ 4 Ii,L 4 /Ij/A 4 . . . , with the following 
roetfic tents: 


/I, = ffl>„ 4 (A, 4 A - (f 4 r)/»„J. 

= 0'‘b„ 4 4 A — (1 4 i)/>,i| 4 (/ij — (I 4 l)A,J, 


li„ = H’% 4 '[h, 4 A - (1 4 r)/i„| 4 ... 4 [A, - (1 4 r)h„. ,], 

Pnu = 4 riA, 4 A - (1 4 r)h„] 4 ... 4 Blhn - (1 4 r)A„_,) 

- (I 4 ; )/i„, 

= 1 = 1,2 . 
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Substitution of all of these into (A6), and some truly horrendous grinding, 
establishes that MPC = 1 regardless of the magnitudes of if and the Aj (as long 
as k > 0). Q.E.D. 

A brief word on the interpretation of the sum of the Aj in (Al) may be in 
order here. Should K, rise permanently by $1.00—a statement that is basically 
meaningless if the autoregressive process assumed in the text (eq. 12)) really 
holds—the eventual change in C, would, by (A4), be P (1) = According 

to (A.5), this sum is 

k-r'^h, 

1 bus SAj controls die si/e of the spending respon.sc to a hypothetical perma¬ 
nent rise in income; it does not influence the lifetime MPO. 


Appendix B 


Details on the Estimating Equation 

I bis Appendix dei ives and explains the equation that wax actually estimated 
1 U'gin liy fepcaling equation (14) of the text: 

A„ -I- - d,) + 




m 

kX', - S,.,) + (1 - A)A7 + V 0 - 




I-oi put poses of 1 educing beteioscedasticity. the assumption was made that 
the standartf deviation of was proportional to so the whole equation was 
divided through by to get 


APd, - + kyi, k-i 

11 


V Tr* r 

j - q,-j) + (1 - \)x{ + Yd - D|)s|_^ 

J=0 L 


(Bl) 


+ tf. 


whert 


^t-j = (note: z, = 1 for all I), 

x) = xi/y„ 


-FT’ 

9f-j = h-j - kxj, 
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To estimate (Bl), the assumption was made that both wj and /3j follow 
third-degree polynomials in /: 


u'j = ri„ + a H- 
lij == h„ + h, I + h.(j^ + h.jf. 


(B2) 


The end-point constraints mentioned in the text {w„+, = I3„+, = 0) are thus 
a„ + in + l)rt, + (n + + (n -I- Ifo;, = 0, 

i,i + (>/ + l)/>i + (« + + (« + = 0. 

K.cpiations (B8) were used to eliminate the parameters Aq tmd />„. The 
adding-up constraint discussed (and rejected) in the text was thus 

il 

V {a,|; - (tt + 1)] -t- - (» + 1)*] -i- - (n + 

(B4) 

n 

= {/t,L; - (« -t Dl-t- - (» + 1)^] + - (ti + I)-']}. 

which was used to eliminate the parameter a,. 

Kmallv, in estimating (Bl). «, was assumed to lollow a lirsi-order autore¬ 
gressive scheme, (, - pe,-, + c,, where r, is while noise. Ksiimation was by 
nonlinear least sc|uares. which is et|uivalcnt to maximum likelihood if o is 
noniially distributed. The lunction actually minimi/.ed was 



with all the above-meiuionetl definitions and parameter restrictions stihsti- 
iiiled in 

Ihe esi imatmg forms when regiilai iiicomcwas turlber disaggregated were 
derived in precisely analogous ways from etpiations (16) and (18). 


Appendix C 

Calculation of the Real After-Tax Interest Rate 

The real al ler-tax inlcresi rale is dehned as r, = t,( 1 " r,) - rr,. where t, is the 
nominal interest rale, t, is the marginal tax rale, anti tt, is the expected rale of 
inffatioii. 

Sinninal interest rate .—Four different nominal interest rales were tried; a 
coiporaie bond rale, the 4-6 month commercial paper rate, the 8-monlh 
1 1 easury bill rate, and a weighted average of rates received on various time 
and savings accounts. While all four led to very similar results, the Treasury 
hill rate gave the best fit and .so was adopted. 

Marginal tax rate .—The marginal tax rate was created from the average tax 
rate in the following way Lcl T(y) be the tax function facing an individual 
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taxpayer, and lei f(y) be the frequency distribution of income. The only 
directly observable tax rate is the aggregate average rate, which is /f = 

VlT(y)f(y)dyyi^yf(y)dy'\- 

Now suppose the whole income distribution shifts to the right with no 
change in its shape: that is, it shifts from/(y) to/ly(I - A)], where A connotes a 
‘small” multiplicative shift. The average income is then Y(h) = />/Iy(l - 
A)]afy, so that, for small shifts (h near zero), dYIdh = - /yV’(y)d>. Similarly, 
average tax payments after the shift are T(h) = jT(y)f[y(\ - h)\dy, so that 
dTIdh - - lyT(y)f'(y)dy. The aggregate marginal tax rate, M, is the ratio of 
these: M = [jyT{y)f‘{,y)dy]l[iyj’(y)dy]. 

To evaluate these integrals and obtain a closed expression for M/A, 1 
adopted the following functional forms: 

T(,y) = ay* {h > I, a > 0), 
f(y) = (y > 0). 

With these assumptions, the two ratios of integrals work out to be ,4 = 
(ay^ly''^')V{h + 1), M = (ay ^+ 2), where r(n) is the “gamma 
function,” namely, r(n + 1) - nVin). Hence M/A = (b + l)/2, and using a lax 
elasticity of 6 = 1.6 gives M = (1.3)/f. 

The average lax rate was computed, quarter by quarter, by dividing the 
sum of federal and state-local personal tax and nontax payments by personal 
ituome excltiding transfers (an approximation to the lax base). These are all 
official national income accounts series. 

Expected rate of inflation. — InHationary expectations were generated by a 
model based on what has been called “economically rational" expectations.*'’’ 
The idea is that agents, in informing themselves, begin with the data that arc 
cheapest per unit of informational content and then proceed to process more 
costly data until the marginal cost and (expected) marginal benehis are 
equated, In this particular application, I assumed that consumers base their 
expeciattons of the inflation rate (P,) on its own past history and on the history 
of the growth rate of the money .supply (,Vf,). Thus 1 estimated an equation 

j / 

/’i=«o + 

J-t i-O 

on actual U.S. quarterly data, using the deflator for personal consumption 
expenditures for P, and M 2 for M,, and assumed that consumers used this 
equation to generate expectations. In estimation, I used the Almon (1965) lag 
technique with third-degree polynomials, no end-point con.strainus, and vari¬ 
ous choices lor j and /. 1 he best results were obtained with / = 11 and 7=17, 
namely*'* 

n 17 

P, = -.60 -H ^ajP,_j + 7f* = .73, D-W = 1.98, 

(.38) ' " 

staiidaid error = 1.42, mean of dependent variable = 3.35, 

la, = .67 lb, = .27. 

(.10) (.09) 


"See feige and Pearce (1976). 

" Standard errors are in parentheses. The D-W is the Durbin-Wat-son statistic. The 
^riod of estimation was 1951:3-1977:4, the longest period possible given the need for 
17 lagged values of ;Vf, o r r » 
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Consumer Search with 
Uncertain Product Quality 


John D. Hey 

I'nwfrsity of York 

Chris J. McKenna 

L’nnvnity C - nllrgf . ('.imhU 


I his papci presfius a unihecl model of consumer seaicti when both 
pi kes and qualities arc uncertain. It is assumed that price can be 
observed prior lo purchase but that qualil) can be observed only 
atier pun base and experience. I his two-stage deersion problem is of 
particular interest since, in practice, there is usually some connection 
belweeii quality and pike. Of the many interesting implications of 
ibis dcpeudcm.e explored in this paper is the result that the usual 
"reservation-price rule" may no longer be optimal; indeed, the op¬ 
timal stopping set may take one of several alternative (and more 
lealisiu) forms 


I. Introduction 

I'hc economics of search has made considerable progiess in recent 
years in describing and explaining the actions of economic agents 
operating in uncertain environments. Our concern in thi.s paper is 
with the application of this large body of knowledge to the consumer- 
search problem. In its simplest form this problem examines the be¬ 
havior (tf a consumer searching (at a constant cost per observation) 
over a known distribution of price.s for the lowest (economically 
sensible) price for the purchase of one unit of some gocjd. The 


We wduld like to thank an anonymous referee and George J. Stigler for some very 
helpf ul comments on two earlier drafts of this paper. We are also grateful to Louis L. 
Wilde for useful comments on the original draft. 

L/tmmrt/ <4 PoUlicai tronowiv. vol 89, no. Ij 

<£) 1981 by The Umversiiy <if Chitago 0022-3«0»/HI/890I-OOI2|01..'M) 
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solution, which is now well known, takes the form of a stopping rule 
uniquely determined by a “reservation price”; that is, the consumer 
behaves optimally by searching until a price lower than the reserva¬ 
tion price is obtained. 

In this simple form, and indeed in many of its more sophisticated 
generalizations, a crucial assumption is that search takes place over 
just one characteristic of the good—namely, its price. However, in 
practice, a good has several dimensions (of which price is but one); 
moreover, uncertainty surrounds not just the price dimension but 
these other dimensions as well. Of course, this observation would be 
trivial if information on these other aspects could be collected at the 
same time as information about price. However, in practice, some 
characteristics inevitably remain undiscovered until after purchase; 
thus, prior to purchase, residual uncertainty remains. 

I'he purpose of this paper is to explore the consumer-search prob¬ 
lem in this multidimensional form. In particular, we partition the set 
of chaiarteristics into those which can be observed prior to purchase 
and those which can be observed only after purchase. To keep our 
exposition simple, we consider the simplest of all possible formula¬ 
tions: a two-dimensional case in which price is the dimension which 
can be observed prior to purchase and^wa/t/y is the dimension observ¬ 
able only on purcha.se and experience. 

Our extension makes the consumer’s problem a two-stage one; The 
first stage relates to the price dimension—the consumer must decide 
what prices are “acceptable”; the second stage relates to the quality 
dimension—the consumer must decide what qualities are “accept¬ 
able.” What makes the problem particularly interesting is that, in 
practice, these two dimensions—price and quality—tend to be re¬ 
lated. This dependence means that the two stages of the consumer’s 
pKiblem are also related. Moreover, this dependence implies, as we 
shall show, that the optimal stopping rule at the first stage may well no 
longer be of the reservation-price form. For example, it may well be 
optimal for the consumer to search for expensive (rather than cheap) 
goods; alternatively, it could be optimal for the consumer to search 
lor goods in .some middle price range. Indeed, a variety of possible 
stopping rules may apply at the first stage; as we shall show, the form 
depends crucially on the form of the dependence between quality and 


This two-stage problem belongs to a class of similar problems now 
beginning to appear m the literature. A similar consumer-search 
pioblem has been developed independently by Wilde (1977). In many 
res^c .s hi.s model is similar to ours, though it prtKeeds at a somewhat 

Se Wilde’s work follows from an earlier paper 

( dt 1979) on the parallel problem in the job-search area (a pr^ 
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lem in which the wage is observed prior to accepting a job offer, and 
the nonpecuniary Job characteristics are observed after experiencing 
the job). This latter problem was studied independently, and in a 
slightly different form, by McKenna (1978). Work has also been 
carried out in this area by Lippman and McCall (in press). 

The papei is organized as follows; In the next section we introduce 
our notation and detail the assumptions of our model; the following 
.section finds the solution to the consumer’s two-stage problem and 
examines the pi opcrties of this solution; a concluding section reviews 
and appraises the key results. 


11. The Model 


We assume that the consutner-searchet is interested in two charac¬ 
teristics of the good—(luality and price. We denote by Q the tjuality of 
the good as mcasuied by the per period money value of the product’s 
<|ualit\; we assume that the value of (J for a particular good can be 
obsei ved only after purchase. The good yields (> for each period of its 
n-[)eriod lifetime; n is assumed to be the same for all goods. We 
denote In the price of the good as measured by the price per unit 
per [lenod of its liletitne (i.e.. the per period “user cost” of the good).’ 

1 he (onsiimei makes independent random selections from the 
joint distribution of P and (7; each drawing costs a (constant) amount r 
(> 0), The value of P is irnmc'diately revealed, while the value of (> is 
levealed only if put chase lakes place. We a.ssume that the searcher 
knows the joiiu distribution of P and (7 over all gocjds: We denote by 
gt ) the maiginal probability density function (pdf) of P, and by 
|(■\p) the ccmdiiioiial pdf of (7 given p\ we use f;( -) and F{ - ) p) to 
deiioie the coricsponding disiributioii iiinclions and permit P and (7 
to take values 011 tlie positive real line. 

I be consumer laces an infinite horizon, discounts all values at the 
rate p (0 p < 1), and wishes to have exactly one niiit of the good in 
each period. Given these assumptions, the lifetime discounted value 
to the consumer is 


whet e (7, and P, are the c]uality and price of the good being consumed 

in period t, and S, is the amount spent on search (possibly zero) in 
period /. r / / 

1 he individual’s objective is to find, and follow, the strategy which 


' Alien,ativcly the purchase pnce-payahle at the time c.l purchase- 
(wheic f>, see l)clow, i.s the discoum rate). 
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maximizes the expected value of the expression (1). The model is built 
around the following assumed sequence of events: 

1. The consumer searches, at a cost c per search, over 
values of P until an acceptable value of P is obtained. We 
assume that this search process is instantaneous and that no 
passage of time occurs.* Thus the consumer can obtain in¬ 
stantaneously n “quotes” at a total cost nc, whether the 
searching is done sequentially or not. 

2. The consumer then purchases one unit of the good. 

3. After purchase, the consumer experiences the good and 
thereby discovers the value of associated with the good. 
(We assume that all uncertainties are resolved after pur¬ 
chase.) 

4. Having discovered Q, the consumer decides whether to 
keep the good for the whole of its n-period lifetime or to 
dispose of the good (at zero cost) at some point within its 
lifetime. If the latter option is used, the individual returns to 
stage 1. 

5. If the good is kept until the end of its lifetime, the 
consumer must decide whether Q was sufficiently good for it 
to be repurchased. If so, the g<K>d is continually repurchased 
thereafter; otherwise, search continues again at stage 1, 

Ihe model is appropriate for regularly purchased nondurable 
goods for which the amount consumed per period is roughly con¬ 
stant, such as bread, cereals, eggs, washing powder, newspapers, and 
toilet paper. However, the purchase of some durable items such as 
televisions, cars, and washing machines may also be described in our 
framework.* 

III. The Solution 

The consumer’s problem is a two-stage one: The first stage relates to 
the price dimension, the .second stage to the quality dimension. Our 
solution similarly proceeds in two stages. 

We begin with the first stage (that described by 1 in the sequence of 
events above), namely, the stage relating to the price dimension. The 
tonsumer’s problem at this stage is to decide which price quotes are 

This IS crucial in view of our earlier assumption that the con.sumer requires exactly 
one unit of the good per period. In contrast. Wilde (1977) pustulates a time-consuming 
seauh process and assumes that the consumer derives no utility in periods when he 
di^s not have a unit of the good and derives utility V(,P, Q) in other periods. 

•' The repurchase of durable gcxids, however, may 1^ made after a much longer 
period of consumption. For our model to be valid in describing purchase decisions of 
this type requires that we assume unchanging market conditions over long pcriixls. 
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.alco pl« »,“!■ orr’ii 


,,nJd) .vW prut qi‘oi must decide on the 

,i,„iild < /« ” ""'■ ■ ,,„rf ,/,/s o/ioW *•/, »-? Hm 

if/’ e stdo searching and purchase good alP. 


if /’ ^ continue seart hing. 

(lere Z’ denotes the latest price tpiote obtained. We wish to find the 
best such set, which we denote by in the sense that it maximizes 
the expected value of the objective Itinction (I). To find this y»* we 
pioteed as follows. Considei the consumer starting from a position in 
which no acceptable ptice has yet been obtained; suppose that the 
consunici uses the at bin at y acceptance set with respect to the first 
[)utc liase dec isioti but tliereafter behaves optimally with respect to all 
future decisions (including those relating to any future return to stage 
I). Let < (('>") denote tfie expected value of the objective function 
following this sequence of events. Cilearly by definition, is a set for 
wtiic h attains its maxitnum value; if we denote this latter by F*, 
we ftave 


F*^F(.y>) foi all .i^£ 2’'\ 

I * ^ 

We will clemotisttate that .-if* need not be determinecf by a uni(|ue 
reservation price (i.e., .If* need not be of the foi m {P,P ^ P*})- This 
is because the direc t attraction of low prices may be more than of fset 
by the indirect detrac tion of associated low cpialities. In mathematical 
terminology, the return function from accepting P need not lx* 
monofonically decreasing in P Ix-cause of the relationship fxttween Q 
aned P in the joint distribution. We pursue this formally below. 

We now turn to the second stage (that described by 4 and 5 in the 
sequence of events above), namely, the stage relating to the quality 
dimen.sion. 7he consumer's problem at this stage is to decide what 
c]ualities ate acceptable, Iwith in terms of keeping the gcKid until the 
end of its lifetime and in terms of repurchasing that good at the end 
of its lifetime. In contrast with the decision at stage I, the.se acceptance 
decisions do have the reservation property—the higher the Q, the 
better off the individual.'* 


* In contrast with ihc hist stage where a higher F may make the individual better or 
worse off. For a formal demonstration of the textual assertion see Hey (1979) and/or 
Theorem 1 of Wilde (1979). 
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We wish to determine the optimal reservation qualities; to find 
these we first begin with arbitrary reservation levels defined as fol¬ 
lows. Let denote an arbitrary reservation quality used at the end of 

period i following the purchase of the good (i = I - ,n - I).‘‘ Using 

these arbitrary values, the decision at stage 4 is as follows: 

ifQ < q,, dispose of the good at the end of period i (if not 
already disposed of) and return to suge 1; 
if Q ^ qi, keep the good (at least until the end of period 
2 + 1 ). 

Here Q denotes the quality of the good purchased. 

f urther, let q„ denote an arbitrary reservation quality used at the 
end of period n (at the end of the life of the good). Using this 
arbitrary value, the deei.sion at stage 5 is as follows: 


ifQ<7«. do not repurchase the g<K)d; instead return to 
stage 1 ; 

if (2 3= q„, continue repurchasing that good forever. 

! Although not essential, particularly for arbitrary q,, it would seem 
[ appropriate to assume that </( «£ (i = 1 .n - 1 ). 

1 o find the consumer’s optimal strategy, we need to find the best set 
oi'qi (t = I, ... ,n), which we denote by (i = 1 , . . . , n), in the sense 
that this set maximizes the expected value of the objective function 
(I). I'o find these qf we proceed as follows. Consider the consumer 
starting from a position in which a purchase at price P has just been 
made. .Suppose that the consumer uses arbitrary reservation qualities 

q, U - 1 .n) with respect to this purchase but thereafter behaves 

optimally with resjsect to all future decisions (including those relating 
to any future returns to stages 4 and 5 ). Let W{P; q,, . . . , q„) denote 
the expected value of the objective function following this sequence of 

events. Clearly, qf (i = 1, . . . , n) is a set for which W'(P; 9 ,. q„) 

attains its maximum value (for given P); if we denote this latter by 
W*(P). we have 


I 


W*(P) S W(P; q, . q„) for all 9 , £ = 1.«). 

(b) 

W*(P) = WiP: qf, - qt). 

This completes our notational preliminaries, and we ran now proceed 
to an investigation of and the qf. 

If the stopping set 9 is used at stage 1, we have 

^( 00 ) = -, + _ ^^g(p)dp] + i,W*(p)g(p)dp. ( 7 ) 


We exclude 1 - 0 on gmunds of realism; its inclusion would not materiallv affect 
any of the results. 
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This ecjiuition stales that, starting from a position in which no accept¬ 
able price (|uote has been found, the individual searches once more at 
cost r and either receives an unacceptable quote (returning to “square 
one") oi receives an acceptable quote with expected value of lT*(/>).® 
Kquation (7) gives; 

!'(,/) = l-r + i^W*(p)gi,p)dp\ll^g(p)dp. (8) 

Nou, it can easily be shown (Hey 1979) that an optimal set is given by 
= {p; H'*(P) & r*}, where c = l^[W*(p) - V*]g(p)dp. (9) 

'] he fn St of these simpiv states that a price quote should be act epted if 
the expet ted value (of the objet live function) obtainable by accepting 
it is at least as large as that obtainable by rejecting it. The value of F* is 
determined by (9); we assume that this is positive (otherwise it is not 
optimal lot the consumer to initiate search for the good). 

Kxaminalion of (91 shows that the form of the set ."/** depends 
ci uciallv on the behavior of the function W’*. The familiar case is 
when 11'* IS a iiioiiotoincally decreasing function (i.e.. when highei 
price.s unambiguously make the consumer worse off); in this ca.sc, as 
(<in be seen fiom (9), the set y/‘* is given by {P, P ^ /’*}, where/-’* is 
the familiar reservation pt ice. However, in our model there is nothing 
to guatanlee that (I'•* is monotonically decreasing; indeed, IT* may 
take a variety of dif ferent forms, depending on the relationship be¬ 
tween quality and price in the joint distribution. In the extreme, it is 
possilile (although unlikely) that quality increases so last with price 
that the fiiiuiioii IT* is monotonicall) increasing; in such a ca.se the 
acceptance set ./** would take the form {P-. P 3= P*} wheie P* is a 
low'd lesetcation [nice—a buv-expensive [lolicy would be optimal, 
riie geiidal case (one ui which H * has lioih iiictc*asing ;ind decreas¬ 
ing segments) is illusliated in figure 1. 

.'ks the form of the function H'* is crucial to the consumer’s optimal 
liehavioi. we now pioceed to an investigation of its properties. From 
our story of consumer behavior, we have- 


. . r/„) = 


-ptl - p")/(I - p) + 


/: 


ql(q\p)dq 


( 10 ) 


^ lA 


\p)'iR qf{q\p)dq 


yiv l/'W? + P’J \(q-p)K\ - p)M(q\p)dq 


(where = ^^)‘ 


* N(ite thdi ihf searth protest is a.ssujTK'd to instantanrous and thus the experted 
values following the search are not discounted; see also n 2 above. Note further that if 
^ then (7) yields ~ -r + y{4>), which (il r > 0) implies that y{<^) is minus 
infinity. 
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Kic. 1.—Derivaiioii «( ihe sei 3** in the neiicral case 


'I'he optimal reservation qualities at the end of periods 1 to (ri — 1 ) 
—the qf {t = I, . . . , n — 1 )—are found by setting dW/dqi = 0 . It is 
easy to show that this implies that 

qf = y*(l - p) (/ = 1, . . . . » - 1). (11) 

Two ihings of inierest flow from this: First, q,* = . . . = q*-i, which 
means that if the good’s cjuality is sufficiently high for it not to be 
; thrown away at the end of the first period of its lifetime, it should l)e 
kept for the remainder of its lifetime. (This result is a conset|uente of 
out assumed constancy of the per period quality rate; if, in contrast, 
cjuahty were to decline as the good got older, this strong result would 
no longer hold.) Second, (fi = ('*(1 — p), which means that a good 
should be thrown tiway (before the end of its useful life) if and only if 
its quality is lowei than the per period value obtainable from starling 
■seaicli afresh. 

I'he optimal leservaiion (]uality at the end of the gtMtd's lifetime is 
found by setting dW/dq„ = 0 ; this implies that 

q* = F*(l - p) + p, (12) 

which states that the consumer should rrpureftase the g(M>d if and trnlv 
if its quality net of price is at least as large as lht‘ expected \alue 
obtainable from starting search afresh. Both this, anef the earlier 
result (II), accord with intuition. 

If we now substitute (1 1 ) and ( 12 ) into ( 10 ). we get an expression for 
If *; after some simplification, this yields 

W*(P) = pV* - \p(\ -p")/(l -p)]+ pll -F(q\p)\d<i 
+ [p(l - p"-‘)/(l -p)lf (I - F(qjp)]r/q 
+ lp"/(I - p)] f [I - F(qfp)],/q 


( 13 ) 
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(where, in vk'w of ihe equality of the fori = 1. n~ 1,we simply 

work with t/f and t/ti)- 

Equations (9), (11), (12), and (13) solve for F*, and 

in terms ol't.p, »,/{■ [ ■). and/rf-)'. they provide the complete solution 
to our model. 

Tlie form of (13) can now be investigated; from it and (12) we have 

r dF(q\p) , 

VV*'(/t) = -(1 -p")/(l - P)~J^ -^ 

- [p"/(I - p)][l - Ip)]. 

Inspection of this sh(»ws that IV*' is unambiguously negative if 
()f{q \p)ldp is positive everywhere; however, this latter condition 
implies that /• (• | p) is stochastically decreasing in/)—or, in economic 
terms, that (|uality is stochastically decreasing in price. This is highly 
unlikel) tti practice. A more realistic case is to assume that quality 
generally tticteases with pi ice so that F(- \ p) is stochastically increas¬ 
ing in p (i.e., F{( \p\) ^ F{t | p-i) for all t whenever p, < see Hadar 
and Russell [1969]). However, this restriction is not sufficient to sign 
H *'. On reflection, this is not a surprising result; one can imagine 
cases where quality increases very strongly with price (so that W*' is 
[RLsitive) and, also, t ase.s where quality increases very slowly with price 
(.so that W*' is negative—the direct price effect outweighing the 
indirect tjuality effect). In general, it is possible that W*' has Ixith 
positive and negative .sections, in which case the stopping rule at stage 
1 would be of a form similar to that shown in figure 1. 

Wilde (1977, 1979) also discusses similar possibilities (.see, in par- 
tit ulat, VS'ilde | 1979J, n. II) Moreover, because his model prtMjecds at 
a rather higher level of generality than our own (namely, his objective 
funt lion is not simply linear in tjuality and price), he finds restrictitins 
(sufficient to sign his equivalent tif our IV*') on the joint distribution 
virtually imptjssible to obtain. Instead, he simply assumes that the 
distribution is such that tfie return function (IV*) is monotonically 
decreasing. 

However, some fur ther insight can be gained as follows.’ Denote by 
Qj, the (random) quality associated with the price p and denote the 

' Although <itir model assumes ihat “utility” is linear in price and quality, our result' 
would still go through if utility was a separable function of price and quality if the 
distributitm.s /!• | i and gt') were reituerpreled appropriately. 
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distribution function of Qj, by Fp(‘) ■ F(- \ p). Suppose that the 
conditional distributions are such that 

Fp(q) = Fo(q - kp). (15) 

that is, Qp is a translation of Qo- (Thus the distribution of Qp is the 
same as that of Q,o shifted to the right by a distance kp.) It is a simple 
matter to show that 

dF(g\p)/dp = -k/(q\p). (16) 

If this is substituted into (14), we find, after some simplification, that 
lV*'(p) =(k- 1)[1 - p''F{q*\p)]/{\ - p) 

-*p(l -p"-)F(9f|p)/(l - p). 

There is an important special case in which the second term in (17) 
disappears; this is when Fiqflp) = 0, or, in economic terms, when 
there is no possibility of finding a quality so bad that it pays to throw 
the good away before the end of its lifetime." In such a case, equation 
(17) implies that 

W*'{p) § 0 according as A § 1. (18) 

This result makes good sense: It states that the return function is an 
increasing (decreasing) function of price if q - p (the quality net of 
price) is an increasing (decreasing) function of price. Moreover, if^ - 
p is independent of price (that is, if A = 1), W* is horizontal (at the 
level r*), and the consumer simply accepts the first price quote re¬ 
ceived whatever its value. 

The general case is when F{q*, | p) > 0, that is, when there exists a 
positive probability of getting a good whose quality is so low that it 
pays to throw it away and start search afresh. In this general case, the 
simple result (18) no longer holds. However, as inspection of (17) 
shows, if A is less than or equal to one, then fV*' is strictly negative. 
Thus, if "net quality” (q — p) decreases with price, the return function 
is monotonically decreasing and a buy-cheap policy remains optimal. 
However, A larger than 1 is not now sufficient for W'*' to be positive, 
because the (nonzero) possibility of getting a “lemon" destroys the 
generally greater attractiveness of higher-price gtx>ds.® However, by 
differentiating (17) (and using [16]) it can be shown that A greater 
than 1 is a sufficient condition for IV*" to be positive. Furthermore, 
from (17) it can be seen that lim^o„ IV*'(^) = (A - I)/(l - p) > OifA > 

"This e'fetuvely is the case considered by Wilde (1977). 

“ A sulhcient condition for W*' to be positive is that k is at lea.sl as large as (1 - p")/( I 

p), so that net quality increases sufficiently fast with price to outweigh the nonzero 
lemon pos.sibility. 
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Fi(. —l)*-iivaliim <)l when (15) holds with k > 1 


I ,"' These results imply that if k is greater than 1 (i.e., if net quality 
lilt t eases with price), then H'*' is monotonically inc reasing and that 
even if it is negative at lower values of p. it must beettme positive at 
some higher value. Thus, if k is greater than 1, one of the two 
possibilities pictured in figure 2 must exist. 

T,a,se h in figure 2 is paitkularly interesting, implying that the 
consumer should search until either a very low, or a very high, price 
ciuote is obtained. Ooods at intermediate prices should not be pur¬ 
chased, This result follows from the combination of net quality in¬ 
creasing with price and the lemon possibility: High prices are at¬ 
tractive because the associated high qualities are a sufficiently great 
incentive to outweigh the po.ssible lemon effect; low prices are attrac¬ 
tive despite the associated low' qualities because of the small loss in¬ 
volved in throwing away a lemon which cost little in the first place; 
intermediate prices are not attractive because they fall between 
these two stools. 

The results discussed in the paragraphs above are, of course, 
specific to the cc^nditional distribution formulation given by (15). 
However, these results do yield some insight into the general con¬ 
siderations underlying the form of the function W* - clearly important 
is the (stochastic) relationship between net quality (c/ — p) and price. 
Alternative formulations tan easily fie envisaged. For example, imag¬ 
ine that the conditional distribution of net ({uaiity increases at an in¬ 
creasing rate with price up to some intermediate price and thereaf¬ 
ter increases ai a detrea.sing rate. In such a case. W* may be an 
increasing function for low^ and a decreasing function for high p; the 
price-acceptance .set would then consist t>f intermediate prices, both 
very low and very high priies being rejected in favor of further 
sea rch. 

Clearly, a whole variety of pos.sibilities exist: Depending on the 

"’Since A^ty), as ciefinccl in (15), approacfies 7.ero as p approaches infinity for any 
finite y. 
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conditional distribution of quality given price, buy cheap, buy expen¬ 
sive, buy medium, or various combinations of these are all plausible 
buying strategies. I hus "gaps” in the stopping set can be quite 
consistent with optimal behavior. 

Before concluding we note the following comparative static results. 
They can be derived from equations (9), (11), (12), and (13); the 
detailed proofs can be found in Hey and McKenna (1979). 

1. Each of the following makes the consumer-searcher better off in 
that y* and lV*(p) increase: (i) a decrease in the search cost c, (ii) a 
decrease in the lifetime of the good n, (iii) a rightward shift of the 
conditional distributions of (Jj,, (iv) a rightward shift of the marginal 
distributions of P if W* is a monotonically increasing function, (v) a 
leftward shift of the marginal distribution of P if W* is a monotoni¬ 
cally decreasing function, (vi) an increase in risk (in the Rothschild 
md Stiglitz [1970] sense) in the conditional distributions of Qj,. Con- 
iequently the consumer becomes more choosy in that qf and q* in¬ 
crease and the set i?* gets smaller. 

2. The expressions 1^*(1 - p) and H'*(/>)(1 - p) increase if p 
increases, that is. if the consumer places greater weight on the future; 
'onsequently, the consumer gets more choosy in the present in that qf 
md q% inctea.se and the set gets smaller. 

All ofthe.se results accord with intuition, with the pos.sible exception 
if that I elating to n. However, when it is remembered that P is the 
price per period of the gtxid’s lifetime, it does make sense to conclude 
that a smaller n is better, in that it reduces the time spent “locked in” 
to an un.satisfaciory good. 


IV. Summary and Conclusions 

1 his paper has presented a unified model of consumer search with 
both puce and quality uncertainty. Because price is revealed prior to 
purchase while quality is not revealed until after purc hase, the con¬ 
sumer’s problem is essentially a two-stage problem. The first stage 
relates to the price and involves t hoosing an acceptance set for prices. 
I he second stage relates to the quality, and involves deciding whether 
a particular quality is sufficiently good («) to retain it after purchase 
and (f>) to continue buying it thereafter. The problem is made in¬ 
teresting and nontrivial by the dependence of quality on price. 

1 he major result is the finding that the price-acceptance .set is not 
necessarily connected and that the conventional reservation price rule 
IS optimal only in some circumstances. In the general case, depending 
on the stochastic relationship between quality and price, buy cheap, 

buy ex^nsive, buy medium, or various combinations of these are all 
plausible buying strategies. 
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In the form presented in the paper, the consumer’s utility is as¬ 
sumed to be linear in price and quality. However, all our results still 
hold if utility is a separable function of price and quality, as long as the 
distributions of price and quality are reinterpreted appropriately (as 
distributions of the utility of price and the utility of quality). Of 
course, in such a generalization optimal behavior would vary from 
consumer to consumer, depending on the differences between their 
utility functions. It would be of interest to explore the implications for 
price/quality equilibria in markets resulting from such behavior. It is 
to he hoped that this paper has laid a firm foundation for such an 
exploration. 
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Starting from an initial (baseline) budget line, income-compensated 
price changes always resulted in substitution cHecls consistent with 
the Slutsky-Hicks theory. This Itehavior cannot be explained by a 
simple random^behavior model. Similar changes in relative prices 
dial did not originate from the initial (baseline) budget line resulted 
in undersubstitution elfects ; I he coin|Ki.sition ol consumption 
changed in the ex|w ted direction, but the magnitude of change was 
not large enough to lie consistent with the initial commodity bundle 
chosen. These undersubstituiion effects arc not explainable by 
shifting preference patterns or anchoring effects found in inconsis¬ 
tent choice sequences with human subjects. 


1 his paper reports tests of the negativity of the Slutsky-substitution 
effect in commodity-choice experiments with pigeons as subjects. 
Starting from a given (baseline) budget line, income-compensated 
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increases or decreases in relative commodity prices always resulted in 
substitution effects consistent with the SIutsky-Hicks theory. These 
results, which replicate earlier studies using rats as subjects (Kagel et 
al. 1975, 1980), are not explainable by simple random-behavior mod¬ 
els proposed in the literature (Becker 1962; Chant 1963). 

Similar changes in relative prices that did not originate from the 
baseline resulted in “undersubstitution effects"; that is, the composi¬ 
tion (jf consumption tended to change in the expected direction, but 
the changes were not large enough to be consistent with the original 
(baseline) commodity bundle consumed. These undersubstitution ef¬ 
fects ai e not explainable in terms of shifting preference patterns or 
anchoring effects ( I vcrskv and Kahneman 1974) found in inconsis¬ 
tent choice sequetices with humans as subjects. Further experiments 
will be required to identify the basis for this behavior. 

Data on learning patterns as subjects adjust to changes in experi- 
metital conditiotis ate also reported. Questions of interest here con¬ 
cern identifyitig uniformities in both the speed of the adjustment 
pi oress and its pattern, whether it can l>e characterized by concepts of 
habtt fottnation or stock-adjustment pitnesses (Houthakker and 
f avior 1970). rhe nature ctf the learning process is, of course, strictly 
an cmpiiica! matter in terms of the static .SIutsky-Hicks theory. 

Space considerations do not permit a detailed discussion of the 
rationale beliiiid using nonhumaii subjects to investigate ecotiomic 
theories (see Kagel and Batlalio fl980] for this argument). For the 
more skeptical leader we note that if one defines econcrmics as “. . . the 
study of the allocation of scarce' resources among unlimited and 
competing uses" (Rees 1968). theti animal psychologists, ecologists, 
and biologists have been involved in studying economic behavior for 
sometime now (Rapport and riirner 1977; Hirshleifet 1978; Rachlin 
1980). It IS but a small step to take the technologies of these related 
disciplines and apply them to behavior of interest to economists, for 
example, commodity-choice bc'havior. At a minimum, such studies 
expand consideraftly the scope for comparative economic analysis. At 
a maximum they provide a lalxiratory ferr identifying, testing, and 
better understanding general laws of economic behavior. Use of this 
laboratory is predicated on the fact that behavior as well as structure 
vary continuously across specie.s, and that principles of economic 
behavior would be unique among behavioral principles if they did not 
apply, with some variation, of course, to the behavior of nonhumans. 

The plan of the paper is as fcrllows; Section I describes the hypothe¬ 
ses tested and the prexedures u.sed to operationalize the budget con¬ 
straint. Sections II and III report the two sets of experiments con¬ 
ducted; Both comparative static and temporal patterns of behavior 
are examined. We end with a discussion which summarizes the test 
outcomes and their implications for future re.search and returns to 



COMMODITY-CHOICE BEHAVIOR 69 

the question of the relevance of the experiments for understanding 
human economic activity. 


I. Hypotheses Tested and Experimental Procedures Employed 

A necessary condition that all members of the class of Slutsky-Hicks 
demand functions must satisfy is that for income-compensated price 
changes (of the Slutsky type) the consumer purchase a commodity 
bundle that was previously unattainable. More precisely, 

if/)' 7 ^ and /r'x' = then (p^ — p*){x‘ — x') < 0 , ( 1 ) 

where /?' and x' represent vectors of prices and quantities of goods 
purchased in period i. Condition (1), also referred to as the weak 
axiom of revealed preference, provides the basis for a direct consis¬ 
tency test of the theory (Hicks 19.56). 

Condition (1) was tested by having f<K)d- and water-deprived pi¬ 
geons choose between the consumption of these two commodities. 
I’he subjects were placed in an experimental chamber for a fixed 
period of time each day. Each chamber contained a ftxid hopper and 
water reservoir located on the front panel. Access to the food and 
water was controlled by separate variable time schedules. Under a 
varial)le time schedule commodities become available for a fixed 
period of time every <x seconds on the average, where a is determined 
as pan of the experimental design. Only one of the variable time 
sdiedules was in operation at a given time, and subjects could switch 
between the food and water schedule by a single peck on a control 
kev. Pecks on this key were not required to continue food or water 
deliveries, but only to change which commodity would be delivered, 
tailor-coded lights were used to signal whether the food or water 
schedule was in effect. The price of each commodiiv was varied bv 
altering the average time between deliveries of that commodity (access 
times remaining constant), while the income constraint was the total 
time available for the delivery of the two gtxids. 

Under these procedures time spent under the food and water 
schedules {Tf, T produces commodities at an average rate of 



Ch = -Th, 
On 


( 2 ) 


where ay, a„ are the average values of the variable time schedules, 
and C,y and f'l,. are food and water consumption measured in number 
of payoffs. Since time required to switch between schedules is 
sufficiently small that it can be ignored for all practical purposes, 
substituting ( 2 ) into the total time {Tj-) constraint 


Tt— T}/y + Tf- 


( 3 ) 
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gives an expected value for the budget constraint of 

atCi-' + oc\f/Cw = Tt- (4) 

From (4) it is clear that increasing (decreasing) the average time 
interval between deliveries of a commodity has the same impact as a 
price iru tease (decrease) in the usual specification of the budget 
constraint. Analogously, increasing (decreasing) the total time availa¬ 
ble for delivery of the goods increases (decreases) nominal income 
(see Becker 11971] for the comparative statics of this problem). 

lb establish conditions for testing the negativity of the Slutsky- 
siibstitiition effect, consumption patterns were first recorded under a 
set of initial (baseline) price (a)-, aU ) and income {T\) conditions. The 
price of one of the commodities was then increased, accompanied by a 
corresponding decrease in the price of the other commodity so as to 
achieve, as nearly as possible, perfect compensation 

otfC) + aicf-M = ai-Cji- -h (5) 

where the superscripts refer to the initial (i) and price-change period 
(/). Since cammodity consumption varied somewhat from day to day 
undei cotistani expeiimenlal conditions, the median bundle con¬ 
sumed fluting the last 5 days of a condition served as the reference 
point loi (oinpensation. Total time available for consumption re¬ 
mained (onsiant across experimental conditions, as did the amounts 
o( food and water delivered per payoff. 

Consrstent choice behavior (a negative Slulsky-substitution effect) 
requires that 

n - < 0 . (6) 

That is, with liie compensated change in relative prices the consumer 
purchases a commodity bundle in the price-change period that it 
c ould not afloid to purchase during baseline. Since, as already noted, 
consumption was not the .same from day to day under constant ex¬ 
perimental conditions, t-tests applied to the mean value of (6) calcu¬ 
lated over the last 10 days of the price-change period were u.sed to 
determine consistency. 

In cases where (6) is negative, a .simple random-behavior prcxress 
could explain the results (Becker 1962; Chant 1963), For example, if 
the pigeons randorniy choose the amount of time sjjent receiving each 
of the commodities, and total time remains constant across experi¬ 
mental conditions, then reducing the lime between food deliveries, 
which corresponds tc.t a reduction of food costs, will result in in¬ 
creased food consumption. Correspondingly, increasing the time 
between water payoffs, which increa.ses its price, would result in a 
decrease in average water consumption. Under random behavior the 
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expected value of time spent receiving each of the commodities 
would be 


^ ^ k=F,W 
Tr 0 « <7* « r 


(7) 


where is constant across exfierimental conditions.' 

Chant has characterized consumers who behave in this fashion as 
“money deciders.” In the case where = .5, the pigeons would be 
spending an equal amount of time collecting each of the commodities. 
In Chant’s terminology they would be “impulsive money deciders.” 
For the more general case, where the pigeons spend a constant but 
unequal proportion of time collecting each of the commodities, they 
would be characterized as “proportionately inert money deciders.” In 
cases where (6) is negative, we will test whether behavior is 
significantly different from (7) using /-tests for paired comparisons 
and appropriate analysis-of-variance procedures for comparing across 
several conditions. Rejection of (7) suggests that something other than 
a simple random behavioral process is guiding choices. 


II. Experiment I 

Procedures 


Four male White Carneaux pigeons with no previous ex|>erimenul 
history served as subjects (S’s). Subjects were completely dependent 
on food atid water consumed during experimental .sessions with no 
access to these or any other commodities in their home cages. Ex¬ 
perimental sessions were run once a day 7 days a week starting at 
approximately the same time each day. Consequently, subjects' body 
weight varied somewhat between conditions depending on consump¬ 
tion patterns. 

All S s had extensive training with the procedures prior to the start 
of the experiment. Experimental conditions were changed when in¬ 
spection of graphs of the data indicated an absence of any significant 
drift in consumption patterns, although at times conditions were 
maintained for considerably longer periods simply to see what would 
happen. 


Food payof fs consisted of 8 seconds access to a food hopper con¬ 
taining mixed pigeon grains. Fluid payoffs consisted of 3 seconds 


at cess to a reservoir containing tap water. Access to the food hopper 

Negative values for (6) rule out the cases of "inert” or "impulsive goods deciders" 
1963). Of course, behavior in accordance with (7) is observation- 
^BL'ker"l'^)'*™° ' "•"“vaied by a Casbb-Douglas uuliiy function 
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and Hater reservoir was blocked at all other times. Time spent eating 
and drinking did not count against total time available for delivery of 
commodities which was 60 minutes for all S's/ 

For thice of the four S’s, food and water were delivered on the 
avei age at 60-second intervals during the initial (baseline) period. 
This was followed by an increase in the relative price of food. Baseline 
condiliotis were then reinstated, followed by a decrease in relative 
food prices, which was followed by a return to baseline conditions. 
(Fable 1 shows the sequence and magnitude of relative price 
changes.) To account for any shifts in preference structure, the me¬ 
dian commodity bundle taken in the last 5 days of each baseline 
served as the refeience point for the Slutsky-compensated price 
change of (he following period. A fourth subject, S 85, started with 
food payof fs coming tw ice as often as water deliveries (relative price 
of food e<)ual to .5), followed by a Slutsky-compensated increase iti the 
price of food. Baseline conditions faced by the other birds were 
iinplemented in period 3 for this subject. Since the mean commodity 
bundle purchased during period 3 could very nearly have been 
botighi during the previous period, we compare these two points also. 


('.omparatn'f Slatu He-Uilts 

1 he ellects of the .Slutsky-compensated price changes are reported in 
table I. The results of the increase in food price relative to baseline 
rondiitons are reported between conditions 1 and 2 Ibr.S's 14, 46, and 
50 and between conditions 2 and 3 for 5 85. In all casc.s the weak- 
axiom test statistic (6) has the correct sign and is statistically significant 
at the .05 level for three of the four ,S’s. Price elasticities for water 
ranged from a low of -.07 for S 50 to a high of —.81 for 5 14, 
averaging - .38 across all four birds. I'hese price elasticities are a bit 
larger than the highest values reported for food-fluid experiments 
with rats (Kagel el al. 1980), The incrca.se in the relative price of food 
lot .S' 85 between periods 1 and 2 al.scr rc-sulted in substitution effects 
ill the expected diieciion. 

Decreases in the relative price of loot! for .S’s 14, 46, and 50 in 
period 4 resulted m a negative (correct) sign for the weak-axiom test 
statistic, which was significant at the .0.5 level or better in all cases. 
Price elasticities here ranged from a low of - . 10 for .S 50 to a high of 
-.24 for 5' 14, averaging -.16 across S's. 

Rejilication of baseline conditions following the price-change 

‘ Initial values were dclermined s<»ihat .S'scould muiiilain healthy body weights from 
forxl and water consumed within the sessions. Values were determined on the ba.sis ol 
past experiments and experience during the training periods. Additional procedural 
details may be found in the Ap[>endix. 



TABLE I 

Results of Consistency Tests: Experiment 1 


E-Test for 
Equality of 





Mean Payoffs^ 

Baselines 

Subject and 


Weak-Axiom 


(Sm) 

(Marginal 

Experimfntal 


Test 



Sigtiibcance 




Condition 

OLyldw 

((-Statisuc)* 

Food 

Water 

Level) 

Subject 14. 






1 

1 0 

-636 

46.4 

13.6 


2 

2.88 

(2.44)** 

(1.54) 
39 5 

(1.54) 

31.1 





(2.20) 

(6.55) 


3 

1.0 

-312 

37 1 

23 3 

3.36** 


28 

(3 26)*** 

(3.05) 

(2.92) 

(.05) 

4 

48.7 

16.5 





(221) 

( 82) 


5 

1.0 


42 4 

18.2 





(1 81) 

(1.16) 


Subject 46 






1 

1.0 

-498 

34 2 

2.5.8 ■ 



2.86 

(2.74)** 

(1.47) 

(1 47) 


2 

32.8 

35.5 





(1.63) 

(4,65) 


3 

1.0 


37 2 

22.9 

94 



-288 

(1.58) 

(1 57) 

(.40) 

4 

.22 

(2 79)** 

43.7 

21 1 





(2.08) 

(.41) 


:> 

1.0 


36 0 

23 4 





(1.61) 

(1 64) 


Subject 50: 






1 

1.0 

-48 

41.1 

19 1 

! 


4.0 

( 30) 

(1.62) 

(1.55) 


2 

40.0 

20 8 


3 



( 98) 

(3 66) 


1.0 

- 168 

42,6 

17 7 

1 2.39 

4 

.32 

(2 43)** 

(.82) 

47,2 

(.92) 

15.6 

(.1 1) 




(1.58) 

(.50) 


5 

1 0 


44 7 

160 





(1.04) 

(.94) 


Sub)e(T 85: 






I 

5 

-5f>l 

33.5 

33 4 


2 

4.5 

(1.87)* 

(1.59) 
31 3 

(.82) 

42.0 

N.A 

S 


-798 

(1.33) 

(4,74) 

1.0 

(3.88)*** 

31.4 

28.2 





(3 28) 

(3,31) 


--- 
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TABLE 2 

Tkm ok SiMPLt Random-Behavior Model 


Subjfii and 
(’.rouping* 


Inrome 

Spent 

on Food F^xpeiimental 
(Vf) Condition 


Relalivc 
Price 
of Food 


E-Test for Equal 
Percentage Hypothesis 
(Marginal Significance Level) 


Sub|t‘(l 14 

A 

79 

A 

.77 

A, B 

71 

B 

.62 

C. 

.46 

Sublet 1 46 
■A 

.73 

B 

62 

B 

.60 

B 

.'•.7 

c: 

32 

Sublet 1 .5()- 
A 

89 

B 

75 

B 

71 

B 

69 

C 

50 

Sub|t'il 8,5 
.A 

,78 

B 

52 

c 

34 


1 

f> 

3 

4 

2 
3 

5 
1 
d 


2.88 

1.00 

1.00 

LOO 

.28 

2.86 
1.00 
LOO 
I 00 
22 

4.00 
LOO 
I 00 
1.00 
.32 




I 


4.50 

1.00 

50 


I3.4t 

(. 001 ) 


S2 7 
(. 001 ) 


55.4 

( 001 ) 


.34 8 
(. 001 ) 


• < (tfidniDHs NsHh ih<r oupun^ Jriiri luvc fhc vinir |>rrcri>( of tot.)J cievotrd to (mxl ^rtordiOK 

((i iJtiiu.tn's (itufciplc-rari^r nr « 05) 

■f /■ <'X£lu< 0 n){ p<‘iKwi I )4 7 


IJcriods shtmfd toii.'.uiTiptioii patlcrn.s to be reversible, ai least di¬ 
rectionally, in all ca,se.s. Preferences were sufficiently stable that for i ’s 
46 and 50 there were no significant differences in consumption pat¬ 
terns between baseline periods. While consumption forS 14 in period 
1 is found to differ significantly from that of periods 3 and 5 (at the 
.0.5 level using Duncan’s multiple-range lest [Winer 1971J), there are 
no differences between periods 3 and 5. We note that the ratio of 
food to water consumption for5 14 in peritxl 1 was the highe.st value 
found across all S’s in both experiments under baseline prices. 

Table 2 reports tests of the simple random-behavior model (7). In 
all ca.ses T-tests show that we can decisively reject the hypothesis that 
the percentage of total income spent on commodities was the same 
across differing experimental conditions. Using the letters/4, B, and C 
to indicate experimental conditions that can be grouped together as 
having the same percentage of income spent on food (at the .05 
significance level according to a Duncan multiple-range test), we see 
that the price-change periods are responsible for the differences in 
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each case. For S’s 46 and 50 the percent of total income spent on food 
remained constant across baseline periods (conditions 1, 3, and 5) but 
increased significantly in period 2 when relative food prices increased 
and dropped significantly in period 4 when food prices dropped. 
Subjects 14 and 85 show similar response patterns with the exception, 
already noted, that consumption patterns differed between baseline 
jjeriods for S 14. In all cases demand is far too inelastic for consistency 
to be attributed to the fact thatS's spent a constant proportion of their 
time collecting each of the commodities. These pigeons cannot be 
characterized as either “impulsive” or “proportionately inert” money 
deciders. 


Adjustment Patterns 

The comparative static analysis tells us nothing about the learning 
path as subjects adjust to changes in relative prices. To investigate this 
we fit, within a given experimental condition, the following hyperbolic 
function to the commodity whose relative price decreased: 

Cic, = a + b(\lt), 


Cm = consumption of commodity k on day t, commodity k 
being the good whose relative price decreased. 


( 8 ) 


l = time in days within a given experimental condition. 

The function (8) builds in the assumption that consumption ap>- 
proaches a steady-state value as time passes and there are no further 
changes in experimental conditions. We fit these functions because (1) 
In terms of conducting the experiments, conditions were only 
changed when inspection of the data suggested no significant drift in 
consumption patterns, and (2) graphs of the data suggested that in 
most cases we did not need to use more complicated functions to 
adequately represent the time path. 

1 he coefficient b of (8) is readily interpretablc in terms of a number 
of concepts within the economics and psychology literature. U b is not 
significantly different from zero we have, in effect, instantaneous, or 
at least very rapid, adjustment to the price changes, relative to the 
day-to-day variability in consumption patterns. Fitting (8) to the 
commodity whose relative price decreased under a given experimen¬ 
tal condition and finding a negative value for b would imply that 
subjects responded gradually to the change in relative prices, in¬ 
creasing consumption of the now cheaper good with the passage of 
time. Such behavior would be consistent with concepts of habit for- 
niation found useful in describing human consumption for a number 
of nondurable goods (Houthakker and Taylor 1970; Phlips 1974). 
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The presence of habit tormation W{»uld also be consistent with psy¬ 
chological notions that it takes S’.s time to perceive the changes in 
relative prices and even more time to learn the implications of re¬ 
sponding to these changes. 

In ettntrast, a positive sign for h in (8) would indicate that the 
immediate response to a decrease in relative prices was to consume 
more of the < ommodity than with the passage of time. This sort of 
r esponse pattern goes urtder the heading of a stock-adjustment effect 
within the Houthakker-Tatlor framework. It might be present in our 
data foi any number of reasons. 

In general there is no a priori basis for deciding whether habit- 
formation or stot k-adju.stmcnt effects will predominate for a particu¬ 
lar commodity. Fot example, Houthakker and Taylor (1970) found 
that housing exhibited strong habit-formation effects, while domestic 
scrviies and water use showed strK k-adjustment effects generally con¬ 
sidered to be charatlerisik of consumer tlurable goods. The empirical 
Work in psycliologs shows much the .same diversity since learning 
(iirves of almost c\ei y conceivable shape have been leportcd in the 
literature (Ma/ui and Hastie 1978). 

Figure 1 shows the daily water eonsumpiion for condition 2 when 
the relatne price of water decreased. Fhe mean consumption of water 
during tfie last 10 da\s of the previous period is also shown as an open 
ciicle on the watei axis. Fhe estimated h values and the Durbin- 
VVaisori siaii.stit s olitained f rom the least-srpiares estimates of eijuation 
(8) are also given for each subject.^ Fhe data show little trend and a 
great deal of saiiafiility. This is evident in the Icast-.scjuarcs estimates 
of the titne-irend loeffir ients, h. which are not statistically significant 
for anv of the .S's. Note, fiowcver, that in each i ase h is positive in sign. 
So that wfiai little tieiicf there is in tfie data is inconsistent with the 
notion that habit foitiiation dominates choices. That is, theie is a 
tendency to liave ni>)rc' substitution into water during the first few days 
tfian is shown in the comparative static compari.sons. 

Fiptre 2 shows the daily IckkI consumption for condition 4 when the 
telative price of fooci dec reased. (7ncc again there appears to be little 
trend, a lesuli captured in the /-test ratios for the least-scjuares c.sti- 
mate.s of b, which are insignificant in all cases, although the signs of 
the ctreffic ients art^ soinewhal more favorable to the habit-formaticjn 
hypothesis. 

I he absence cjf significant time-trend coefficients, in conjunction 
with the data re|>oried in table I, iinplicxs thaiii's adjusted very cjuickly 
to the changes in telative prices, at least within the time frame for 

, breaks in the sequential data plotted in hgs. 1-4 correspond to days when 

the equipment malfunctioned Ilummy variables are used in these cases when adiuslinit 
(8) to the data 
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• Maan Conitifn^ton 

Pn. I —I <'iiipt)ial paiierii of water <(>ii!>umptKm lollowin); a detrcase in its relative 
price experiment I, eonditicm 2 Bioken linev tiHlkate dayv with ecjuipment (ailurex 
Isrimated values ol the time trend locnicicnt. h. with the turrexpondinn l-xtalistK 
repuned in biaekets are (jiven ftir each stibjcxi along with the value of the Durhin- 
W'alson statistic I he symlxils +, •, and •* indicate stalisiK'al significance ai the 10. !t, 
and I |X'r( ent levels, respectively I'lie symlwils - and o indicate the Diirhin-Wat.von lest 
was iiidelerininale <>i there was an insufficient nunilier of ohservalions for the test 


which we have data (daily totals) and relative to the day-to-day vari¬ 
ability in the data. For example, in going frtim condition 1 to 2, S 14 
more than doubled water intake but has a time-trend t oefhcient with 
a /-statistic considerably less than 1.0. The fretjuent trips to the 
cafeteria windows (.S"s received 60 or more payoffs per .session), the 
low search costs involved in chexising within the confines of the ex¬ 
perimental test chamber, and the physiological limits on suhstiuitabil- 
ity imposed by the commodities emploved all facilitate S's adjusting 
rapicily to the changes in experimental conditions. The data here 
indicate that at least some of the subjects tcxrk advantage of these 
conditions. 

[he Durbin-Watson statistics, while indicating a tendency to nega¬ 
tive hrst-order autcxorrelation in the re.siduals, do not show a serious 
problem in these respects. Such a problem might be anticipated from 
the physiological interdependencies between fotxl and water, which 
would result in the pigeon’s getting caught short on food or water for 
a given day, only to be compensated for in the next day’s consumption 




• Contumption Pr«v/ou« Pptktd 

Kk. 2 — 1 t'rii[K>rdJ pjaorn ol lood (omunipiion lollowiiiK a detreasr in ils relative 
piKf iNpeniiiciit I, (onditioii 4 Nolaimii is the same as in figure 1. 


Inindle^ I ha tiafa indit ate, however, that this did not happen. Sub¬ 
jects eitltci had sufficieiitlv good “planning,” given the Irequcnt trips 
to tlie cafeteria w'indow in a given day, or sufficiently frequent cross¬ 
overs between food and water schedules so as not to generate imbal¬ 
ances strong enough to dominate the residuals. Durbin-Watson 
siaiistics computed on the basis of fitting (8) to the data from the 
baseline periods show similar results. 

III. Results For Experiment 2 

Prorrdurr\ 

Ksseniially the same procedures were employed as in experiment 1. 
The major differ cnce was ihe sequence ol price changes. 

Four experimentally naive male White Carneaux pigeons served as 
subjects. Initial (baseline) conditions were the same as for ,S”s 14, 46, 
and .‘>0 in experiment 1 with the exception that S 38 obtained 13 

' Wc arc assuming that .S's can and will correct imbalances within one day. This 
4S.suinpnuii i.s based r>n data lor days following equipment failures (e.g., a malfunc¬ 
tioning focKi dispenser) which show that, for rats and pigeons alite, excluding data for 
the day of the failure and the following day generally leaves the lime trend unaffected. 
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seconds access with each food payoff and 6 seconds access to the water 
reservoir. The difference in price-change sequence from experiment 
1 consisted of (1) decreasing relative food prices first and (2) not 
returning directly to baseline conditions but following this with a 
compensated increase in food prices relative to baseline before re¬ 
turning to baseline conditions. (Table 3 shows the sequence and 
magnitude of price changes.) 


Comparative Statif Results 

Table 3 reports the effects of the compensated price changes. The 
decrease in relative food prices in period 2 led to statistically 
significant substitution effects in the expected direction in all cases. 
The price elasticities of demand for food ranged from a low of - .21 
for i 38 to a high of —..51 for S 36, averaging —.32 across .S"s. 
Applying /-tests to the percentage of total lime spent obtaining ftx)d in 
periods 1 and 2, we again reject the hypothesis that the simple 
random-behavior model (7) explains any of the data; Demand is far 
too inelastic in this case also. 

1 he compensated increase in relative food prices in period 3 does 
show some surprises, however. In all ca.ses the relative price of food 
was greater than during the initial baseline period. However, the 
predicted magnitude of change in consumption did not materialize 
(oi any of the .S 's. Although the compttsition of consumption (ratio of 
food to water payoffs) changed in the “right" direction from perifxf 2 
to 3 for three of the four i’s, in no case was the change sufficiently 
latge to result in a correct sign for the weak-axiom lest statistic (6). In 
tontrast, comparable increases in relative food prices starting from 
hascline conditions in experiment 1 resulted in a correct sign for the 
test sialistic in all cases. Further, a comparable increase in relative 
food prices fori 85 in period 2 of experiment 1, which started from a 
relative price ratio similar to that faced by .S's 36-39 in pericad 2 of 
this experiment, did result in consistem substitution effects significant 
iit tne 10 percent level.' 

One potential explanation for the behavittr observed in period 3 is 
that preferences had shifted. The return to baseline conditions in 
petiod 4 enables us to test this hypothesis. For the one subject, S 36. 
whose behavior is clearly inconsistent in period 3. preferences have 
clearly shifted (/ = 5.15, df = 18. P < .001).» For the other three 5‘s 
s owing nonexistent or undersubstitution effects, the hypothesis 
receives only weak support. Although fix>d consumption has in- 

higher for .S 8,5 in peri<K» 1. 

probability that we tan rejetl llie null hvpoihc.sii of no eflect. 
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ViiL Of'tirt it-Uinr t<MMf (irti - ^uiufATtl rrriir (hr mr^fi 
[hr .ir irn i^rlrH witli S (1rgir(-s ul trrraoin 

(-^Irulalrd iurr the Um In o( rrrh prrMxt 
Ilte /-irsCi jif (H-j lailrrl wjih (M drffrm of frrriUtm 
• S(f(iiifi(ant .ti ilir 10 prKrnl lr\r( 

*• Slffntfif3nt .11 [fit ^ prrrrnt lr(rl 
Si|fnifKan( 111 (hr I prrrcnt Irvrl 


35 9 
(1.26) 

29.7 
(.37) 

23,3 

(.3.15) 

26.8 
(1.24) 

18,8 

(1.47) 

15.6 
(1.27) 

12 6 
(3.71) 

16.2 I 
(2.39) J 

15 1 
(1,3.5) 

12.6 
(1 09) 

14 0 
(I 48) 
1.3.0 
(2.21) 

20.2 
(2,01) 

15.4 
(1.06) , 

16 9 
(1.78) 

15,9 
(1 25) 


(-Test for 
F.qdai.ity of 
Baskiines 
(Marginal 
■Signifitaiire 
Ix-vel)^ 


5 . 15 *** 
( 001) 


93 
( 37) 


.78 

(. 4 . 5 ) 


1 . 78 * 
( 09) 


8o 
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S38 



• MMn Cenaumptton Pr««ieu« ^lod 

(■K.. — Icmporal pallfrn ol Iikk) ioiisumpiion (ollowing h det rcasr in its relative 

piice evpeiitnem i!, condition 2 Notation is the same as in figure 1 


11 eased it) period 4 relative to the initial pei iod for all three, in no case 
does the increase seem to be suf'hcient, by itself, to explain the 
liiidersubstitution effects. Tlie magnitude of the change in relative 
prices, and the fact that we are dealing with Slutsky-c«)m|>ensatcd 
prit e changes which, if anything, overestimate the Hicksian substitu¬ 
tion eflect (Friedman 1976), would seem to more than (ornpensate 
foi any small shifts in preference structure that might have tx-curred." 
Finally, the behavior reportecJ in pc-riod .‘f is inconsistent with the 
landom-respoiise model (7) also. 


A(l]u\tmrnl Patterns 

figure 3 shows fcxid consumption in jx’iiod 2 following the decrease 
in Its relative price. Fhe data show more ol a positive time trend, 
consisieni with the notion of habit formation, than comparable obser¬ 
vations from experiment 1. Fstimaied values of the time-trend 
coefficient, b, from equation (8) confirm this as we have negative 
coefficients in three of the four ca.ses, each one of which is significant 
at the percent level. Fhese negative coefficients indicate that sub¬ 
jects responded slowly to the decrca.se in relative IchkI prices, consis¬ 
tent with notions of habit formation. 

Wc are assuming here that both ftxid and water are normal goods. 
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• Hl««n ContumptMn PhnMia takMl 

Fi(. 4 — I emporal paliern of wjter ton.suniption following a decrease in its relative 
price expenmeni 2, condition 3. Notation is the same as in figure I. 


Ill contrast, the data for period 3, shown in figure 4, during which 
the relative price of water decreased, show stock-adjustment effects in 
all rases, being statistically significant at the 1 percent level for S 37 
and at the 10 percent level for .S' 38. Thus, the initial response to the 
decrease in the price of water was greater at the beginning of this 
period than at the end. In fact, for all S’s the commodity bundle 
purchased on the first day of period 3 satisfied the basic consistency test 
(1). Further, for .S’s 37-39, who did not shift preferences between 
baseline periods, the weak-axiom test statistic calculated over the first 
3 days tjf period 3 has the correct sign, although it is somewhat 
smaller in magnitude than values observed under comparable condi¬ 
tions (period 2) in experiment 1. These data would appear to rule out 
arguments that the undersubstitution effects found in period 3 were a 
result of not maintaining experimental conditions for a sufficiently 
long period of time and/or that S’s failed to correctly process the 
magnitude of the changes in relative prices. 

The adjustment patterns in the two experiments are qualitatively 
similar. Pooling the results from the two experiments shows an 
asymmetric response to increases and decreases in relative food 
prices. For decreases in prices we find a tendency to habit formation, 
or increased food consumption with the passage of time. For increases 
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in relative food prices the data consistently show anti-habit forma¬ 
tion, more water (less food) consumed immediately following the 
price change than later on. These conclusions are reached whether we 
combine the I-test outcomes for the price-change periods (Z = -2.28, 
P < .03 for the I-values associated with b in periods when relative food 
prices decreased;Z = 8.0, P < .001 for the I-values associated with 6 in 
periods when relative water prices decreased) or use a x* analysis 
based on the signs of the time-trend coefficients (x* = 9-6, df = 1, P < 
. 01 )." 

The asymmetric resjxmse to increases and decreases in relative 
food prices is quite unexpected. One piossible explanation lies in 
physiological asymmetries with respect to how animals process food 
and water. Food can be stored in the form of enhanced body weight, 
while the experimental procedures do not {jermit maintaining com¬ 
parable stocks of water. Thus, in the case of increased food prices, 
there is a stock of food available to buffer large and immediate 
increases in water consumption. But with decreases in the price of 
food, there is not a stock of water to produce similar results. Further, 
food-buffer stocks were larger in experiment 2 at the time food prices 
increased since it came immediately after a large decrease in fcxid 
prices in the previous period. This might have contributed to the 
stronger stock-adjustment effects found in this experiment. 

Although adjustment patterns in the two experiments are qualita¬ 
tively similar, a question arises as to what factors are responsible for 
the quantitative differences in the /-ratios and the size of the time- 
trend coefficients between the two experiments. The size of the re¬ 
ductions in relative food prices during condition 2 of experiment 2 is 
comparable to that faced by subjects in experiment 1 (see tables 1 and 
3), Further, the length of the baseline periods preceding these price 
changes was similar in the two experiments. This would seem to rule 
out arguments that these quantitative differences result from the 

" Under the hypothesis that the mean value for the l-siatistH in the population is zero, 
the statistic. 


" - 2)1 • 

wheie/j represents the dejfrecs of freedom associated with has a samplm)( distribu¬ 
tion which IS approximately S(0, I) (Winer 1971). The picture gels a bit fuzzier if we 
include periods of return to baseline (conditions 3 and 5 in expenmeni 1, condition 4 in 
experiment 2) to tire analysis. The Z-siatistic for l-values when relauve food prices 
decreased loses significance (Z = - 1.12, P = .26). while for peritxls when relative water 
prices dec leased the significaiKe level drops (Z »= 2.60. P < .01). Also, the significance 
level for the x* test drops (x* “ 4.5. df • l.P < .04). The primary reason for this is that 
the sign of the ft coefficient associated with foexi consumption in perkxl 4 of experiment 
1 is positive for S's 37-S9, while to be more consistent with the price change periods it 
should be negative since food prices decreased going into this period. 
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magnitude ot price changes or the length of time prices had remained 
stable. This leaves individual subject differences or differences in 
procedures (sequence effects or some undetected change) between 
the two experiments as the most likely factors responsible for the 
differences. 

Attributing part of the differences to variability in individual sub¬ 
ject responses receives some support. The learning-curve literature in 
psychology shows a lot of variability in response patterns using 
single-subject data, with some individual learning curves being very 
irregular 01 varying greatly from the general form for a given task 
(Ma/ui and Hastie 1978). Examination of the magnitude of the esti¬ 
mates of within each experiment reveals a similar variability in 
behavior. Foi example, in both experiments one of the four subjects 
has a positive point estimate when the relative price of food de- 
(leased. Similai dif ferences in individual subject responses are, of 
(ourse, not detectable in aggregate per capita data such as Houthak- 
ker and Taylot employed. However, the consistent negative signs of 
the time-trend coefficients for decreases in relative food prices and 
positive signs for increases in relative food prices (decreases in the 
u'lative price of water) suggest that an aggregate per capita analysis, 
under condition.s where all subjects faced the same relative price 
changes, would cleaily exhibit these patterns. 

Finally we note that the values foi the Durbin-Walson statistics 
lepoi ted 111 figures ‘1 and 4 are comparable to those found in experi¬ 
ment 1. While there is a tendency to negative first-order autocorrela¬ 
tion ill the resicliials, the.se effcet.s are rarely strong. E.ven though S's 
would readily consume more at the end of a session if given the 
opportunity, thev managed to keep food and fluid intake relatively 
well balanced on a daily basis. 


IV. Summary and Discussion 

L'sing figure 5 we can readily summarize tlie consistency lest results. 
Starting from some iniiial (baseline) budget line (line 1), with con¬ 
sumption bundle //. Slutsky-compensated increases or decreases in 
relative prices result in consistent and statistically significant substitu¬ 
tion effects (poimsTf and f.', respectively). This result replicates similar 
tests of the Slutsky-Hicks theory using rats pressing levers and 
chcKLsing between food and water, or flavored fluids with ad lib. 
access to food and water (Ragel ei al. 1975, 1980). Ehese results 
suggc.st the universality attd robustness of one of the fundamental 
"trutfis” of the static theory of consumer demand, that compensated 
price increa.ses will bring about a decrea.se in quantity demanded. 
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(haracU'i'i/ing choucs. Stc text lor delail.v. 

Further, this truth does not rest on simple random-behavior models 
suggested by Becker (1962) and Chant (1963) but apparently reflects 
a basic behavioral principle. 

Large rotations in the budget line which involved swinging from 
line 2 to 3 without stopping off at baseline resulted in inconsistent 
(hoice behavior (fxiint C”) relative to the initial commodit) bundle 
consumed. While .S's tended to alter their composition of consump¬ 
tion in the expected direction, they did not change enough to be 
consistent with the initial commodity bundle consumed.* Efforts to 
explain the.se undersubstitution effects in terms of shifting preference 
structures and/or delayed reactions (learning) in response to the 
c hange in relative prices did not prove successful. 

Although it would be nice to have an explanation for the inconsis- 

" VVe seal ched our ral data for similar rotaiions in the liudj^ei line In ihe one case ihis 
happened the rc.siilis aie .similar 10 ihose re|K>rlcd here. I'hi.s was overlooked at the 
lime since further rotalioii of the hudgei line in the same direriion resulted in addi- 
Uonal changes in consumpiion so that ihc weak-axioin tcsl siatisiir was satisfied. 
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tencies reported here in terms of established patterns of behavior 
derived from the choice experiment literature, we have not found one 
which is completely consistent. For example, anchoring effects, the 
fact that for humans estimates of quantities (Tversky and Kahneman 
1974) and strengths of sensory phenomena (Stevens 1975) are greatly 
affected by the starting point used in making these estimates, might 
be used to explain the differences in going from budget line 2 to 3 
diicctly rather than from 1 to 3. But anchoring, unless conjoined with 
a hypothesis specifying some minimum difference between points 
belore it takes effect, would also seem to imply systematic biases in 
(onsumption patterns in returning to baseline conditions following a 
lelative price change, something we did not observe in experiment I, 
Kxplainitig the inconsistencies in terms of changes in the animal’s 
body weight resulting from the relative price changes runs into simi¬ 
lar problems, although we cannot rule out more complex physiologi¬ 
cal interaction effects. Peihaps the behavior simply results from S's 
inexperience, so that once having experienced a compensated price 
change startitig from baselitie prices, the same commodity bundle 
would be chosen starting from elsewhere in the choice space. 
Identification of the fac tors underlying the inconsistencies must wait 
further experimentation. Identifying these factors would also help 
(iarify the behavioral mechanism underlying consistent choices, 
Looktng at day-to-day adjustments in consumption patterns result¬ 
ing from changes in relative prices we find a number of cases in which 
atljustments were completed within the first observation period and 
other cases with st.ttistically significant learning effects. Individual 
subject diffei ences rather than the size of the price changes or other 
environmental factors were cited as one of the factors behind these 
differences. The fact that the adjustment coefficients tended to have 
the same sign whether they were statistically significant or not sug¬ 
gests that an aggregate analysis under conditions where all subjects 
faced the .same price changes w'ould produce statistically significant 
learning effects. Unanticipated asymmetries in the adjustment path 
resulting from increases and decreases in relative ftKid prices were 
al.so found. We are looking at adjustment patterns from the rat 
studies in an effort to confirm the.se regularities. 

Questions naturally arise concerning the relevance of the animal 
behavior reported here to understanding analogous human behavior, 
the behavior of primary interest to most economists. Objections 
might be raised on two grounds. One is that there arc essential 
differences between the choice conditions (environmental conditions) 
the pigeons face and conditions facing human consumers operating 
in national economic systems. The pigeons are not given money (or 
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tokens, as in some human economic experiments) and told what 
prices are, but are simply faced with sequences of situations from 
which they must deduce what the rules of the game are. One might 
argue that this results in the pigeons facing a more difficult situation 
than is assumed in ordinary consumer-demand theory. Second, one 
might object that there are essential differences in the choice process 
itself, on what goes on inside the pigeon’s brain, so to speak, relative to 
human consumers. The sequential way in which subjects learn about 
changes in relative prices and the apparent sequential nature of 
choices even under steady-state conditions stand in marked contrast 
U) the textbook characterization of a consumer who maximizes a 
utility function subject to a budget constraint at a point in time, 
apparently deciding on all purchases at once. We deal with these 
objections below. 

There is no question that the choice conditions the pigeons faced 
here have no exact physical analogue to choice conditions outside the 
laboratory for humans, or for pigeons in the wild for that matter. 
What an experiment aims for is to develop procedures in the labora¬ 
tory which provide a logically consistent interpretation for the con¬ 
cepts and relationships of the theory under investigation. Within the 
context of consumer-demand theory, this involves developing proce¬ 
dures that alter the choice set in the same way that the symbols 
commonly designated "price” and “income” alter choices in the 
general theory. This is true of any application of the theory, be it 
experimental or nonexperimental, using human or nonhuman sub¬ 
jects, so that there are always some differences in environmental 
tonditions in different applications of the theory, and these differ¬ 
ences may affect decision outcomes. 

If the theory is to have any generality it must hold under a variety 
of such chtjice conditions. Consequently, confirmations and dis- 
confirmations are relevant to evaluating the truth status of the general 
theory under any and all logically consistent interpretations:'® 
further, it is often the case that the most preferred conditions for 
testing a theory will be "atypical.” For example, one of the crucial 
experiments of relativity theory is to determine if stars behind the 
edge of the sun appear to be displaced from their known positions 
during a total solar eclip.se. One does not object to such experimental 
observations simply on the grounds that they do not correspond to 
"normar’ conditions (Smith 1980). Similarly, perfect compensation, 

IdmoCying dif'fcrcncrs in behavior under dirferem interpretation!! of the general 
theory sets oil a search lor the factors responsible for these differences and eventually 
lesults in modilications of the general theory to account for these factors Unfortu¬ 
nately. this is often an exceedingly slow protess. 
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which is required to directly apply the weak-axiom test (1), rarely 
occurs in nature but has an important role to play in better under¬ 
standing commodity-choice behavior. Establishing procedures for 
irniilernenting these conditions using nonhumans is considerably less 
costly than with humans, while introducing tokens or some other 
conditioned reinforcer to serve as a medium of exchange and unit 
accounting, although technically feasible, would only complicate the 
experiment with little likelihood of seriously affecting behavior. In 
fact, we might argue that having subjects learn the rules of the game 
through experience rather than relying on price representations 
eliminates potential pyschological complications introduced by such 
representations in monetized economies (Russo, Krieser, and 
Miyashita I97.'i). proving a reference point against which to evaluate 
the predictions of the theory. 

The apparent sequential nature of the pigeons' choices, while 
standing in c()ntrasi to notions of consumers possessing full informa¬ 
tion and making decisions simultaneously, poses no conflict with rc- 
spect to observable characteristics of human choice in the mar¬ 
ketplace. It takes real time for consumption to occur; hence, real 
choices of all organisms have a sequential aspect that is commonly 
ignored iti comparative static analysis. Further, differences in “plan¬ 
ning hon/.ons ' between humans and pigeons must, almost by defini¬ 
tion, be one of length rather than existence, unless one is willing to 
argue that the pigeons’ behavior is the result of .some elaborate, 
genetically determined program. 

Finally, it must be remembered that whether or not pigeons or 
humans have consciously thought out their Ischavior is irrelevant to 
characterizing that hehavioi as a solution to a constrained optimiza¬ 
tion problem. As .Samnelson (1947) notes, “... it is possible to formulate 
our conditions of equilibrium as tho.se of an extremum problem, even 
though it IS admiltedly not a case of an individual's behaving in a 
maximizing manner, just as ii is often possible in classical dynamics to 
express the path of a particle as one which maximizes (minimizes) 
some quantity despite the fact that the particle is obviously not acting 
consciously or purposively.” In a competitive world all organisms 
must solve the problem that time and energy are limited and that 
efficient allocation of resources, over which the individual has control, 
is essential to success. T he fact that man is capable of self-awareness 
and reasoning and is oc casionally guided by it in striving mathematical 
puz.z.les, for example, does not imply that these factors are either 
necessary or sufficient to achieving efficient allocation of resources. 
Nor does this prove that human consumers in the marketplace are 
guided by these fac tors, while pigeons in the laboratory are not. 
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Procedural Details 

The experiment was conducted in a sound-insulated pigeon test chamber that 
measured 13 V* inches long, 11 '/i inches wide, and 14 inches high. The test 

TABLE A1 


Subject and 

Experimental Condition* 

No. of 

Days in 
(Condition 


aw 

Subject 14: 

1 

5 

60 

60 

2 

12 

72 

25 

,S 

46 

60 

60 

4 

16 

34 

120 

b 

14 

60 

60 

Sub|ect 36: 

1 

20 

60 

60 

2 

19 

20 

90 

3 

16 

120 

13.3 

4 

.36 

60 

60 

Stib|ect 37: 

1 

46 

60 

60 

2 

19 

45 

90 

3 

20 

80 

20 

4 

27 

60 

60 

Sub|cct 38. 

1 

18 

60 

60 

2 

20 

48 

90 

3 

1.3 

72 

27 7 

4 

48 

60 

60 

•Sublet 1 39' 

1 

4*t 

60 

60 

2 

17 

48 

90 

3 

1.3 

72 

30 

4 

31 

60 

60 

Subjctt 40 

1 

14 

60 

60 

2 

12 

80 

28 

3 

39 

6t) 

60 

4 

2.3 

26 

120 

5 

28 

60 

60 

Sub)ctl .SO- 

1 

8 

60 

60 

2 

11 

80 

20 

3 

30 

60 

60 

4 

19 

38 

120 

.3 

19 

60 

60 

Subieti 8.3: 

1 

10 

36 

72 

2 

22 

90 

20 

3 

33 

60 

60 


Ntni —Up » dvrragr Um« heiwreii litod |Nivofh m <»■ « AvrT*jjr um« between wain m tmmdt 

* Mean reptmrd in the irxi foi .V'n 14 «nd in tondiimn 1 nrr over all d«iv» tn thu amditmn 
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chanibfi had three response keys (2.5 centimeters in diameter, fabricated 
from clear acrylic plastic) lot ated on the front panel where the food and water 
reservoirs were located. A single perk on the tenter response key switched 
modes ol dehvery from food to water and vice versa with no changeover 
delay. This kev was traiisluminated with white light at all times. The other two 
response keys, located over the fotxl hopper and water reservoirs, were color 
coded lo indicate the delivery mode in operation. Responses on these keys 
did not affect the contingencies, (ieneral chamber illumination was provided 
by a 7-\sati houselight. When commodities were delivered, the response keys 
were darkened but the houselight remained on, A fan provided ventilation, 
and constant white masking noise was providt'd. 

food and water deliveries were programmed by separate variable time 
tapes. The tapes established a random sequence of interreinforcement times 
following an exponential distribution under all conditions. The sequence of 
interreinfoicement times repeated itsell every 20-S() reinforcements. 

food reinforcements occurred only when m the food mode and consisted 
of the food hopper coming into reach and the food hopper lights coming on 
during leinlorceinent. Water reinforcement otciirred only when in the water 
mode and consisted ol pumping water into a reservoir at the rate of J 
tnillilitei per 15 seconds and ojierating a re.spon.se feedback clicker every 54 
second, niirmg reinfotcenicnt. |>ceks on the center response key did not 
result in a mode switch. 

fable A1 shows the ruimlier of davs each condition was in effect and the 
average absolute time deiavs (in seconds) Ix'tween focxl and water payoffs. 
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Taste Change in the United Kingdom, 
1900-1955 


Steven E. Landsburg 

I'fHvrmty nf ( hira^n 


\ scaicli 1' iiiiulf f<ii Molatioiis t»l tlif axiom of icvealcd piefei t-ncf 
111 <oiiMmiiuioii ilaia tor the loiiiecl Kiiij;tlom. No Mith 

liol.iiioiis aic lomiii I hi.s is lakcii as csulcnic ihai lasios rcinaint’d 
lonsiaiK ihmu^hoiM die period imdtT siiulv. The Mit-iigili of this 
(‘vidfiKc IS fslimaied within the tonicxi of a particular model of 
l.i'tc c hangc 


I. Introduction 

Hus papt’i investigates the plienoineiion of ‘‘taste change’' in the 
f lined Kingdom m the vears 19()0-195'>. In the paragra[)hs which 
follow we shall des( rilKf a method of seart hing lot evideiue of that 
phenomenon. In Section II we shall caret out seveial versions of that 
search and hnd no such evidence. In Section 111 we shall develop a 
general model of taste change in the {oiitexi ol which we shall 
strengtfic.i the negative result ol St'ction II. Station JV^ is a further 
di.scu.ssion of ifie motfel. In Section \' wc shall discuss some of the 
impficariems of our results for economic analysis. 

Our basic nieifiocfoiogy is similar to that of Houthakker (1963) and 
Kocr (1963) Fir.si we c hoose a set ol goods. ’I'hcn we choose a pair of 
years—sa\, lot concreteness, 19(10 and 19J0. We observe what cpian- 
tify of each good was lonsunied in 1900 (we shall refer to this set of 
quantities as the 1900 markci basket), and wc calculate what the price 
of this market basket W'ould have been had it been purc hased in 1910. 
If the price is lower than the price ol the basket which was actually 
ccjnsumed m 1910, then wc conclude that (m 1910) the 1910 basket 
was considered preferable ter the 1900 basket. V\'e repeat (his prexe- 
durc for every [xissible pair of years in .some time period. 'I'hcn we 

iftmm/tl oj poUiufil t.ifmimt, vi»l 8'/, J| 

@ mi by The C;niv«f^itv o( Chicago (1022 1/8901-OOOAIO 1.90 
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search for intransitivities—for example, the 1910 basket preferred to 
the 1900 basket, the 1900 basket preferred to the 1905 basket, and the 
1905 basket preferred to the 1910 basket. In other words, we search 
for violations of the Strong Axiom of Revealed Preference. Such an 
intransitivity is taken as evidence of a change in tastes. 

These methods can be applied either to total national consumption 
or to per capita consumption. We have done both. 

A problem arises when a good is available only in certain years—for 
example, there appear to have been no raspberries in Britain before 
1922. This problem (which was rare) was solved by declaring tbe good 
in question to be identical with some other good, the choice of “other 
good” being dictated partly by aesthetic considerations and partly by 
the observation that identical goods should have nearly identical price 
series. (Raspberries were classed with black currants.) 

When a good is included in the market basket, it is important to 
include all close complements and substitutes for that good as well. If, 
lor examfile, our basket consisted only of automobiles and bicycles, we 
might discover a preference for more bicycles and fewer automobiles 
in 1978 than in 1971—reflecting not a taste change but a rise in the 
price of gasoline. The problem would not arise if gasoline were 
included in the basket. 

II. Evidence 

■file methods described in Section I were applied to six sets of data, 
which are described below. No intransitivities were observed in any 
case. 

We first considered total national consumption of nine goods: fcHKl, 
alcoholic beverages, tobacco, clothing, transport and communication, 
fiousing, fuel, household durables, and "other." The study covers the 
vears 1900-1955. It is noteworthy that no intransitiviiies occur 
throughout this period despite (he fact that the definition of the 
Ibiited Kingdom changed with the secession of Ireland in 1920. This 
may Ix’ taken as a small bit of evidence that the tastes of the nation did 
not change even when the composition of the nation did. 

f he second study was identical to the first with respect to the time 
period and the gCKtds considered; it reflectecf per capita rather than 
total consumption. 

The third and fourth studies were identical to the first and second 
with the exception that housing was eliminated from consideration. 
This was because expenditure on housing was defined as the total of 
rents, rates, and water charges collected, a definition which is clearly 
objectionable (e.g., it fails to include the opportunity cost of owner- 
occupied housing). 
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Fic. I —Revealed prefcieiues anumg years for loial U.K consumption o( all items, 
classified into nine categories 


Finally, we broke food, alcohol, and tobacco down into 107 subcate¬ 
gories and SI tidied data on those 107 goods for the years 1920-38, 
Again this was done both for total and per capita expenditures. 

Figure 1 provides an example of the results which were obtained. It 
depit Is the conclusions of the first study described above, concerning 
total national consumption of nine broadly defined categories of 
goods. .An at row .d —♦ B indicates that the consumption of year A was 
l evealetl preferred to that of year 6. Revealed preferences which can 
be inferred by transitivity from the ones depicted have lieen omitted 
from the figure in the interests of clarity. Analogous charts for the 
other studies clest ribed above arc available from the author. 

J’he data foi the first four studies mentioned were taken from 
Mitchell and lleane (1954); for the other two they were taken from 
.Stone anci Rowe (1954). 

It should be noted that the upward trend in income, particularly 
lowaid the end of the period under consideration, would lend to hide 
changes in taste even if they did occur.' However, the tests in the next 
section will explicitly lake this phenomenon into account. 

III. Interpretation 

Our failure to observe a taste change supports the belief that no taste 
change cKcuiied; now we ask how strongly that belief can be sup¬ 
ported. Upon making certain assumptions and reexamining the data, 
we shall be able to infer that it is unlikely that the belief is far wrong. 


I am indebud to C.ary Becker (or this oliservalion 
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While the model which we shall develop could theoretically be 
applied to a market with an arbitrary number of goods, practical 
considerations such as limited computational resources require that 
we apply it only in the two-good case. For that reason, we shall restrict 
our detailed description of the model to the two-good case; the results 
for the w-good case are available on request. 

Let X and V be two goods with prices p,v and py. We shall suppt>se 
that the utility function is the “constant elasticity of substitution" 
function U{X,Y) = C(AX'‘ + BY")''", where A, C, and u are constants 
and 0 ^ u < 1. It follows that the elasticity of substitution between X 
and Y is everywhere equal to 1/(1 — u). 

We have dUJdX = AC(AX« + 'X''"' and dU/dY = BC(AX‘‘ -h 

jjyuyihD- iy«-i gy changing the units in which X and Y are measured 
we may assume that utility is maximized when one unit of each is 
consumed, from which we infer that (dUldX)l(dlJldY)\ = pxipy, 

that is, A/B = px/py- 

We suppose for the moment that tastes change while relative prices 
and real income do not. We shall interpret a change in tastes as a shift 
to a new utility function l'{X,Y) which is such that dV/dX = a(dUldX) 
and dVIdY = ^(dVIdY), where a and /3 are observations of random 
variable.s a and /0. That is, we a.ssume that at the margin,'any unit of X 
or Y will yield a or (3 times as much utility as it did before. The 
possibilities a « /3 and /3 ^ a should be equally likely, so we assume 
that d/$ has a median of one. The function satisfying these require¬ 
ments is y {X,Y) = C(<xAX" + fSBY")'"'. This has the same indifference 
curves as T(X,>') = L'[(«//3)TX" + BY"]'"', so we may replace a by a//3 
and /3 by one. Thus out assumption is that the new utility function has 
the form F(X,y') = ('.{aAX'‘ + BY")'"', where a is an observation of a 
random variable d with a median <jf one. (The hypothesis that tastes 
do not change is the hypothesis that d has variance zero.) 

NowdfVdX = a.4(:(AaX“ + «)'■■)<' and dF/dY = BC(AaX'‘ + 
BY"y'"‘'''Y'‘ '. In light <il the budget constraint, the new levels of 
consumption must be 1 + t units of X and 1 — (AIB)I units of)' for 
some number l. In order U) maximize utility, t must satisfy 


that is. 


dVIdX I 
dl /d) 


= P^,.lpy^AIB, 


or 


0,1 {I + f)"-' 
BH - (A/B)/)“-' 


= A/B 


t = 


J _ 


(A/B) + • 
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Now lot us su]jposo tliai in our second period we have not only a 
new utility luiKtion hut a new bndfret line as well. If the two budget 
lines fail to cioss, then there is no ht>pe of observing a taste change, 
because no point on the inner budget line can ever be revealed 
preferred to an) point on the outer one. rhus we restrict our atten¬ 
tion to the case in which the budget lines do cross, as in figure 2. We 
can rename the goods so that the price ofX falls relative to that of V 
between the first and .second periods; thus we may assume that LM 
repre.sents the first-period budget line and JK the second-period 
budget line. 

When the basket consutiled in the first period is on segment OM 
and the basket consumed in the second period is on segment ()J, each 
IS revealed preferred to the other and a ta-ste change is observed. An 
estimate of the probability of this occurrence would yield a measure of 
the strength of our negative result; however, we shall argue that the 
hypothesis of taste change can be more strongly rejected through 
consideration of the “exact opposite" situation—that in which neither 
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l-H,. !i —II llu'io l^ <111 iiKlillcrciKi' ( urxo uiigciit loO.Vf, there must l>e one tangent 
to OK 


baskci is rcvealfd prcfcTred U) ihc other. In terms ol ligiire 2, this 
sitiiatiott (which we shall refer to as iiiccnnparabilit\ ol the baskets) 
ar ises when the first period’s basket is on LO and the second period's 
basket is on OK. 

Now suppose that the first {)ericKl's basket is observed to be on LO. 
In keeping with our earlier a.ssumptions. our supposition is that the 
point (1,1) lies on LO. II the utility function had changed and the 
budget line had not, the new utility-maximizing market basket would 
have been at the point 11 + /. I - (AIB)l] on line LM. If this point 
happens to be on OM, then with the new budget line the uiility- 
rnaxitnizing market ba.sket will be on OK, as is evident from figure 3. 

It follows that if we know that the first period’s market basket lies on 
LO and the point [1 + t, 1 — (A!tt)t] lies on OM, we will observe an 
incomparability. 
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Denotinj^ the- coordinates of 0 by Xo and Koi we see that [1 + /, 1 — 
(Alli)t] is on OM if and only if 1 + t ^ Xo. Since 

I _ •• 

’ (d/B) + ’ 


this IS the case if and only if 


that is, 


I _ ^l/IU I) 

{MB) + a'‘'“ •• 


X„ - 1, 


I -f (1 -Xo)(.4/B) ^ 


or—.since AiB = (V',, - 1)/(1 - ,Vo)— YJX„ > a''"' where f) is 

the clastic il\ of substitution between X and V. 


({ortdu'sion 

II, m figuie tile first period's market basket is observed to be on LO 
(i.e., outside the budget constraint of the second period), and if Ko/X„ 
is greatet than ct ", then we will observe an intomparabilily, provided 
\ and )' are ineasitied in units such that the first period’s market 
basket is at the point (1,1). 

.Siinilar consideratioiLs show that if the .second period’s market 
basket is observed to he on OK and if units are chosen so that its 
cooidinaies are (1,1), then an intomparability will be observed 
provided thai 3- « 

Rcnuirks 

1. Because we shall be applying our model only in the two-good ca.se, 
we shall have to asstiine that corisunijition of all substitutes and com¬ 
plements lor the goods in c|ue.stion remains relatively constant (this is 
•SO in light ol the discussion at the end ctf Section 1). We shall approxi¬ 
mate this condition by reejuiring that the first and second periods be 
successive years. 

2. While the form of the utility function pietlucles the possibility 
that T) = ], we shall assume for ea,se of exposition that it is very clo.se to 
one. (In Seciicjii 1we will explore the ef fect of letting tj take on other 
values.) Thus we reinterpret the conclusion as saying that an incom¬ 
parability will be observed provided Y„/Xo or XJY whichever is 
appropriate, is greater than a"’. 

The data which were used to construct figure 1 were reexam¬ 
ined, and values of K^/Xu or XJY„ were ccjmputed for every pair of 
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successive years and every pair of goods with crossing budget lines. 
The results, for total and per capita consumption, with and without 
the questionable housing data, are displayed in table I at the end of 
this section. (In these new studies, as in the earlier ones which in¬ 
volved more than two goods at a time, no intransitivities were ob¬ 
served.) 

Under total consumption there are 604 relevant cases (i.e., cases 
where the budget lines cross). Only nine incomparabilities were ob¬ 
served. More important, there are two cases in which an incompara¬ 
bility would have been observed if a ' had been less than 1.00. If we 
assume that tastes do change (i.e., that d is not identically one), then, 
by our assumption that d has median one, a”' must be less than one 
50 percent of the time; therefore the probability of our observing an 
incomparability in one of these two cases is greater than 1 — (1 — 0.5)* 
= 0.75. Thus our failure to observe an incomparability in either case 
allows us to reject the hypothesis t>f taste change with 75 percent 
confidence. The above remarks apply to total consumption with or 
without housing data. 

Under per capita consumption, there are 673 cases and only three 
incomparabilities. There are 14 cases in which an incomparability 
would have been observed if ex '' had been less than 0.95. (There are 
eight such cases if one ignores the housing data.) Among these cases 
there is one incomparability. If we were to assume that a"' is less than 
0.95 even 25 percent of the time, the probability of its being stt only 
once in 14 trials would be only 10.1 percent. Thus we can say with 
89.9 perc ent confidence (63.3 percent confidence if we do not use the 
housing data) that a ' is not less than 0.95 even as much as 25 percent 
ol the time. 

In three cases, none of which involved bousing data,-Yn/y,, or fVA’,, 
was greater than 0.995. No incomparabilities were observed in any of 
these cases. Calculating as above, we find that this allows us to reject, 
with 57.8 percent confidence, the hypothesis that a"' is less than 0.995 
more than 25 percent of the time. 


Summary of Results 

Using total consumption, never differs from one (75 percent 
confidence). Using per capita consumption, a ' is not less than 0.95 
more than 25 percent of the time (89.9 percent confidence. 63.3 
percent without housing), and a ' is not less than 0.995 more than 25 
percent of the time (57.8 percent confidence). These results say. with 
some degree of confidence, that d .seems to have very small 
variance—and this is just what is meant by the assertion that tastes do 
not change. 
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OK 

With Hoomnc Data 

WiTHOtll 

Hoo-sinc Data 

oi 

(>ases 

No oi Incom- 
parahUiiies 

No. of 

Cases 

No. i>f incom- 
parabilities 

I cit.il coiisiiiiipiion. 
(m.049 .■>•} 

0 

46 

0 

05- 009 

40 

0 

31 

0 

10 .149 

43 

0 

32 

0 

ir>- 199 

$9 

0 

27 

0 

L’O- 249 

54 

0 

43 

0 

25- 29‘! 


0 

33 

0 

,10- .349 

34 

0 

26 

0 

3.5- 399 

30 

0 

21 

0 

40-.449 

22 

0 

16 

0 

45 - 499 

28 

0 

22 

0 

50- 519 

33 

0 

27 

0 

-55 - 5^^^* 

20 

0 

15 

0 

60 - (549 

28 

0 

21 

0 

65- 699 

21 

0 

14 

0 

70- 749 

25 

0 

20 

0 

.75- 799 

21 

0 

17 

0 

MO - 849 

20 

1 

13 

I 

85- 899 

1.1 

0 

11 

0 

.9i)- 94^t 

15 

0 

8 

0 

.95 - 999 

15 

8 

10 

7 

1 00 

o 

0 

2 

0 

1 Old) 

604 

9 

455 

8 

IVi toiisionpluio 

(10- 049 62 

0 

47 

0 

0.5- 099 

44 

0 

38 

0 

10- 149 

52 

0 

40 
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15- 199 

56 

0 

37 
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20- 249 

51 

0 

39 

0 

25 299 

42 

(1 

30 

0 

.10-.349 

.19 

0 

22 

0 

35- 399 

32 

0 

25 

(1 

40- 449 

34 

0 

25 

0 

.45- 499 

32 

0 

23 

0 
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.10 

0 

25 

0 

.55-.599 

2 ^ 

0 

21 

0 

.60-.649 

29 

0 

I'J 

0 

.65- 699 

29 

0 

21 

0 

70- 749 

22 

0 

17 

0 

.75- 799 

21 

0 

1 1 

0 

80-.849 

24 

0 

20 

0 

85- 899 

15 

0 

9 

0 

90- 949 

I9 

2 

13 

1 

95-999 

13 

) 

7 

1 

1.00 

1 

0 

1 

0 

Total 
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3 
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It is important to note that the results listed above were derived 
without any reference to the fact that in all of our studies we never 
observed a taste change (the phrase “all of our studies” refers both to 
the set of studies described in Section II and to the 1,277 separate 
studies described in this section). This striking fact, which is really the 
main result of this paper, surely mitigates our disappointingly low 
levels of confidence, but I know of no way in which to quantify the 
extent to which it does so.* 


IV. Discussion 

Like most models, ours is laden with assumptions with which a reason¬ 
able man might take issue. Here we reconsider some i>f the points 
which are most open to <|ueslion. 

1. 'ihe Form of the Utility Function 

It is certainly the (a.se that our thoice is arbitrary. However, utility 
(unctions of this form are often assumed in economics, and our 
results do indicate that we should Ix^ wary of models which assume 
such utility functions and also allow for changes in taste. Lbus, even if 
we were to grant that our utility function is not an accurate reflection 
of economic leality, we would still be able to draw conclusions about 
the consistency of certain economic models, and this is a topic which 
should t)e of concern to ecttnomists. 

It should also he noted that, alter some effort. 1 have not Ix-'cn able 
to find a ■‘rea.sonable” utility function which yields results substantialh 
different than d<K's the one adoptc-d here. 

' One wav i« iK-jfiii i<i ((uaniitv ii is ihc lollovong I rlp\ .iiidp) Ik- ibc pines ot .V and ) 
HI tilt* /th vcdr Hiid 



(so thni 1) IS ulways li'ss than one*. l>e(auM* ihr nniamed so that the u'lati\r 

j)i !(<■ of .V ffll). I lifii It (an fK' sliown ihai a lasu* t han^c u ill hv obs<*r\e(i if and oni\ if a 
< ot a < (f’„/-Vu)n (drjK*ndinj 4 on whctlici tht* fiiM \t.*ar‘s Iwskft is in.side ihe 

M'tonci veal 's biidgci line oi vue veisa). we (ouUI proteed as in ihe Iwxfs of ihe 

pajK.!. measuring values lov and instead of o aod YfyfXo. Vhis 

pUKedure—which amounts to measuiiiig the siiength of our failure to obsene a lasie 
(flange b\ riieasunn^ the ptohahihtv (hai a taste change could ckcui and go 
unobscTved—seems mote siiaighfforwatd than the one adopted m the text, but when 
applied to the same data it yields iiumcrKal results which are siibsianiiallv weaker 
(bven the assnmption.s of our model, this means that even if tastes changed sulistan* 
tiallv, oui faiiiire to ohsei ve a taste change in any of the 1.277 case.s of Section III would 
not Ih‘ tcM) suipiifting. However, this caU ulaiion ignores all of the findings of Section II, 
which upfx'at to Ik‘ tjuilc siiong. I do not know the best wa> to quantify tlicir 
strength—a naive gcnerali/.aiic>n of the current incxfel would recpiiie additional as¬ 
sumptions and very extensive computational resources. 
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2. The Exogeneity of Price Changes 

On the surface this is a very damaging objection. If prices and tastes 
do not change independently, as we have assumed, than a considerably 
subtler analysis of the problem is required. However, reasoning as in 
(1) above, we argue that the fact that many economic models do 
assume exogenous price changes implies that we can at least draw a 
conclusion about those models. The relevant question for economists 
is not ‘‘Do tastes change?” but "Are models which assume taste 
changes consisient with reality?" The latter is the question which we 
have really been addressitig; the answer, for a large class of models, 
appears to be “no." 

3. The Form of the Taste Change 

All that t an really be said here is that the form which we have assumed 
seems reasonable and that other rea.sonable forms yield similar re¬ 
sults. If, for example, we had assumed that the new utility function 
was given b\ l'(A'.>') = U(aX,Y), rather than being determined by the 
requirement that dVIdX = a(dU/dX), then our conclusion would have 
been exactly the same except that a*'’’ would have appeared instead 
of a ’’. 

4. The A'isamfjtion That r} /s Close to One 

Oiit calculation of the confidencre with which we can say that a never 
differs ftom one (i.e., that tastes do not change at all) is unaffected by 
this assumption; It would remain the same for any value of r}. With 
regard to out calcidation of the confidence with wliich we can say that 
a is not less than some given value more than a given percentage of 
the tune, our resedt would be stronger for larger r) and weaker for 
smaller rj. 

5. The Assumption That a Is Drawn JneJeperuiently Each Time 

'I'his assumption was essential when we calculated confidence levels in 
Section III. Our first calculation was that we could .say with 75 percent 
confidence that a never differs from one. The two cases involved in 
this calculation were transport and communication versus tobacco for 
1907-8 and fuel versu.s alcoholic beverages for 1937-38. The 30-year 
separation, together with the fact that entirely different goods are 
involved, seems to Justify the assumption that a was drawn indepen¬ 
dently in these two cases. 

Our strongest numerical result involved the confidence with which 
we could say that a was not less than 0.95 more than 25 percent of the 
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time; it was calculated to be 89,9 percent on the basis of 14 cases. The 
14 cases involved are listed in the unnumbered table below. 


1. 

Housing vs. food 

1913-14 

2. 

Housing vs. alcoholic beverages 

1942-43 

3. 

Housing vs. tobacco 

1916-17 

4. 

Housing vs. tobacco 

1943-44 

.5. 

Housing vs. fuel 

1951-52 

6. 

Household durables vs. tobacco 

1922-23 

7. 

Household durables vs fuel 

1927-28 

8. 

Hou.sehotd durables vs. fuel 

1929-30 

9. 

Household durables vs. transport and communication 

1905-6 

10. 

Household durables vs. transport and communication 

1909-10 

II. 

Alcoholic beverages vs. other 

1909-10 

12. 

Household durables vs. other 

1904-5 

13 

Housing vs. other 

1902-3 

14. 

Fuel vs. other 

I90I-2 


(Case 14 was the one in which an incomparability was observed.) The 
reader is invited to form his own opinion as to how many of these 
cases are really independent. If, (or example, he di.scards cases 8 and 
10 on the grounds that they are too closely tied to c ases 7 and 9, the/i 
we have 12 independent cases instead of 14 and our confidence drops 
from 89.9 percent to 84.2 percent. 


V. Conclusions 

I'he main result of this paper is that ncj taste change was observed in 
any of our studies. T hese iitcliided studies of total consumption and 
of per capita consumption, studies of broadly defined categories of 
goods and of very narrowly defined goods, studies of exhaustive and 
nonexhaustive lists of goods, and studies of as many as 107 and as few 
as two different goods. T'he two-good studies alone presented 1,980 
separate cases; 1,277 of these had crossing budget lines and hence the 
potential for an observed taste change. None of this potential was 
realized. In Section III we calculated levels of confidence with which 
we ccjuld say that tastes did not change by more than certain specified 
amounts; these calculations were indeptendent of the fact that no taste 
change was observed and therefore greatly understate the strength of 
our results. 

The mctrals are the obvious ones: In economics, we should be wary 
of models which include variables for taste change, and outside eco¬ 
nomics we should l>e skeptical of warnings about the “wild vicissitudes 
of taste” and the constant danger of having our vicissitudes manipu¬ 
lated. With regard to this last point, it is particularly interesting to 
note that our study covers the period in which the electronic media 
came into being and into flower. 
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h is tasy to postulate specihc forms of taste change among the 
goods whkh we have studied, and it is easy to postulate forms which 
our methods could not have observed. It is considerably harder to 
postulate such forms which satisfy the additional restriction that they 
be reasonable. 1 do not believe that is possible, at least not without 
violating exogeneiiv of prices. If this is correct, then our results have 
strong implications for all price-theoretic models in which price 
changes are exogenous. 
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The Determinants of Tariff and Nontariff Trade 
Restrictions in the United States 


Edward John Ray 

Ohm Sltile Umveruly 


I'his paper develops and tests a simple model for the determination 
of tariff and nontariff barriers to trade across industries within the 
United Slates, using 1970 tiade data. We find that nontariff trade 
lesiriclions have supplemented tariff protcelion in the United Slates. 
Both tariff and nontariff trade restrictions are bia.sed toward indus¬ 
tries in which the United .States has an apparent comparative dis¬ 
advantage in world trade and away from industries in which con- 
suitier welfare lo.sses from protection would he great. We also find 
substantial evidence that tariff anti nontariff trade restrictions pre¬ 
dominate in industries with ver> different market characteristics. 


The put ptise of this study is to synthesize a number of ideas expressed 
in eatlier work into a simple analytical framework to explain the 
structure of tariff and nontarif f barriers to trade across industries in 
the United States. Our central premise is that, subject to political 
constraints, trade restrictions are consistent with profit maximization 
across industries. Using International Trade Commission data for 
tariff and nontariff trade restrictions and industry characteristics for 
225 U.S. manufacturing indu.stries in 1970, we find that such an 
approach has significant explanatory power. 

fhe resulting empirical analysis is unk|ue in a number of imporunt 
respects. First, we provide separate reduced form estimates of tariff 


1 he author is grateful to Betsy Radigan. Steve txiok, F.d Honton, and |im Brown for 
aid in generating the 1 obil and Probil results. Professor I cisuiiori koi/umi. Jacob 
Frenkel, Howard Marvel, Donald Parsons. Jerry Thursby, Mane Thursbv, I hoinas 
Wolf, and, most particularly, J David Richardson provided numerous helpful com¬ 
ments on earlier drafts of this paper. 

[/numa/ tt/ Poltttcal Economy, 19H1, vol 89, no IJ 

(§> 1981 by The Univeniiy of C'hiLago 0022-9HOR/8l/890l-0009f01 M) 
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and nontarif'f trade restrictions in the United Stales. Second, by ex¬ 
tending our analysis to include the estimation of' a simultaneous 
model for the determination of tariff and nontariff trade restrictions, 
we provide empirical support for our hypothesis that nontariff trade 
restrictions have been utilized in part to compensate for internation¬ 
ally agreed-upon tariff reductions in the post-World War II period. 

I'he third important finding is that, while tariff and nontariff trade 
restrictions ro-exisi in industries with some common characteristics, 
there are some tlistinct differences in industry characteristics that are 
more strongly as.sociaied with one rather than the other alternative 
f orm of protection. Specifically, we find that both tariff and nontariff 
trade restrictions are biased toward industries in which the United 
Slates is at a comparative disadvantage in world trade, and away from 
industries in which losses in consumer welfare from protection would 
he particularly large. But there are some real differences too. Tariffs 
are biased tc)ward industries which are low-skill intensive, away from 
industries that are capital intensive, and unrelated to prttduct hetero¬ 
geneity, the concentration of production, and the geographical dis¬ 
persion ol domestit production facilities. In contrast, tioniariff trade 
restrittions are concentrated in industries producing fairly homo¬ 
geneous products using relatively tapital-intensive techniques of 
production that, at the same lime, are not intensive users of low- 
skilled labot. In addition, nontariff trade restrictions are found pre¬ 
dominantly in industries in which production is less concentrated and 
in whith production facilities are distributed acro.ss regions of the 
United States in a fashion consistent with the distribution of popula¬ 
tion and, therefore, voting power in Congress. 


I. The Political Economy of Trade Restrictions 

1 he general approach of this paper is that the structure of tariff 
and/or nontariff trade restrictions across industries is consistent with 
the simple joint maximization of industry profits subject to political 
constraints. In effect, we assume that individual industry characteris¬ 
tics influence the profitability of trade restrictions and that political 
factors similar to those discu.ssed by Cheh (1974, 1976), Pincus (1975), 
Baldwin (1976a), Caves (1976), Caves and Jones (1977), Helleiner 
(1977), Stone (1978), and others are critical in determining which 
industries are most successful in obtaining restrictions on imports of 
competitive producLs. 

The primary information to emerge from a simple analysis of the 
relationship between industry rents and protection is that the profit¬ 
ability of trade restrictions stiould normally be negatively related to the 
absolute value of the own price elasticity of demand for the product 
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and positively related to the foreign elasticity of supply of the prod¬ 
uct.' In effect, the more responsive consumers are to price increases 
induced by trade restrictions and the less responsive foreign suppliers 
are to changes in the international price of the good, the less profit¬ 
able any given level of protection will be. 

Furthermore, a discussion of the profitability of protection assumes 
that imports represent a reasonable threat to sales in the United States 
by domestic producers. If concern for industry profits is important in 
explaining the structure of trade restrictions, we would expect trade 
barriers to be biased away from industries in which the United States 
has a comparative advantage in international trade because imports in 
such industries would be expected to be inconsequential and restrict¬ 
ing them would provide little benefit.* We included a number of 
variables that previous studies have indicated are a.ssociated with U.S. 
comparative advantage or disadvantage in trade with the simple ex¬ 
pectation that trade restrictions should be found predominantly in 
product lines in which the United States is at a comparative disad¬ 
vantage vis-a-vis the rest of the world. 

Obviously, if there are positive terms of trade effects associated with 
protection sufficient to offset the deadweight consumption and pro¬ 
duction costs created by trade barriers, the government can generally 
increase aggregate .social welfare and support for itself by restricting 
trade. But substantial terms of trade effects are not likely to exist for 
more than a handful of industries at present levels of protection. In 
general, increased protection would be expected to promote industry 
profits at the expense ot the general public. In a bn)ad sense, profits 
should be thought of as the sum of rents to entrepreneurs and to 
workers with firm- or industry-specific human capital and/or a strong 
union. 1 he costs to the general public include artificially high product 


' Assuming ihai proleoion is less than prohibitive, that foreign supply tlcpetuis only 
on the foreign price, and that the domestic market clearing price, Pp. is pusilively 
lelated to the degree of trade protection given, we can generate the follimmg reUiion- 
ship between industry prohLs, O, and trade restrictions; 


an 

dr 


= \So + 




dP„ 


dr 


supply..MC„ is the domestic marginal cost of production. 
dem?,^d I, ""'a ‘ ” = ‘^e absolute value of the own price elasticity of 

elastiiitv and ’and P, represent the foreign supply, foreign supply 

uni eou.va^ PrtKlucl. respectively, and r represenis the tarifft.r 

author.^ available u[x)n request from the 

live Idvama.rrV.r'*'' "’'""''''8 Po»ibiliiy that the United States has a compara- 
LL herein lhat trade resiriclions are used to price dist rimi- 

a ph^ rr^eno^ “ monopoly i eL. That such 
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prices, niisallocation of productive resources, and waste in terms of 
adniiiiistralivc costs of implementing and maintaining protectionist 
programs. 

In order to ratioiiali/e the existence of trade restrictions for 
numerous commodities in ihe United Slates, we assume that, up to 
some limiting degree of trade restraint, industry interest groups are 
tiKire responsive to trade policy than consumers are.^ One simple 
ex[)Ianaiion of such an asymmetry in response to trade policy by 
indusiry interests aiul by consumers stems from the fact that infor¬ 
mation about market conditions is costly and imperfect and must be 
ufidaicd as market conditions change, l o the extent that producers 
ate less numerous than consumers and their well being (wealth) is 
more dircctlv affected than the wealth of individual consumers by 
trade poliev, the\ will invest more resources in keeping informed 
about and lx- mote sensitive to trade policy changes affecting their 
industi). UleaiK, the reverse could be true for industries in which 
buyers are more tonecMUraled than sellers and the wealth effects arc 
greatei per tonsumei than they are per producer. Rut suth cases are 
exceptional. I bis asymmetry in responsiveness does not imply that 
sellei concemration need be positively related to protection. In fact, in 
his study of the structure of tariffs in Uanada, (^aves (197(>) suggested 
that concentration could be negatively related to protection. Presimi- 
abh, the argument would Ix‘ that the spoils of protection will buy 
mote votes the more widely they are dispersed. In each industry there 
mav exist some critic al value of .seller conc entration relative to bu)'er 
ccinceniration, determined fty the cost of inarkei information which is 
ret|uired for the industry to obtain protection. And among protecied 
industries there may be a negative- assoc iation between protection and 
concentration, (X)NR. In short, the impact of seller concentration on 
the existence and extent of trade restrictions is ambiguous. 

.So far w'e tiave argued that producers and workers in an industry 
have a greater economic incentive on aveiage than consumers in 
general do to accpiirc costly information alxini the impact of trade 
restrictions cm their prices, profits, and wages anci in coniinning to 
invest in such information gatiu-riiig as market c onditions change and 
old information dtxays or becomes les.s relevant. However, constant 
changes in trade restrictions could easily attract media attention and 
thereby provide consumers with relatively chc-ap and current infor- 

’ A|>.irt ficmi the- olcservaiion ihai iicm/cro tiacic- rcsiriclioiis txi.st lor iiidiisirK's ihai 
cti)<»y no obvious tcriiis of Uadc effee Is. we provide no direct test olOui assiiinplion that 
industry iiucresis arc more iespon.sive than consumei inleieM.s to chanyces in trade 
policy However, the notion that iridci.siry iiucresis are weijchied more heavily than 
con.suiner interests in determining the strut ttire ol protection is consistent with earlier 
empincal work by Ray (1974). Baldwin (19766), and Caves (1976). 
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mation regarding the impact that proposed changes in trade limi¬ 
tations would have on their welfare. I'herefore, pressure to alter 
trade restrictions may follow a fairly discrete time path in order to 
permit industry interest groups to capitalize on the high cost and Itiw 
quality of information available to consumers.'* Maintaining a given 
quota in the presence of expanding demand or foreign supply would 
be comparable to imposing higher and higher explicit tariffs over 
time. Conversely, if demand for a product is declining and/or foreign 
supply is shrinking, a quota is comparable to a declining explicit tariff. 
Therefore, assuming some stickine.ss in policy changes, quotas or 
other nontariff trade restrictions would have some relative advantage 
in expanding markets while tariffs would be somewhat advantageous 
for protecting declining product markets.^ In our empirical analysis 
we will attempt to test the relative explanatory power of dynamic 
demand, AD, and supply, AS, conditions in tariff and nontariff trade 
restriction regressions. 

Finally, simple demand and supply analysis can lx* used to demon¬ 
strate that the deadweight lo.ss associated with a gtven trade restriction 
will be positively related to the ab.solute value of the own price elastic- 
tty of demand for the product, t), and positively related to the domes¬ 
tic elastitity of supply, C/;. lo the extent that the government is 
strongly ton<erned with the overall welfare effects of trade restric¬ 
tions, tariff atid nontariff trade barriers would be negatively related to 
the absolute value of the owti price elasticity of demand lor the 
product and negatively related to the domesiie elastic ity of supply for 
the product. 

We can siunmariz.e the discussion to this point by indicating the 
rat iff and notitariff trade barrier regressions to lie estimated, with the 
expected sign of ttie coefficient sjiecified below each variable; 

T = /'(T), e„, X .A',„ C'.ONR, Al). AS) (1) 

'• ~.-, ?. +, -I- 

aiicl 


^ - f’(v- Cf. c/). A',.A„. CONR, AD, AS, r) (2) 

.. ?. +. -f, -I- 

wluje A' represents a measure of nontariff trade restt ictions, the 
ambiguous sign on the elasticity of foreign supply, e^, reflects the fact 
tliat the sign on the foreign supply elasticity would only f>e positive in 


dvna;;:;;the ,x-lauvc „,encs »( carilts .,u.„as in chc cn.c-s. .,t 
< .97.5), •’> ('«70). Fcshc-lson and Flatters 

los./v!'Xr,h iilecl Xl "1 

Kinousiy applied to declining industries m the United States. 
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the noncompetitive case. Cd represents the domestic supply elasticity, 

X . X„ are alternative measures of U.S, comparative advantage, 

C’ONR is a measure of seller concentration, and AD and AS represent 
the expected {percentage growth in demand and supply at the time 
that trade restrictions are set (both are expected to be more positively 
significant in explaining nontariff than tariff protection). 

In the empirical section we treat the implementation of tariff and 
non tariff trade barrier.s as a sequential process. Tariffs predate non- 
tariff barriers in virtually every case, and nontariff trade restrictions 
are estimated as functions of both tariffs and of the political and 
ectniomic factors we identify as important explanatory variables, 
latci we test for this sequential effect and find evidence that it does 
exist. 


II. Empirical Results 

T he empirical result.s di.stussed in this .section were generated from 
data made available by the U.S. International Trade Commission in 
More precise definitions of the variables used in the regressions 
are giveti iti the Appendix. The observations consisted of 225 four- 
digit rnanufacturirig industries in the United States in 1970 and are 
specified in a separate appendix available upon request. 

Alternative forms for tariff regressions were estimated and are 
presented in table 1. while estimates of alternative specifications for 
noniariff trade harrier regressions are presented in tables 2 and 3. 
For regressions (I. ])-(i ..3) in table 1, the dejiendem variable u.sed was 
a weighted-average nominal tariff measure for each industry in 1970. 
In effect, the f out-digit tariff rates were calculated by weighting the 
tariffs of less aggregated c:omponenis by their import shares within 
the classification for the given four-digit industry. Regressions 
(1.4)-( 1.0) differ f rom the earlier ones only in terms of the specifica¬ 
tion of ifie dependent variable. In regressions (I.4)-(1.6) the depen¬ 
dent variable is the four-digit industry nominal tariff rate calculated 
as the simple average of component nominal tariff rales. 

The dependent variable in table 2 is ati Index of the Incidence of 
Nontariff Barrier.s in the U.S., 1970, constructed by the U.S. Tariff 
Commission.® I n effect, the index measures the comprehensiveness of 


‘ T he iiutfx was construcied b) tlio U.S. I arifl (aiminissiim and is explained in Tradr 
Biitnrrs, Rcpori to the Coniniittcc on Finance of the U.S. Senate, Pari 2, pp. 160-72. 
Wa.shington, D U., April 1974. The quantitative re»irirtic>n.s included and the weighting 
scheme used were as follows: Bilateral Quota (0.91); Global Quota (1.96); Quota 
(unspecified) (1.36); Prohibited Imports (embargoes) (1.36); State Trading (0.91), 
Automatic Ijcensing (0.4.6); Liberal Litensing (0,4.6); Discretionary Licensing (0.91); 
Licen.sing (unspecified) (0.91); Minimum Price System (1.36); Seasonal Restriction 
(0.91); Re.striction (unspecified) (0.91), Export Restraint (VERs) (1,36); Suspended 
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TABLE 2 

I'.S. NoN'iARiFf 'LradI' Rkstkk:ih>n.s (Tobir) 


Dufkndknt Variabi.f 


Index ol ihc Inndeiice oi Noiuarifl Barriers in the U.S.; J970 


iNotJ’tvrwvf -- 


V'AKlSIil )S 

(2 1) 

(2 2) 

(2.3) 

(2.4) 

(2.5) 

(2.6) 

t.oiisiaiu 

1 91 

.26 

6.68 

2 12 

.38 

6.16 


(1 0'->) 

(.11) 

(2.43) 

(1.11) 

( 16) 

(2,13) 

nilNMOS 


I I X l()-2 

,1 X 10 < 


.12 X 10-* 

- 1 X 10 ' 



( .5-1) 

(.38) 


(.46) 

(.03) 

CONRII. 

- 04 

-.05 

-.06 

-.04 

- 0.5 

-.06 


(1 92) 

(2 36) 

(2.86) 

(1.98) 

(2 44) 

(2.74) 

SKll.Ll) 

- ,5 0,5 

-6.78 

-2 42 

-6 26 

-8.16 

-6 04 


(1 1.3) 

(1 19) 

( .53) 

(1.42) 

(1 82) 

(1.37) 

SKINRI) 


4.99 

2 14 


.5 95 

4.06 



(1 OK) 

(.47) 


(1 29) 

( 89) 

|•..S(..VI.((i7) 

2 49 

2 7H 

.34 

1.61 

2.01 

-.31 


( SH) 

(1 00) 

( 12) 

( .59) 

(.73) 

(.11) 

l..ABI\ 1 



- 12 69 



- 10.05 




(3 76) 



(3 07) 

KI.R.A 


1 X lO '' 



1 X 10 “ 




(2 14) 



(2 02) 


WU.WI’U 

3 93 

- 1 004 

-3 75 

-3.76 

-3 87 

-3 .59 


(3 62) 

(3 6.3) 

(3 36) 

(3 49) 

(3..53) 

(3 26) 

JtlMP 

- 07 

05 

-.03 

- 08 

- 06 

- .05 


( 75) 

(.5(5) 

(.34) 

(83) 

(.64) 

(..50) 

AC.ON' 

1 to 

1 34 

09 

1 45 

1.3.3 

24 


(1 01) 

( 95) 

(.07) 

(1.04) 

(.93) 

(.17) 

T.SVV 1 I h 

,20 

20 

26 





(2.97) 

(2 99) 

(3 72) 




f.S.S.MPI .AK 




18 

18 

,19 





(2.70) 

(2,70) 

(2.84) 


Sf»i> — J'jH <i| ihf obM'fV jiioiis )Mri ihr hunt \.iJwr oj fern 


iidiiiarilf prdU'<ii<)ii in an indnsiry. Fificnn lypts of notuariff tracU’ 
resuiclions wt'ie lonsidiTcti and assigned weights reffeeting iheii 
relalivc effetiiveness in liniiliiig imports, 'Fhen ihe extent of ntintarif f 
protection given it) eat h industry was laltulated ;is the percentage of 
actual to potential protection to he dericed liom all l.'i nontatiff 

fmptiit RcbtriOiDii (0.91), Mixing Kt-guUlions (091). the weights on ihf right were 
determine-d hv assigning eat li resirk tioii a inuntH-r from 1 to ,H, high til low, depending 
oil Its degree of resultitveitesx. The average of those iiunibets was divided into ihe 
assigned tiumljcr for eath restrii lion, lesiilting in iJie weights .shown Index values loi 
indiisirv categories were then ohcaiiied as ftillows l-or any given toniiiiocfiiy. the sum of 
its spetiht weights divided hy the siini of weights foi the 15 categories (11.35) ex¬ 
pressed in percentage terms yielded the (icrcentagc ol maximum nontariff restraint 
of trade given to that iiidusuy. Total imports of major trading eountrics were used as 
weights If) aggregate the ba.s)c product category data .Such a weighting means that 
restric tions in a heavily traded category were counted more than restrictioni in lightly 
(world) traded categories. In cases in which the listed restrictions were known to apply 
to only part of a given product category they were arbitrarily counted only half as much 
as restrictions known to be applicable to an entire categnry. 
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I ABLE 3 

U.S. NoNTAHtKl- TraDK RESTRICTIONS (Priibii) 
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Depe.ni>en I Variabi.e 


Dummy Variable loi Noiiiarifl Barriers in ibe U S.- 1970 
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(2.37) 
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(I..34) 
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(2.14) 
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(1 0.5) 
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.52 

48 
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(1 46) 
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(.05) 
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06 

06 
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f resti i( lions. Industries with high perreniage index values are ihcn 
/ viewed as having received more comprehensive noiiiariff protection 
( than industries with low percentage index values. The dependent 
variable in table 3 is simply a dummy variable which et^uals 1 if the 
index foi noruariff barriers is positive and zero otherwise. In every 
other respect the equations estimated in tables 2 and 3 are the same. 

Ihe.e are three different speciheations for each of the tariff re¬ 
lationships and six different specifications for each of the nontariff 
re aiionships liecause we have taken an eclectic approach to the em¬ 
pirical explanation of U .S. comparative advantage. Within each subset 
! 0 l equations the alternative regressions differ only in their specifica- 
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noiuariff trade barriers regressions with a negative significant 
cuefbciem. 

('.aves speculated that a negative relationship between trade restric¬ 
tions and seller concentration might reflect a bias by politicians to 
prenide protection to industries with many producers in which the 
benehts would be widely dispersed. In order to test that hypothesis 
along with the hypothesis that protection has been sequential with 
nontariff trade restrictions supplementing predetermined tariffs, we 
ran the simple siinuitaneoiis model summarized in table 4. The results 
ptesented in table 4 are obtained from two-stage least-scpiares regres¬ 
sion models. I'he two models differ only in terms of the designated 
mea.sure for tariff rates. In the first model, we used the weighted- 
average tariff measure, fn the second model, we used the simple 
aventge tariff mea.sure. In both models we used the simple O-I 
clunimy measure of nontarif f barriers to trade. The results obtained 
in table 4 are (|ualitativelv invariant with the choice of the subset of 
compaiative advantage variables imhided. I herefore. for the sake of 
brevity , ihr c-stiniafors presented m table 4 are offered as represen¬ 
tative. fhe principal findings are that tariffs do positively and 
significantly affect nontariff trade restrictions, while nontariff trade 
restric lion.s have no significant impac t on tariff determination, .sup- 
fKHting our sec|ueniial model, and that nontariff trade restrictions are 
negatively and significtinilv related to iKith seller concentration and 
the geographical concentration of production, (iEOG.** The negative 
sign on GKOG means that nontariff trade restrictions are biased 
toward industries in which produc tion is distributed ac ross regions of 
the Ibiited .States in a manner similar to the distribution of population 
and therefore toward indiisiries with substantial repiesentation in 
G.cjngre.s.s. 

We used a measure of scale economies in If.S. manufacturing in 
1967. E.SC:Af.(67), as a proxv for the domestic, elastic ity of supply, 

"(iKCXi i.s mcasuird ac follows incles value ~ 


1 

N' 


vs, 


\' 


vs, 


Vi 


where VS, = value of shipments in region i, i = 1.4, and P,,,, - population ot 

region i, r - 1.4 1 o the extent that prcHluciioti in industry i is distributed acro.ss 

the north, south, ceniral. and western United Stales in a fashion similar to the general 
population, the index value will approach ecro. The basic data for the index are from 
the Como of Manu/acturfi for l%7. The lack of concordance between If'.SIC and Sit 
data means that 27 observations were lost and that ublc 4 was estimated using 198 
rattier than 225 observations. Apart from the fact that the inclusion of Cli-OG in the 
regi cssion runs for tabWs I -3 would have similarly reduced the sample size from 22.5 to 
198. the negative significance of GF,OCi in the nontariff dummy etjuations in table 4 
indicates that it should have been included in the earlier estimates as well. 




TABLE 4* 


SiMIJLl ANKOIIS EsilMAlfcSOl- I'Alill'l' AND NdNIARIFI- 
Trade Restrk.tions in the L'.S., 1970 


WEIfiHTED-AVElRAf.l. 

rARII-ES 


SiMI'I.E Averai.E 
I ARDES 


Df|x-ndeiii V’aiiables 


Indu’enden 1 
Variabi es 

('.onMani 

F I DNMOS 

(:()NR41. 

SKII.l.D 

SKINRU 

I-Sf.AI.((i7) 

I.ABIM 

kl.KA 

VVI)A\ I’l) 

A IMF 

A(;o\ 

IT S\V 1 1 1- 
I’l SSMI’I AR+ 
I’DlML't 
(;F()(. 

I SI.IDMk 
I'sciidci-rt* 

Fik(4ih(Ki(j 
lado lesl 


i. Wfi^hied- 
Avevajjc 

1 anfls 
(4.1) 

Oumiiiy Variable 
liM NoemaetH 
Barriers 
(4.2) 

I'.S .Simple 
Average 

1 ariils 
(4.3) 

Ournniy Varidhle 
(m 

Barriers 

(4.4) 

OLSQ 

I’robil 

OLSQ 

Hlobii 

9 69 

-2 6.3 

16 02 

-7 22 

(y 76) 

(1.34) 

(4 OH) 

(2.08) 


- IH X 10 ’ 

(1 61) 


-.23 X 10 “ 

(1 96) 

03 

- oy 

03 

- 02 

(1.97) 

(•y HI) 

(III) 

(2 91) 

-23.5'.’ 

10 HO 

-27.02 

14 33 

(4 49) 

(1 97) 

-0.(v() 

(0.34) 

(4.61) 

(2 1.3) 

0.21 

(0.13) 

- 4 '.'4 
(1 04) 


-4 .S3 
(1 06) 


10.19 

-7 42 

79 

- 33 

14) 

(4 02) 

( 1.’,) 

(3 41) 

y X 10 1 

1 X 10 * 

.2 X 10 • 

13 • 10 * 

(1 77) 

ll KHl 
- 44 
(2 29) 

- 9 X |()-i 
( 33) 

.3.3 
(1 17) 

32 
(2 H7» 

(1 36) 

(2 21) 

- 46 
(2 3H) 

- 29 

(ion 

HI 
(1 .36) 

«y 

( 94) 


27 
( 2H) 

.64 
(2 H7) 

1.10 

- 92 

1 73 

- 1 .33 

(.94) 

(2.00) 

(1 32) 

(2 3.3) 

.33 


27 

1 3 ,31 

31 

10 23 

31 


49 10 


49 10 


rt-klrd t„r aTT'*' Ml 11 Ml ...Ir. 27 ..1 ,l,r 22^ an..,, had n. Iir 

* IM SWI r >" '»•*• * Air ham ni, lEW .a»a,va..nn. 

",«n.mr„ul ’ lE.rSSMrfAR ami Dl VT.-nniEw 

' nil, ihr lirs, ElaRr „f (hr iwn-iURr lra.nM|i,arr. rMiinalna, (inRnliiir. 
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I'hc disiiission of Section I suggested that ESCAL(67) should be 
negatively ami significantly related to tariff and nontariff trade re- 
stri< tioiis it the governmem is strongly concerned about the overall 
welfare costs of protection. ESC;AL(67> appears in all of the weighted 
tariff and nontariff trade barrier dummy regressions with the ex- 
}x-cieii negative sign and generally significant coefficient. While the 
sign of KSCAL.(67) is negative, it is le.ss significant in the simple tariff 
regression.s. 

In Section I we indkated that we expected to find a negative and 
significant coefficient for the absolute value of the own price elasticity 
of demand foi the product in the tiade restriction equations. On the 
assninpuon that tfie own price elasticity of demand is reduced by 
product heiemgeneiiv, we u.sed a measure of product differentiation, 
U DAV'PD. as an inveise measure of the absolute value of the own 
jtrice elasticity of demand. Consequently, we expected to find a posi¬ 
tive and significant coef ficient on the product differentiation variable 
in die tariff and nontanff iratle barrier regressions. The relative 
negative significance of WDAVPI) in the nontariff trade barrier re- 
giessioiis compaied with the tariff regressions may be partially rc- 
iateti to anoiber factor that we have not discussed. C^learly, nonpricc, 
c)nantiiaiivc' restrictions are more easily applied and enforced if 
pictdiicts are fairly standardized. Con.sec)ueiuly, the negative 
coefficient on product heteiogciieity may partially reflect an admin¬ 
istrative pretereiice bv the government for nonpriie restrictions for 
homogeneous piodut ts. 

Earlier we argued that while market conditions change coniinu- 
otislv. (I may still make sense for iudusirv interest groups to push lor 
changes in tiade restiictions in discrete lashic>n. If so, we would 
exjHTi taiiffs to Ik- relatively more productive in industries that are 
ex pec ted to comiac i and nomai iff trade ban iers to fx‘ relaiively more 
productive in expanding industiies. We used the jjercemage growth 
in a[jparent domestic cc)nsum|>tion iHUween 196.5 and 1970, mea¬ 
sured by the percentage growth in domestic shipments plus imports 
minus exports, as a proxy for dc^mund shifts, AD, and the |ycrecntage 
growth in inqKU'is between 1965 and 1970 as a proxy for supplv 
shifts, A.S. (deaily, both the apparent domesiit con.sumption variable 
and the import expansion variable reflec t both demand and supplv 
phenomena and in that sense arc pcxir proxies for the effects of pure 
demand and supply shifts that we are trying to mea.siirc. With that 
(aveai in mind, we simply report that neither of the variables included 
to measure dynamic influences has any significant explanatory powei 
in the tariff or nontariff trade barrier regre.ssions. 

Unfortunately, we have no goiHi measures of the industry-specific 
revenue effects assoc iaied with tariff and noniariff trade restrictions, 
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or of the administrative costs of implementing alternative forms of 
trade restrictions across industries. Consequently, all of our estimates 
suffer from the potential biases associated with important left-out 
variables. 


III. Conclusions 

We presented evidence that both tariff and nontariff trade restric¬ 
tions are found predominantly in indu.stries in which ihe United 
States has no comparative advantage vis-a-vis the rest of the world and 
away from industries in which the deadweight losses to consumers 
from protection would be high. In addition, we presented direct 
evidence that nontariff trade restrictions have been used to supple¬ 
ment tariffs and, thereby, offset the trade-liberalizing effects of 
post-World War II tariff agreements. 

We also presented evidence that there are significant differences in 
•the industrial characteristics of industries with tariff proiecticjn com¬ 
pared with those with noniariff trade protection. Specihcally, tariffs 
are positively and significantly related to labor intensity and nega¬ 
tively and significantly related to the capital/labor ratio while just the 
op|)osite is true for noniariff trade restrictions. In addition, nontarifl 
trade restrictions are negatively and significantly related to l>oth seller 
fconcetttralion and geographical concentration in an industry while 
!both < haractet istics are positively and insignificantly related to tariffs. 

I 

Appendix 

Definitions of Variables 

])\(li'pctulnil I (iruihlrs 

' I l)NMOS—Hroduct c\cle prow: unweigliiecf average of Schc-dule B first 
trade dates (urresfMinding to r(;.SK; as ol |aiiiiar\ 1974. 
.,()NR4L —('onceinralion Ratio, 1970: |XTceiitage of sinpinents aicoiinted 

lor bv the four largc'si firms in the nufusirv, 
iKII.l.l) —.Skill>, measure. 1970: professional and kincfic'd woikers. plus 
managers and adrninisiralors (c-xrepi farm), plus craflsmeii 
aiicl kindiecf workcTs, as a |wrrc-nlage of total emplovnieiu. 
Based on three-digit .SICdala with values re|jealed at four-digit 
levels. 

tlNRl) —Percentage of scientists and engineers in R & U in 1970. Bascxl 
on two-digit Sl(' data with values rcjseaied at the four-digit 
level. 

St.Al,((j7)—tccinomies of Scale measure, 1970: value of the exponent in 
the regression ecjuaiion f = A.V®, where V is the ratio of value 
added in plants employing A' persons to average values added 
lor the indusiry, and K is a constant. 
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I.ABIM 

KIRA 

WDAVPI) 


AIMP 

ACON 


t;K(X. 


—Labor liileii.sity Ralii>, 1970 (payroll divided by/ value added). 

—{:apiUil/LalH)r Ralio. 1970: Total capital stock divided by em¬ 
ploy iiiciil (iboii.santl.s of dollar-s). 

—V\’fighied Average of Product Diffeientiaiion measure, 1970; 
The mea.siire is (he coefficient ol variation in the unit value.s of 
exports destined lo different ctnnilrie.s weighled by U.S. export 
sh.ires, i.e., the slaiulard deviation ol U.S. export unit values 
neighled by export shares divided by means of unit values. 

— I’ercentage change m iiniKtrts between I9fi.') and 1970. 

—Percentage change in apparent I'.S. cotisumption between 
19ti!i and 197t). where apparent LI.S. consumptton is measured 
h\ the calne ol U.S. shipments plus import.s minus ex|X)it.s. 

—See n. 9 


Di jx'tiili'Jif I '(inntilr\ 

1 antis — USWl IP—U.S weighled-average tat ills lor 1970 and USSMP- 
[ AR—U.S Simple average taiills lor 1970 are taken from the GA'I I 
Initfl Slud\, (ieneva. 1970. 

Index ol \oniaiil( h.iiiieis—Index ol the Incidence ol .Montariff Barriersm 
the I S.. 1970. 1 he inclc x was consirncled b\ the IhS. I'arifl ('.oinmi.ssion 
and IS explained in I uuL lUinii'n, Repot t to the Uommiitee on Finance ol 
(he U S Senate. P.iii 2. (ip. l(iO-72. U'asliington. I>.(;,. .April 1974 See n ti 
Diiinnn \’ai iahle lot Nontaiill Barrieis in the U.S,, 1970—))l'V12—I'he 
value ol the diinimv variable was sc't ecpial lo I if the index ol nonianll 
bairiers was nonzero lot a particular product and /eio otherwise 
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Inflation, Corporate Income Taxation, and the 
Demand for Capital Assets 


Richard W. Kopcke 

hi'ilnat Rfwn'f Bonk of Bn\ton 


1 hf demand (<>i lapital is not systeniatieally related to either the 
le\el oi the rate ot (Itangc of “effeetive" income tax rates on corpo- 
late c.tpiial assets. Ristiif' inflation during the last 10 years has raised 
ifie nsci tosi of ta|)iial for durable assets relative to that for short- 
Ihed .tssefs eteii thongli this iitflation has raised effective tax rates 
tor more dtirafile capital less lhati for short-lived assets. Even with 
replaiemetil-cosi deitrecialion allowances, the level and pattern of 
irtvesimem incentives prohahiv will continue to varv with the 
mllalion rate. 


I he past decade of high and i i.sing inHation rates has encouraged the 
sttich of L’.S. ituoiiie tax cckIcs attd inflation's influence on the de¬ 
mand for tapilaf assets. .Many have examined the ‘'effective income 
tax rate" oti c apiial to descrilte how inflation has altered both the level 
and pattern o( ittvestmcnc incentives. In a recent article, for example, 
Atjcrbach (1079) observes: ‘‘Willi a positive rate ol'inflation, a historic 
cost depreciation rule bia.ses the choice of as.set life toward greater 
durability ' (p. 621). .Such a conclusion, based on the behavior of 
effective income tax rates, contradicts the spirit, if not the letter, of 
much traditional analysis of investment incentives and the demand 
for capital (see, e.g.. Black 1959; Brown 1962; Samuelson 1964; Hall 
and Jorgenson 1967; Mu.sgrave and Musgravc 1976; and Boadway 
1978) so it warrants examination. 

One principal cctnclusion of this paper is that the demand for 
capital is not systematically related either to the level or to the rate of 

1 fie analysts and conclusions are not neccwanly endorsed by the Federal Reserve 
Bank ol Boston or the Federal Reserve System. 

iJiium/U nfEffrtumy. ISWI. vni HW no t| 

rc^ 1‘WI hi The I nivrtiily ol t.hitj)(<, 11022.Wmmi/WOl OWrtJOl .111 
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change of the effective corporate income tax rates on capital assets. 
Another principal conclusion is that the rising domestic inflation rate 
of the past 10 years not only has reduced investment incentives 
generally, but it has reduced the incentive to acquire more durable 
capital goods most severely. Even if U.S. corporate income tax codes 
were amended to embrace replacement-cost depreciation allowances, 
investment incentives would not necessarily be independent of asset 
service lives nor would the relative demand prices of capital goods 
necessarily be independent of the inflation rate. 

I. Inflation and the Demand for Capital 

Assuming perfect markets and certainty of foresight, investors 
maximize their wealth by purchasing capital yielding no less than p, 
the real discount rate for after-tax corporate cash How. In other 
words, the real price of capital, P (the output price is the numeraire),' 
equals 

e-^'f(l - T)e-*‘Qk + rPD(l, 6)e-"']d/ + P ■ ITC, or 

0 (1) 

*rpz*p- ,n:. 

where t is the corporate tax rate, 6 is the rate of decay of capital, Qjc is 
the marginal physical product of a unit of new capital, D(f, 6) is the 
schedule of depreciation allowances per dollar of investment under¬ 
taken t years ago, ITC is the rate of investment tax credit, Z is the 
present value of depreciation allowances per dollar of investment, 
and n is the inflation rate. Formula (I) represents the present value of 
after-tax operating income plus the present value of the tax shelter 
afforded by depreciation allowances and tax credits; D{i, 6) is ‘‘dis¬ 
counted’ by 11 because depreciation allowances are linked to the 
jonginal purchase price of capital assets, so the real value of these 
allowances must decline as nominal capital goods prices rise. 

In equilibrium, then, 

C.S’ = P,(p + S.)(l - tZ.)/(1 - t) = Q,.. 

CE = P,(p -t- 8,)(1 - tZ, - /'rc)/(l - t) = Qir.. 

jr the user cost of capital (Hall and Jorgenson 1967)—for non- 
residential structures, CE for equipment—equals the marginal physi¬ 
cal pnxlur' of capital. Becau.se structures and equipment are each 

' For the most part, the notation matches that in Auerbach's (1979) anicie. P may 
^ua unity as it does in Auerbach's article: however, it is often useful to relax this 
umpuon because the relative prices of capital assets are not constant. 
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represented by a variety of capital goods bearing different rates of 
decay while off ering different flows of capital services, d equi- 

libi inin for both equipment and structures demand prevails when 



/ 

cp(K) 

d(S,) 

/ 

f 

.4(6,,) 




(3) 


Relerring to the etpiilibrium condition described in expression (2), 
ii is evident that a changing inflation rate, fl, can alter the demand for 
capital. Higher inflation rates, other things equal, depress the present 
value of real depreciation allowances; this, in turn, raises the user cost 
ol capital. Krom (3). it is also evident that if higher inflation rates do 
not in( lease the user costs ol all capital goods proportionately, the 
pattern of insestment demand will change as well. 

In ec|uilibrium. 


CS(S,) ^ A (8,) 
CS(S.,) d(6i) ■ 


Should (hanging tax rates tend to increase C'.S'(S|) relative to 
PA^\) and ''id (ompcn.sate .so that equilibrium is maintained. Of 

<<Hii.se, if the new balance requires the price of more durable struc¬ 
tures, to decline relative to that of less durable structures, 

tnid il the supply of structures varies with pi ice, then the stork 
ol nonresideniial structures will become less durable. The influence 
oi inflation on the demand price of capital and the choice of asset life 
depend.s on the sensitivity of the pre.seiit value of depreciation allow¬ 
ances to changing inflation rates, d/Jdll, as well as the importance of 
these allocsances as a tax shelter, (1 — t/ — 

Table I shows how rising inflation rates reduce the present value ol 
real depreciation allowances for a variety ofcapiial goods according to 
curreni L'.S. (orporation lax law. The table illustrates two important 
points, i irst, the value- ol depreciation allowances is lower lor longer- 
lived capital assets. Second, with higher inflation, the value of depre¬ 
ciation allowances drops at a faster rate lor more durable capital. 
Therefore, the modest role of depreciation allowances in the cash 
flow accruing to durable capital does not mean that these assets are 
necessarily le.ss sensitive to changing inflation rates tfian are less dura¬ 
ble as.sc’Ls. 

In lad, as shown in table 2, the tax shekel offered by depreciation 
allowances is sufliciently important lor long-lived capital that higher 
inflation rates increase the user tost of capital for these assets more 
than that of less durable assets. As the annual inflation rate rises from 
7ero to 6 percent, other things equal, the investment incentives for 
40-year structures fall 22 percent, but the incentives for 20-ycar 
structures fall only 19 percent. Investment incentives generally drop 
much less for the tabulated categories of equipment. 



CORPORATE INCOME TAXATION 


•25 


TABLE 1 

Thi. Prksint Vai.uf ok Real Deprecation Ai.uiwani.es 
PER Dollar ok Investment 


A. Producers’ Durable E'^uipment* 




Asset Liketime 


Inflation Rate 


(Years) 





m 

.5 

10 

15 

0 

.44 

43 

.42 

2 

.42 

.40 

,38 

4 

.41 

37 

.34 

(> 

.39 

.35 

31 

H 

.37 

,33 

29 

1(1 

•36 

31 

27 

20 

.30 

.23 

.19 

30 

.2.5 

18 

14 


B .Nonrk.sidkniiai 

•SlRCI ICRfst 




AsSEI LlKUlMf 


Ini lA 1 ION Rai*. 

r'i) 


lYears) 


20 

30 

40 

0 

,41 

38 

36 

2 

35 

31 

.28 

1 

30 

,26 

.22 

0 

,27 

.22 

19 

8 

21 

19 

.16 

10 

22 

17 

14 

’0 

15 

10 

.09 

10 

1 1 

.08 

06 


S*i D --K nlriCN ctfiLil iKimui^l ^irjn c<i.«um miilliplirtl lt\ KS ihr nwr^lixal i itt (n>T.*lr iiMiunr lav lale 

liMotinied hi (1 0I5III v II). vvhric I f> jxicrm i\ ihc rral allci-lax iivli-adjiiMrd divcoiml i.tlr i%ar Bnmn 1962) 
jk) it IN iltr iniUuiin Mir 

• %Dni»n.il tiepi m.iiKin alUiw.iiicrs .itc calriiLiird UNing ibr vum ol-ihr vcjrN -dij<ti> formuU 
t ,\(>riiin<il (irprcxMiiDii .liloxaiMrN art* ialtuUicNl iisin); lUc I**C> ix-ftmi-dnlitiinj; <i»riinilj niih a »%*rtch m 
ItvUKtu-liiu* alUiwaiurs .dUM onr-iltitd uf Oh* jnsHa Mr has )vsnn(>«1 

[ Bciausc ri.siiig inllation reduces the present value of depreriaiion 
^liowaiues most quickly lor the longest-lived assets, however, the user 
to.sl ol capital for these assets becomes progressively less sensitive to 
mi tiler inc reases in the inflation rate. For example, if the inflation 
rate were to rise from 6 to 10 percent, investment incentives would 
rop another 5 percent for 40-year structures compared with an 
i< remental decline of 6 percent for 20-year structures. Should the 
illation rate exceed 20 percent, the erosion of real depreciation 
llowantes would be so severe for the shorter-lived structures that 
inger-lived structures bec'ome relatively more attractive investments. 
Despite a popular contention that the value of the depreciation tax 
ielter, hence its annual loss in value due to inflation, is greatest for 
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TABLK 2 

Inkatkjn ani> Investment Incentives 



A < rKs’ 

DuKABIV. f'.gillPMFNT 


- u - - - - - - 

-- “ 

As.set Lif(time 


InH \ 1 ION Ka IK 

(^') 


(Years) 


f> 

10 

)5 

0 

1.00 

1.00 

1.00 

<> 

06 

.94 

.92 

1 

.98 

.89 

.86 

ti 

.90 

.85 

.81 

8 

,87 

.81 

.78 

10 

.8.S 

.78 

,75 

L'O 

,77 

70 

.67 

.80 

72 

.65 

.63 


H \ONRKMmNlJyi SlKCt n KES 




Asset Likktimk 
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less durable assets, it ts ibe jKTeentage change in the effective pur¬ 
chase price of capital, P(1 - rZ - ITC), that influences investment 
incentives. Table 2 shovs’s that higher inflation rates raise this effective 
purchase price most for more durable, not less durable, assets lot 
relevant inflation rates in the United States. The more durable the 
as.set, the more its depreciation allowances are postponed, hence the 
more inflation may "distount” the value of these allowances befou 
they may be claimed, as shown in the first table. Consequently, high 
inflation rates discourage the purchase of longer-lived capital more 
than short-lived assets.^ Ironically, the class of investment goods mo's! 

* This is true wticnever the pattern of invesuncnl demand for high inflation rates » 
compared with the pattern of demand for inflation rates near r.cro. As explained in the 
text, however, as the inflation rate rises from an already high level, the relative demand 
price of some longer-lived awets may rise. 
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needing “protection" from the high inflation rates of the past 15 

years_nonresidentiaJ structures—has not received the benefit of the 

tax credits and accelerated depreciation granted to producers' dura¬ 
ble equipment. 

This shift in the relative capital costs that inflation induces will alter 
investment demand to favor less durable capital goods, thereby de¬ 
pressing the relative demand price for more durable capital assets. As 
a result, the stock of more durable structures, other things equal, will 
fall relative to that of less durable structures. To the extent that 
structures and equipment are substitutes, the stock of structures 
generally will decline relative to that of equipment. Of course, the 
higher user cost for all capital goods implies a declining capital-labor 
ratio in addition to this substitution among varieties of capital a.ssets. 

Assuming that the real price for all capital goods equals unity, a unit 
of output may be consumed or installed either as short-lived equif>- 
ment or as a very durable structure: consequently, changing rates of 
inflation cannot affect asset prices. Rather, from expression (4), if 
CS(S)/A{S) for a more durable factory exceeds that for a less durable 
factory due to increasing inflation, investors will install less durable 
structures; more durable factories are now economically obsolete. 

In .summary, with current U.S. income tax codes, which link depre¬ 
ciation allowances to the original purchase price of capital assei.s, 
changing inflation rates alter the relative user costs of capital, other 
things equal. Higher inflation rates initially raise these user costs more 
for long-lived assets than for short-lived assets, thereby reducing the 
average service life of the capital stock. 


II. Inflation and the Effective Corporate Tax Rate 

Auerbach (1979) defines the investor’s implicit discount rate as 

V ^ c - 8, (5) 

and then the effective corporate tax rate, 

0 ^ {v — p)/v. (6) 

In view of expressions (2) and (3), it is p>ossible for ^ either to rise or to 
fall with the service lives of capital assets. Consider, however, the 
simple case wherein d (6) equals unity and prices, P(8), are flexible for 
all assets (the definition of a unit of capital has eliminated the appar¬ 
ent differences in productivity among pure substitutes); then (from 
14]). C.S^S,) = 08 ( 82 ) and fl(8|) > ®(8t): The effective tax rate on the 
more durable factory is largest, though the user costs are equal. 

As explained in the previous section, a higher inflation rate will 
tend to raise the user cost for long-lived structures more than that of 
less durable structures. A higher inflation rate, then, will also increase 
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,:A^nnal sfruftures, other things 
slock of tionre nnitv. from (4i 


» uh pc increases less 

?-*'■""" rt, 

oHVrml'r.x rm., «, on less durable eapital rises more than that oil 
/on^oi-lived .issets, ihe tapiiaf sKH k is ttunpriscd ol more shorj-Jived 
assets ssiih higbei inllation rates berause(the price of'the more 
dutable asset) has fallen relative lo/'dSj) in et|uilibrium, as described 
after expression ('!). 

If .((S) rises iviifi S and if the prices of all capital were unity, a 
mediuiii-fived asset may have a relatively high user cost and yet be the 
asset in greatest deiiiatid due to its high productivity, d(5), (am- 
setiuently, the effet ti\ e tax rate, for this asset may exceed that for 
many alternative assets not favored by investors. 

Assume, for S, c Sj < d:,, that 




O'(«..,) 


, and 


d(fi,) ,-/(«,) AIS:,) 

f:s(s,) < cs{s,) < cs(s,).-' 

ft is reasonable to assume (:S’(S) < I (see n. 3); therefore, 

fl(8,) = (f;,S(8,) - s, - py|f;.S(8,) - 8,j > 

= [a'fSj) - 62 - pj/(f;.S(6.d - Sj] > 

0(8,) = K;,V(S,) - 8;, - pl/K-'.V(8,) - 8,1. 


( 7 ) 


(«) 


I he medium-lived asset is the optimal investment acc ording to (7), 
but expres.sion (H) reveals that its effective tax rate is not the lowest. 

Just as t.he level ot d is a poor gutde to investment incentives, .so the 
change in the effective tax rate is a pemr guide to the change in 
investment incentives. From (7). the short-lived asset ( 8 ;,) will be pre¬ 
ferred to the medium-lived a.s.sei once inflation is high enough to 
cause 


f'Vfg.) d(5,) 

C.S'( 82 ) ^( 62 )’ 


because c»lf;.S( 8 ;i)/fJ.S'( 62 )]yc)n < 0 .'' 


(9) 


■’f.S'(8) = ((I - tZ) ~ rip + l»Z'lf>)Vl) - r). According to tattle I, TZ’(fl) 2, w the 
first term in ifie bracket.^ i.v approximately 0.6 and the seettnd ts approximately 0.3; 
therefore, it i.s rca.sonablc u> asfume 0 < Ci tS) < I. .Similarly, f;S"<6) < 0. 

^ As shown in table 2, unless infhdon 1 ales exceed 20 percent, a higher inflation rate 
raise.s the user tost for lcs.s durablesnuctures (6,) proportionately less than the user cost 
of more durable structures I 61 }. I he relatise user cost of less durable structures declines 
as inflation rises. 
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Though rising inflation depresses the user cost of short-lived assets 
relative to that of longer-lived assets, however, it need not depress the 
relative effective tax rate on shorter-lived assets: dOIS^I/dfl is not 
necessarily less than dff(S,)/dn, nor is d[6>(S,)ms^)]/dn < 0 always. 
From expressions (7) and (8). for example. 


1 00(8,) , 

/I 00(8,) 

on y 

i on 


d In [CS(S:,)]/dn CSihj) lCS(8 i)-Sip 

~ a in [CS(Sj)]/d7r CS(Sj) ICS(S,)-S3F' 

The product of the last two factors in the right-hand side of (10) 
exceeds unity;* consequently, the left-hand side of (10) may exceed 
unity—the tax rate on less durable assets may rise more than that on 
longer-lived assets—even though CS(8n)/(^S(S,) is falling. Whenever 
d(S:,) rises more than 8 (8:,)/0(S.^) also increases because 0 (S:d < 

0(S.j) in this example. While relative user costs for shorter-lived 
structures fall with rising inflation, the effective tax rate on these 
assets may rise more than that on longer-lived structures. 

Finally, from (2), (5), and (6), assuming P = 1. 

S = [r(l - 2) - ITCUll -tZ - rrC - (1 - r)6/(p + 6)]. (11) 

Assuming that the intertem|K>ral discount rate (p) is constant, the 
elfective tax rate on corporate capital must vary with the inflation rate 
unless the statutory tax rate (r) equals zero or depreciation allowances 
are adjusted for changing capital goods prices— D{t. 6) is replaced by 
D(l, 6)e"' in (1)—.so that dZ/dfl = 0; I)(l, 8) may correspond to any 
capital consumption scheme, including ‘•economic depreciation." 

It is likely, however, that p changes whenever the inflation rate 
changes;'’ therefore, the demand for capiial may change whenever 11 

'■ S<T II Because r;,V(5) > 0, 0(6,) -> CS(6j). < I, however, implies ihai 

K .V(6.J - fijf -> K;.S (S,) - S:,F. Thcrelore. both ol ihe Iasi iwo larlors ol (10) exceed 
uiiiiv. 

" In many standard neex las.sical growth models, combining intertemporal tonsump- 
lioii and production decisions (e.g., Sidrauslci 1967; Inlriligator 1971). p b\ h>l*>thesi> 
isconsiant. In other versions ol this model the intertemporal utility lunciion is replaced 
by a saving rate and fKirtfolio balance relationship (Tobin I9.'>.‘>. 1965) In these latter 
inixlcls, higher tiiHation rates can lower the real return on money balantes. thereby 
encouraging investors to aci)ttire more capital, by reducing p, until its marginal physical 
product drops enough to restore poritolio equilibrium. In other mtxleLs, p is neilhei 
constant nor essentially ecjual to the real return on money. Inflation is a bv-producl of 
public and private policies intended to redistribute resources (.Schumpeter 19.19. 
Robinson 19.54, 1962; Pasinetti 1962). This reallcKalkm of wealth, income, or spending 
uncloiibteuly will c hange the six.ial rate of uine preference, p. Although Tobin's growth 
model suggestsp woulcl lall as inflation increases, Schumpeter’s model suggestsp would 
rise It is not unreasonable, then, to assume that higher inflation depresses p and. in 
turn, increases the average service life of capital if corporate income taxes are indexed. 
Yet, so little is known about the scwial rate of time preference that it is unreasonable to 
ignore the alternative conclusion; Higher inflation rales accompany an increase in p, 
thereby depressing the average service life of capital. 
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changes, unless this demand is also independent of p. By setting/TC 
= 0 and either setting t = 0 or allowing depreciation schedules that 
are a linear combination of “economic depreciation” adjusted for 
changing capital goods prices and immediate expensing of investment 
expenditures, not only is the effective tax rate independent of the 
inflation rate, but it is independent of the discount rate, p, and asset 
service lives. (Even if depreciation allowances were "indexed” for 
inflation, unless these allowances correspond to the pattern of an 
asset’s “economic depreciation,” the effective tax rate will not be 
independent of the as.set's service life [see Samuelson 1964].) Of 
course, these last reforms accomplish this goal by pegging the effec¬ 
tive tax rate at zero. Nevertheless, even though 6 = 0 for all values of 
n, 8, and p, these reforms cannot make the user cost of capital 
independent of p and 6 because (from [2]): 

f; = (p+S). (12) 

The demand for capital, therefore, is not systematically related 
either to the level or to the rate of change of the effective corporate 
tax rate defined by expres.sions (5) and (6). The demand for capital 
depends on the user cost of capital and the marginal physical product 
of capital alone in necxlassical macro models, so the income tax codes 
influence investment incentives only by altering capital’s user cost. For 
studying the demand foi capital, then, the suggested effective corpo¬ 
rate tax rate, 6, is not a sufficient statistic—it alone is not a definitive 
measure of investment incentives. Conversely, the user cost of capital 
does not measure the effective lax burdens borne by the various 
capital assets. Each concept—the cost of capital and the effective tax 
rate—serves a particular purpose, and their roles are distinct. 

III. Summary 

Unless the aggregate rale of time preference has declined substan¬ 
tially, the rising inflation rate of the pa.st 10 years not only has reduced 
business's demand price for capital generally but it has reduced the 
relative demand price for more durable assets most severely. Even if 
the corporate tax rate were zertj or depreciation allowances were a 
linear combination of economic depreciation adjusted for changing 
capital goods prices and immediate expensing of investment ex¬ 
penditures, thereby pegging the effective corporate income tax rate 
at zero, the level and pattern of investment incentives most likely 
would vary with the inflation rate. 

Of course, it is one thing for inflation to influence investment 
incentives directly; it is quite another for changing investment incen¬ 
tives to accompany a changing inflation rate because the intertem- 
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poral discount rate varies with the inflation rate. Although it is prob¬ 
ably unreasonable to expect any tax reform to insulate the demand 
price of capital from the influence of policies that reallocate resources, 
any schedule of depreciation allowances adjusted for changing prices 
can insulate investment incentives from inflation per se. 
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long-run effects of monetary shocks in a regime of fixed exchange 
rates, the effects of devaluations and changes in commercial policy in 
a monetary setting, and the short-run consequences of monetary 
imbalance on the inflation rate and the balance of payments have 
recently been the object of careful theoretical and empirical study. 
The importance of monetary variables in the determination of the 
exchange rate in a regime of floating rates has also been analyzed in 
detail.' 

Most of these studies, however, have dealt either with cases in which 
the exchange rate has been kept fixed for very long periods or with 
free-floating cases in which the exchange rate is predominantly de¬ 
termined by the interaction of market forces without government 
intervention." 

Yet in recent years many countries have expterienced simultaneous 
fluctuations in their exchange rates and international reserves. These 
fluctuations have occurred either under a managed-float system, 
characterized by government intervention in the foreign-exchange 
market, or under a crawling-peg system, in which the authorities 
determine, and periodically change, the country’s exchange rate." In 
order to analyze the experience of countries with these characteristics, 
an extension to the previous formulations of the monetary approach 
is rexjuired. 

Itt this paper we develop and estimate a model for the analysis of 
the Joint determination of the exchange rate, international reserves, 
and the rate of inflation under a crawling-peg system. Section 1 


' A iiuinix'i ol iinpoiianl <oniiibulions on ihe lopk are colletu-d in Frenkel and 
johnson (197(). 1978) and in the lnt(*rnaiional Moneiarv Fund (1977) volume A reviev, 
ot (he cnipirual evidenve on (he moneuvr) approach (o (he halaiue of pavmeniv and 
exihanse ialc>i iv preveiiied in Magee (1976). Johnson (1977) and Whiiman (1975) 
pieseni (oinpiehensive reviews ol the ictem conlribunons lo (he monetary-approach 
liieiaiure. while a critiejue ol some ot the central aspecl.s ol that approach is tound in 
Hahn (1977). All these deal with hxed-exehange-rale regimes. Among (hose dealing 
with floaiing rales are Kouri (1975), Dornbusch (1976). Frenkel (1976). Mussa (1976). 
and Frenkel and (Elements (m press). 

' Fxceptions aic the studies by Anus (1976) and by Girlon and Roper (1977). 1 he- 
laller develop and test a monetary mcKlel ol exchange-market pressure, which cs 
dehiied as a composile variable that inc ludes changes in both the exchange rate and 
mieriiational re.servcs. This mcxlel was recently applied lo Brazil by (atnnollv and da 
Silveiia (1979) who, using annual data, attempt to explain the behavior of the 
exchange-market prc.ssure variable. As will be seen, our model is designed lo deal 
. simultaneously with the joint deterininatioii ol each of the two components ol 
, exi hange-niarkei pressure as well as of the domestic rate of inflation 

' A theoveiic al analysis which concludes that the optimum exchange-rate regime will 
> cOl 1 espcvncl lo neither ol the extremes ol a completely fixed or a completely flexible rale 
; is picscnied by Boyer (1978) and Frenkel (1980). who concludes that "for the purpose 
: ol empiric at work it is useful to design a framework of the adjustment inechani.sm that 
< an accommodate simultaneously changes in international reserves and changes in the 
1 exchange late." 
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presents the basic modeJ and uses it to study the effect of changes in 
domestic monetary policy on the evolution of these three endogenous 
variables. The empirical implementation of the model, presented in 
Section 11, uses quarterly data from Brazil covering the crawling-peg 
period, 1968-77. The parameters of the model are estimated simul¬ 
taneously using a full-information maximum-likelihood method. 
Concluding remarks are presented in Section III. 

I. The Model 

Monetary Equilibrium 

1 he model is a variant of the monetary approach to the balance of 
payments. Its main feature is that it accounts for different degrees of 
exchange-rate flexibility (from a freely floating parity to a completely 
lixed exchange rate). We take the case of a small country, defined as 
one who-se international price of traded goods is exogenously deter¬ 
mined, and we allow for the existence of “nontraded" goods, defined 
as goods whose price responds, at least in the short run, to domestic 
monetary variables. 

The basic relationships of the monetary sector are 


M, = a(R -1- D). 

(1) 

Md - Pmd. 

(2) 

»n<t =y(>’. 

(3) 


where M, is the nominal supply of domestic money; a is the money 
multiplier; R is the foreign-exchange reserves held by the central 
bank; D is the domestic-credit comjKinent of the mcmetary base; Af d is 
the demand for nominal cash balances; P stands for a price index that 
includes traded and nontraded good.s; and is the real demand for 
money, assumed to be a function of real income,^ y, and of the al¬ 
ternative cost of holding money, proxied by rr*", the expected rate of 
inflatitm.* 

Although it is possible to postulate a mechanism of lagged adjust¬ 
ment, as is done in the empirical section, here we assume for 

* We assume that the rate ot growth of real income is exogenous with respect to 
monetary variables (we therefore assume a version of the natural rate hypothesis) 
Although restnnive. this assumptKin allows us to focus on the specific effects of monc> 
in an open economy in the tonlexi of the monetary approach. At the empirical level we 
found evidence supporting our assumption for the case under consideration. A com¬ 
plete report of the relevani lausaiity tests is available from us on request. 

’ As it happens, the results below are valid under a large variety of assumptions about 
the mechanism for the foriiiaiion of nr', provided that the latter depends on taggerl 
variables. In the empirical applications, we assume that w' is formed by a version ol 
rational expectations; sec .Sec. If. 
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simplicity that the money market clears in each period so that the 
nominal stock of money is equalized, ex post, with the demand for 
nominal balances. The assumption requires the existence of the flow 
equilibrium 

Mt = MS. (4) 

where the asterisk indicates the percentage rate of change of the 
variable. Differentiating equations (1) and (2) logarithmically, the 
flow-equilibrium condition can be rewritten as 

a* + (I - y)R* + yD* = P* + m$, (5) 

where y is a factor of proportionality equal to D/{R -t- D) and P* is the 
domestic rate of inflation. 


The Domestic Rate of Inflation and the Balance of Payments 

When traded and nontraded goods are both present, the domestic 
rate of inflation can be measured as a weighted average of the rate of 
change of the price of both kinds of goods, 


P* = XPf + (I - X)P5 t, (6) 

where Pt.Pnt the price in domestic currency of traded, nontraded 
goods, respectively, and X is the share of traded gtxtds in total ex¬ 
penditure. In a small economy Pf is determined by the world rate of 
inflation (PJ,) and by the variations in the effective exchange rate (p*); 

P^ = P* + p*. (7) 


The price of nontraded gtxjds, however, may be affected by 
domestic factors, at least in the short run. Since an ex ante excess 
supply of money implies excess demand in the goods market, and if 
excess demand for nontraded goods varies monotonically with excess 
demand throughout the economy, we can expect the price of non¬ 
traded goods relative to that of traded goods to be a function of 
imbalance in the money market, 

f 

I /’Bt = pf + en, (8) 

^ where fl is a measure of monetary imbalance and 0 is the elasticity of 
' the relative price with respect to the monetary variable.* 

; An important feature of equation (8) is that it includes only con- 


* ® is “ function of the elasticity of substitution between traded and 

nontraded goods in consumption and production and of the income claMicity of the 
nontraded goods. For a deuiled description of the dynamics of domestic-price deter- 

mmation m a moneury model with traded and nontraded goods, see, e.g.. Parkin 
(1974) and Blejer (1977). B . rariuii 
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temporaneous variables. In practice, however, it may well be that 
depends on lagged P^ and fl reflecting persistence effects. To allow 
for these effects, it is possible to postulate a relationship of the form 

(/"St - Pf), = - Pf),-,. (9) 

where / is a time subscript and a is the adjustment coefficient. Empiri¬ 
cally, however, we found that our results were quite insensitive to the 
particular specilication adopted. We therefore considered equation 
(8) as the relevant representation for relative-price determination. 
The structure of the model resulting from equation (9) and the results 
obtained using that specification are presented in the Appendix.' 

.'\s regards the measure fl, it is important to remember that a 
central conclusion of the monetary approach to the balance of pay¬ 
ments is that in a small open economy the nominal supply of money 
may be beyond the control of the monetary authority. Under fixed 
exchange rates the monetary authority can only determine the ex ante 
quantity of money by changing the domestic-credit component of the 
base or by manipulating the money multiplier. In conjunction with 
the flow demand for real balances generated by adjustments in the 
desired stock, such measures create an ex ante excess How supply of 
money to which the public reacts by changing the level of the interna¬ 
tional reserve component of the base through the balance of pay¬ 
ments and by affecting the rate of domestic inflation. The ex post 
nominal ()uantity of money in an open economy is then influenced by 
the public's respon.se to ex ante conditions in the money market." 

It appears, therefore, that the relevant measure to account for the 
monetary ef fects on the goods market in an open economy should be 
an ex ante measure which docs not include the endogenous reaction 
of the ftireign component of the base. For that reason we define fi in 
equation (8) as the gap (in pierccntagc terms) between the ex ante 
change in the money supply (i.e., a change in the domestic-credit 
component of the ba.se and in the money multiplier)" and changes in 
demand. Equation (8) can therefore be rewritten as 

P^T = Pf + + a* - ATS). (10) 


' Fq. (8) ig^ote^ po.'isilile re.jl sector cftecls on Uie relative price of nontraded 
goods (e g . real wages, intomc policies). .Although the equation can be nuidihed to take 
these (actors into account, this wmild result in a Iraniework lar tx-yond our intended 
extension of the monetary approach to the case of a crawling peg. 

" For time-series analysis of Kuropeaii data and econometric tests ol the inieraitioii 
between changes in domestic credit and in international re.seives implied by the mone¬ 
tary approach, see Blcjer (1979) and leiderman (1980). 

* Empirical examination of the assumption of exogeneity of (yP* + a*), on the basis 
of causality tests, indicates nonrejeciion erf the a.ssumption; complete test results arc 
available from ns on request. 
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Substituting (7) into (10) and then (7) and (10) into (6), we obtain, 
after some manipulation, the following expression for the rate of 
domestic inflation:'® 

P* = e(P* + p*) +(I - €)(yD* - m*a), (11) 

where e = [1 + 8(1 - X)] 

In addition to changes in the price level, there are also changes in 
international reserves operating to restore monetary equilibrium. An 
expression for the money account of the balance of payments, which 
is equal to the change in the international reserves held by the central 
bank, may therefore be obtained by substituting P* in equation 
(5)—the flow equilibrium condition for the money market—for its 
value in (11) and rearranging the terms: 

(I - y)R* = e(P* + p* + m% - yl)*). (12) 

Equations (11)-(12) present the domestic rate of inflation and the 
balance of payments as functions of world inflation, exchange-rate 
policy, and the rate of ex ante excess flow supply of money. When 
nontraded goods are absent (X = 1) or when their price is not sensitive 
to monetary imbalance (6 = 0), then « = 1, and the model is similar to 
the classical long-run formulation of the monetary approach (see 
Johnson 1972). In such a case, domestic monetary variables do not 
affect the domestic rate of inflation which, if the exchange rate is not 
altered, is fully determined by the world rate, and every ex ante 
monetary shock will lead to reserve depletion due to a balance-of- 
payments deficit. 


The Endogeneity of the Exchange Rate in a Crawling-Peg System 

f.xccpt in a fully flexible exchange-rate system (or in a managed float), 
the exchange rate is regarded by governments as a policy instrument, 
and its fluctuations are generally influenced by policv decisions aimed 
at one or more goals. Unlike under an adjustable-peg regime, under a 
crawling-peg system the exchange rate is changed f requently accord¬ 
ing to some set of rules adopted in order to attain the government's 
objectives." I he variation of the exchange rale can therefore be 
tonsidered as following a sort of reaction function which reflects 
policy goals as well as the parameters of the model adopted and the 


As' iming, tor simplicil). a con.staiii money muliiplirr. i.e.,a» = 0. In the empirKal 
se< non, however, ehan)(es ni ihe money multiplier are explicitly considcretl. 

A number of proposals foi the operation of the crawling peg as well as aiialvses of 
the stability of the system have recently been presented in the hlerature: see, e.g., 
(*^70), Kenen (197.A), levin (1975, 1977), and Mathieson 

(I97n). 
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values of the exogenous and endogenous variables considered rele¬ 
vant for the desired goals. The goals, and therefore the crawling 
rules, may vary from country to country."* 

With a view to the empirical implementation of the model for 
Brazil, we postulate here that the policy objective is to avoid long-run 
changes in the real exchange rate and that the nominal rate (p) is 
therefore altered to maintain purchasing power parity,We assume, 
in addition, that the full adjustment of the exchange rate may take 
more than one period, and we shall analyze the effects of differences 
in the speed of exchange-rate adjustment. The reaction function 
implied by the policy tule is 

It 

p,* =/3 2(I -T*),, (13) 

(=0 

where / is a time subscript, ^ indicates the portion of the current 
differential rate of inflation transmitted to the exchange rate in the 
current period, and L is the lag operator {L‘x, ~ x,-,).'* 

This formulation specifically assumes that, in addition to the 
current-period adjustment, the exchange rate will continue to change 
in each subsequent period by a fraction /3 of the still unadjusted 
differential until the whole differential has been transmitted to the 
exchange rate. Obviously, the greater is the faster will the rate 
adjust. If/3 = 1 our model does not differ conceptually from the 
monetaiy-approach model of free-floating exchange rates, since the 
rate of depreciation is then fully determined by the domestic-foreign 
inflation clifferential (see Frenkel 1976). 


The furutiormig oj the Model 

We proceed now to solve the model for the three endogenous vari¬ 
ables in which we arc interested; the rate of inflation, the rale of 

Kencn (197.5) atialv/cs in detail the relative efliciency ol a number of alternative 
sliding-parity i tiles. Mathieson (1976) studies the consequences of using a welfare— 
instead of a balaine-of-paymcnts—objective as the guideline for the crawl 
’’ The appropriateness of this assumption for Brazil is discussed in Sec. II. In a 
previous version of this paper a number of alternative rules were incorporated into the 
model, among them maititaining a given level of nominal reserves (R* = 0) and 
maintaining a given level of real reserves (R* — Pi = 0). Although the dynamics of the 
system change with the policy rule, its basic structure is not affected. For presentational 
convenience the functioning of the model is here confined to a single rule. 

A reaction function of this type follows Dean (1974) in the sense that endogenous 
target variables arc a function of the current values of other endogenous variables; this 
is justified on the grounds that the latter are a plausible representation of the structure 
of the model and may therefore provide prior knowledge of the structural-form 
coefficients. This differs from the approach of earlier works such as Friedlaender 
(1973) where endogenous variables depend only on exogenous or lagged-endogenou.s 
variable.s. 
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change of foreign-exchange reserves, and the rate of change of the 
exchange rate. Combine (11) and (13) to solve for pf, 

and substitute this in (11) and (12) to obtain 


- €(\-m-L) 

• 1 -)8« - (1 -i8)L 


(Pi). 


(15) 


(1 -e)(l -(1 -/3)L] , _ 

r-/3r-(i ->'• 


and 


(1 - ywr = (P* + mS - yD*),. (16) 

The three endogenous variables are functions of current and 
lagged values of foreign inflation and domestic monetary variables. 
We can consider now which type of monetary policy will equalize the 
domestic rate of inflation to the world rate. Because the coefheients of 
Py and {yD* - m*) in equation (15) add to unity, P* will equal PJ when 
the monetary authority expands the money supply at the rale neces¬ 
sary to .satisfy the growth in real demand and to replace the depre¬ 
ciated value of the nominal stock. This is achieved when the exoge¬ 
nous component of the supply of money (domestic credit) expands at 
a rate that exceeds the growth in the demand for real balances by the 
world rate of inflation, that is, when the ex ante excess flow .supply of 
money is equal to the world inflation rate. 



{yD* - m*a) = P*. 

(17) 

which implies 


P* = P* 

(18) 

and 


p* =R» = 0. 

(19) 


fo illustrate the functioning of the model further we ran analyze 
the effects of a domestic-credit shock. To do so, it is convenient to 
consider an economy whose initial equilibrium satisfies equations 
(18)-(19). Starting from such an equilibrium ]x>sition, and as long as /3 
and e are smaller than unity, an acceleration in the rate of expansion 
of the domestic-credit component of the moneury base, A(yD*), will 
cause the rate of domestic inflauon to depart from the world rale 
(here assumed constant), the exchange rate will rise, and a balance- 
of-payments deficit will be created. 
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If .hf rue <.f wrow.h of cfoniestit credit is sustained at the new. 
level, the exchange rate will eventually adjust to account fully 
the dillerential in the inflation tates, which will then be equal to 
A{yD*).'^ Once the thanges in the exchange rate fully reflect the 
dirfeieriikil rate of iriHaiioti. the excess supply of money created by 
(he got emtueiit in eath period is fiilly eliminated through an increase 
in donicMic pi kes, and no further Hows of foreign exchange reserves 
ocim. rhis process is illustrated in figure 1 for different values of j3 
.iikI e.'" Asfi increa.ses, forgiven values of €. the exchange rate adjusts 
faster to the inoneiarv sIkk k, and domestic inHation Ixith diverges 
faster from world inflation and {onveiges faster to its new equilibrium 
rate. As tan he obsei ved by comparing paths III and II in figure I. the 
balance ol pat nienis detci iorates less and returns to etiuilibrium more 
l apidlv (impK iiiga smallei total losstif reset ves) asincreases. When 
—* I. the svsieni approat hes a Hexible-exchange-rate model, like the 
one |)iesented b\ Frenkel (l‘)76). in which domestic inflation is always 
independetit of vtotid inflatioti and the balance of payments is always 
/ero. I.etitng/I = 1 in ctpiatiotis (H)-(16). we obtain 


f 1* = (vf>* - m%),. (20) 

pr = lyD* - w,? - F*), = (P* - P*)„ (21) 

R* = 0 . ( 22 ) 

f be speed of adjtistineni will also increase, lot given values ol fi. 
the kiwet the c, that is, the higher ifte share of nonitaded goods in 
expenditures, K. or the fiighei the ela.sliciiy ol relative prices with 
respeti to monct.ii v imbalance, f). Fbe more rapid adjusimeni of the 
rale oi exchange when € falls (which alsti implies a faster acteleiaiion 
ol the tale of inflation although a smaller loss of reserves during the 
adjustment piixess) is illustrated by the comparison ol paths II and 1 
in figure 1. 

Thus the results inciicaie that under a crawiing-peg system a small 
country can choose its own long-run late of inflation itide{>endenily 
Irom the rest of the world. However, halance-ol-paymetUs deficits 
and surpluses are experienced in the process of adjustment, owing to 
purchasing jiower disparities which arise because the exchange rate 
takes some time to adjust fully. 

We turn now to the analysis of the empirical results obtained by 
applying the model desc rilH-d above to the t.asc of Brazil. 


“ Thai IS, the diffcienic h«wren ihc new rate ol ex ante excess supply ol money 
lyD* + A{yP*) - m*] and the work! rate ol inHation. 

“ For preseiitaliorial siniplirilv the figure does nol consider the effects of the tempo¬ 
rary reduction in m3 as mflaiHinary expectations aic'rleralr. The pattern of adjustment 
IS. however, very similai when ihesc effects are taken into account. 
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11. Empirical Investigation 

I', stination Procedure.i 

In this section, the model developed above is restated in econometric 
01 m. For estimation purposes, we work with the following system. 

log m, = 8, log y, - 8j7rV + 83 log m,_, + fi,„ (3'^ 

P* = efPJ. + p*), + (1 - «)(«* + yD* - m*), + pLj,. (IT) 

(1 - y)R? = ((P* + p* + m*-yD* - a*), + ^ 3 ,, (12') 

p* = ^(p* _ + ^(1 - y3)(P* _ pj,),_^ 

+ ^(1 - mp* - p:),-, + M4,. 
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Equation (3') is the empirical counterpart of the model’s demand for 
real money balances. The specification is semilogarithmic, in that the 
log of real money balances depends on the log of real income and 
expected inflation. As the model is applied to quarterly data, we 
postulate a process of gradual adjustment of real balances to their 
optimum value, and that is the rationale for the appearance of lagged 
money balances in the equation.” Equations (11')-(13') are stochastic 
variants of equations (11)-(13) above."* 

The following stochastic specification is adopted: It is assumed that 
IX, has a first-oidei autoregressive representation ix„ = (ttifiit-i + 

(for i = 1,2, 3, 4), where is the autoregressive parameter, and vi, is 
the error term. Fiti therinore, we assume that the error vectors, v^,, 

(for t = 1. T) have zero mean and a constant variance- 

covariance matiix T; also, they are .serially unrorrelated and drawn 
from a multivariate noirnal distribution. 

Equations (.3') and (ir)-(13') constitute a simultaneous system, 
w hich is lineat in the variables but nonlineat iti (some oO the param¬ 
eters. Given these characteristics, consistent estimation requires the 
u.se of some simultaneous-equation method. We have used a full- 
infoi uiaiion maximum-likelihtHrd (FIML) estimator; FIML utilizes ail 
the a prioi i lestriciions on the system to estimate the coefficients of 
interest (in our case. e,/3, <p„ 5() by maximizing the model’s likelihood 
function.'” 

To make the estimation operational, a time series of the rate of 
change of real money demand, is required. As money demand is 
simultaneously estimated with the other equations of the model, the 
complete system is estimated under the restriction that corre¬ 
sponds to the hrst dif ference of the fitted values of equation (3'). In 
adefition, note that the estimathm of money demand requires a proxy 
for tt' . 'I bis prox\ is constructed by a.ssuming a version of the rational 
expectations hypothesis. Specifically, it is postulated that agents form 
their inflation expectations on the basis of the least-squares prediction 
of inflation, conditional on the set of available information. We have 


K<>i mall>, ihe dfmjiul lor money w ({iven by log = 6| log v, — SfirT. Money 
demand c<]uatioiis ol this form have pieviously been estimated for Bi'H/.il by .Silveira 
(1973) and Khan (1977) We assume an adjustment mechanism ol the form (log ffl, - 
log m,.,) = A(log m.,, - tog m, ,). {) < h < I. See Griliches (1967) for a dist us.sion of the 
conditions cinder which adyustinem mechanisms of this class are likely to be optimal. 

As can be seen, we arc now taking inio account changes in the money multiplier. 
Note also ihal we considei only three terms in the exchange-rate equation (this issue is 
discus.sed luriher below), 

'“.See Hendry (1971). 1 lie parameter csiiniates reported Ireltjw were obtained by 
u.sirig Wynier’s (1978) computer program, kesimci.. The program uses a Newton- 
Rayihson iterative proceduie. beginning with arbitrarily given initial values of the 
parameters. 
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assumed that the relevant information set includes lagged values of 
the variables which, according to the model, determine the rate of 
inflation.®* 


4 Crawling-Peg Experience: The Case of Brazil 


Brazil's adoption of the crawling-peg regime can be considered as an 
sxiension of indexation to the external sector. Starting in 1964, Brazil 
sngaged in one of the most comprehensive indexation programs 
inywhere. The financial sector was the first to be indexed. Exchange- 
rate indexation was introduced in August 1968.®' 

During the postwar period, Brazil had experienced a variety of 
;xchange-rate regimes. Until 1953, exchange controls and licensing 
ivere widely used. In 1953 an exchange auction system with multiple 
exchange rates was introduced. A prticess of unifying the rates was 
nitiated in 1961, and since then the country has had a pegged ex- 
rhange rale. Since 1968, when the crawling peg was adopted, new 
ielling and buying rates for the cruzeiro in relation to the U.S. dollar 
iTc announced hy the ccnwal bank at fvec^nent intervals Ion average, 
he rate was changed over 10 times a year). 

Despite the fact that there are no explicitly stated rules for the 
•rawl, it is evident that the method in effect applies a purchasing 
rower parity clau.se to the domestic price of foreign exchange, and 
he evolution of both domestic and foreign price levels is taken into 
iccount in adjusting the exchange rate. In this way, as emphasized bv 
he Brazilian authorities, it is possible to neutralize the harmful effects 
)t domestic inflation on the conifreiitiveness of Brazilian products in 
nternatiorial markets and, hence, on Brazil’s balance of payments.®* 
Although inflation differentials are certainly an important factor in 
Ictei mining the crawl, it was not the only one. However, it is evident 
lom the data (see Lemgruber 1977, tables 1 and 6: and Moura da 
iilva 1977, table 2) that since 1968 the variations in the exchange rate 
lavt followed purchasing power parity rales closely, suggesting that 
pu liangc-rate changes have been mainly determined by the differ- 


Lii,*'*^*^' ^ “’n.slrucicd from the tilled values of a regrcssHtn of the raie of 

kiL.'*,!!! values of Itself, foreign inflation, doniestk:<redii growth, 

“’’d real iiuome growth. For each of these variables two 
tggvd values were iiitluded in (he equation. 

"'flexation in Brazil, sec Nadiri and Pasture (1977). A 
emgiuterny77'l On «x«noniic cxperieme is presented bv 

"Oiioniv ■Jf M nf ‘8e crawling peg and its elTecis on the 

.ommiy, see Moura da Silva (1977). See also Lara-Resent ( 1 978). 

Ikiuiion 0 ?^**^* r j“'“ificalion put forward bv the authorities for the 

67^7) * " crawling peg; on this issue see Lemgruber (1977) and Moura da SUva 
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ence between domestic and foreign inflation.^ The specification of 
the exchange-rate reaction function above (eq. [13]) is intended pre- 
cisel) to capture this feedback process. Such a specification imposes 
restrictions on the data; econometric tests of these restrictions as well 
as the results from estimating the model are reported below. 


Empmcnl Results 

We now report the estimates of the parameters of equations (3') and 
(ir)-(13'), as well as of the derived reduced-form equations. All 
estimates are based on (luarterly data for Brazil. T'he sample period, 
HI/1068 through IV/1077, contains 38 observations. Most of the 
vat iables arc represented by readily available series.’*'* I'he results are 
])rescnted in table 1. All the parameter estimates have the proper sign 
and are reasonabh well determined; in most cases asymptotic t ratios 
are well aliove 2.’**’ 

.411 the coefficients in the estimated money-demand equation (3’) 
have the expected signs and are significantly different from zero. The 
estimated coeffic ients are fairly similar to those reported for Brazil by 
Silveiia (1073) and Khan (1977). Out parameter estimates imply a 
long-run income elasticity of real money demand of 1.307 and a 
long-run elasticity or real money demand with respect to expected 
inflation of -0.77. 

I he results tor (I r)-(13') indicate that Itoth foreign inflation and 
excess suppK iti the money market significantly affect the determina¬ 
tion of the domestic rate of inflation, fhese two factors, world 
inflation and monetary imbalance, also significantly affect the balance 
of payments, as shown by ecpiation (12'). .Specifically, an acceleration 
in the rate of doniestic-credit creation over and above the rate ol 
growth in money demand will increase domestic inflation and reduce 
international reservc-s. The overall effect of such an incTt?ase, how- 
evet, cannot be established solely from eejuations (IT) and (12'). This 


I he t',x( himge rule has tjcpii tolluwcd with a lasji.sidcrabli* degree ot con.sisteiio 
Alinoiiatid and Paslnie (l‘)77) present some evidence lo that eflei i and < nikize the tack 
of ftexibituv—ihe rule has noi been ailered U> deal wiih the oil crisis and with the 
mounting external debt of Brazil. Similar evidence is also presented by (amiicilly and da 
Silveira (1979, pt. 3), who found that purchasing power purity holds yerv well, particu¬ 
larly aftei 1962 M t , 1 y 

“ 1 he following series were used for P, R, and M: torisiimei price index, interna 
tional reserves of the central Itank, and the money supply; D was generated by sub¬ 
tracting/f from tfie monetary base; p is the market exchange rate, selling rate (average 
over the quarter) (all the- above were taken from Intrmattonal Ptnaruial Staltsttcs, various 
issues); y is repre.sentcd by the quarterly real income .sc’rics constructed by Wachtfi 
11976). (The series were extended through 1977 using Wachter’s methixl.) 

Asymptotic t ratios are defined as coeffic lent values divided by asymptotic standard 
errors. 
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TAB1,E I 

FlII-l.-lNfOKMA'I ION MaXIMUM-LiKKUHOOD EviIMATtS Of 1 HK MoDEI. 


(S') logmi = 0 162 log>, - l.TOOwf + 0.876log m,.,, 

(0.049) (0.229) (0.04.5) 

MSE = 0.0018>, = -0.613, G, = 0.873; 

(0.042) (0.297) 

(ID Pf = 0.958(PJ. + p*), + 0.042(0* + yO* - mj),. 

(0.018) (0.018) 

M.SE = 0.0003. <(., = -0.082, Gj = 0.002; 

( 0 . 110 ) ( 0 . 0012 ) 

(12') (1 - y)«,* = 0.958(/*» + p* + m? - yD* - a*),. 

(0.018) 

M.SK = 0.0021, <(>i = 0.2.51, Cj = -0.038; 

(0 043) (0.039) 

(13') pf = 0.526(/'* - P*), + 0.249(7** - /**),_, + 0 I 18(/** - P*),-,. 
(0 106) (0.005) (0.029> 

MSE = 0.000.5, <()4 = -0.050. G 4 = -0 008. 

(0.106) (0.003) 


0 0006 

O.OOIM 0.0029 

0.0005 -0.0(MI5 0.0005 


; Niii t —Set* irx« )(ir no(a(t(ii) Nuiiieiah in pAirrn(heM*& ai<r as>ni}»u>tK rnon ol irKirk5ii>n inribomu 

’ r IS (he I'iiimaird vjnaiue-ioMittaiM f mamk »( rrsiduals. (1 > I 2.5 4) i> ihr rsiimaird intisuni irrm 
iBiiiniL'gieMioii ((K'lfidcni. and MS( is ihr niran vfiurr ciroi 

s .so tx'cau.se the divergence of doinesiic from world inflation will, as 
nrlicated by equation (13'), depreciate the e.xchange, which in turn 
vill tunher increase the tale of inflation and reduce the rate at which 
oreign reservc.s are depleted. In order to analyze the full effect of 
dianges in the rate of growth of domestic credit (as well as in the other 
ixogenous variables), reduced-form coefficients should be con- 
idcred. Fhese coefficients are reported below’. 

I he parameter estimates in equation (1.3')—the exchange-rate 
eaition function—indicate that more than .30 percent of the 
31 eign-domestic inflation differential is transmitted to the exchange 
ite within the current quarter, and an additional 25 percent in the 
ext. After only three quarters the full differential is completely 
tflerted by the exchange-rale adjustment. In this context, it is im- 
oriant u note that equation (13) embodies a restriction regarding 
le effects of current and lagged values of the domestic-foreign 
ination differential on the exchange rate. Specifically, a pattern of 
jonietncally decaying weights was postulated. The results reported 
1 ta el ate derived under the imposition of this re.striction. To test 


0 0018 
-0.0004 
0.0009 
0 0002 
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Appendix 

In this Appendix we derive the strurlural and reduced-form equations of the 
model when persistence efletts are allowed in the price equation for non- 
traded goods, 'fhe structural equations using this specification are then esti¬ 
mated and the restilLs are re|X)rted below. 

Replacing equation (8) hy equation (9), we can express the rate of change of 
nontiaded goods prices as 

= (f'n + (A 1) 

I - od. 

Adopting now the definition of ft, given in Section I, we obtain equation (A2), 
the toiinterpaii of equation (HI). 

(/'St), = (/'t). + + «* - (A2) 

I — aL 

Tsing cqn.itions (7) and (A2), the rate of doineslic inflation, equation (6), can 
be written as 


/7 = <'(/',! + P), + (I - c'Xyft^ + ri* - mj),. 
wheie «' is now a poKnomial in the lag tiperaloi, 

, ^ 1 - aJ. 

I — al. + &(I — X) 


(A.‘f) 

(A4) 


Postulating the same tiawling lule as used in the text, that is. equation (13), 
we can obtain the teduced forms of the model,*' 


/3(1 - «') 


fit - (1 - 13)1- 


(yl)* + 




(A.')) 


p* 


e'(l - /iHI - /.) 

I - - (1 - PV. 


(I ' y)Hf 


(/',7I/ + 


(I - €')|1 - (I - J3)/- 
1 ~/3e' - (1 -13)1. 


(yl)* + a'' 




fif'd fi)l 


pirf ),. 

(Aft) 

(A7) 


1 he econometric lot ms of the model, in <oii|unrtion with the demand for 
tnoiiey, ecjtiation O'), arc estimated by the prtKedtirc dest ribed iti .Seclioti M. 
I he parameter estimates do not differ much from the model in which this 
type of persistence is ignored. 

For comparrson and cornplelencss we now report the estimates olrlained 
with the extended model: 


1 he reduced forms could be expressed in lerm.s of the underlying parameters, A. 
o. and 0 by substituting the RH.S of (A4) for I his results in rather c titnbersome 
expiessions, as, e.g., in eq (A5) 


p7 


_ mi - X) _ 

(1 - aL)(l- fi)(l - /.) + «(l - A)(I - ff)/- 


iyl)» +a* ~ mS - P*),. 
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logm, = 0.224 log^i - 2.224jrf + 0.828 log m,_,, (A3') 

(0.063) (0.300) (0.058) 

pf = 0.618(P* - P*), + 0.236(/** - P*),-, + 0.090(P* - (A8) 

(0.099) (0.023) (0.032) 

P* = O.OSKP* + p*), + 0.049(«* + yD* - mj),. (A9) 

(0.018) (0.018) 

(1 - y)Rr = 0.95UP* + p* + mj- yD* - a*),. (A 10) 

(0.018) 
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I his papf I htiikis upon soinr ivell-knowii fans ahoiil slaio pivprii- 
iiicni to (feiK'iiiK’ new<<)[K'lu.sio/isafxMit sotial (hour on ific national 
lesci of a Icdei.il irpiililk. (Im/cns vole against national laws lhal 
lestrift then slate's abiliis to ex|)ori (osts bin snpjx)ri laws ilial 
lediue ilic (osis imposed on tbeiii. Individuals niav seek to extend 
the laws passed in some slates to the entire rialion oi imn oppose 
preemptive laws because lliev benefit f'loni variety Siiue these moli- 
v.itions are abseni in a unitary sssieni. national support foi a law will 
depend u[h)ii wtieiher a unilarv oi a I'etferal structure prevails 


I. Introduction 

This paper liuilds upon some well-known fads about state govern- 
nient to generate new conclusions alKint stnial tlioiee on the national 
level of a fedeial lepublii. I he eentral feaniie of stale govennneni 
behavior I shall exploit is the incentive eat h state has to improve its 
own position hy im[Kising costs on the irsideiits of other stales. This 
seat eh for local advantage lakes many forms. On the one hand, stales 
may imfuise taxes and regulations which are borne hy oui-ol-staie 
residents; on the other hand, they may try to attract investment from 
other states by providing lax breaks and s|}e(ial piihlic services.' 

I his paper was presenied ai the Wurk.shop on Analvlital k'rixiii txoiiomics at 
Queens Univei.siiy, King.sion, )ui)c 1978 I wish lo ihaiik Biiue Aikerman. the work¬ 
shop parlKipaius. and the icfeiee of this founuil foi helpful loniiiieiils. 

‘ These inceniives are stressed by Walker (1969. p. 890) and Posner (1977, chap. 26). 
.Mcl.ure (1967) estimates ihai afKuit one-xjuarter of all state taxes are exported. Kxter- 
lulities such as air and water (xdlulion are emphasized by Breton (196.5), dlsmi (1969), 
Tullock (1969). Rothenberg (1970). and Oates (1972). I hesc authors di.v uss how a 
federal system can trade off mierjuiisdictional vaiiriy against the iinernali/.alk>n of 
externalities through boundary dehniemn. the assignment of fuiHtions to levels of 
government, and intergovernmentaf grams. 

{jfntrnal of Poiibtal f.trmamj, 1V81, sot >19, no 11 

S) 19*1 by Ilw L'niveoio "t ChKjgii (XK?.S8nn/»W890|.|XXMI»()l it) 
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Whatever their particular character, these interstate spilhins and 
spill-outs can alter the substance of national legislation. Citizens will 
vote against laws that restrict their state’s ability to export costs but will 
support laws that reduce the costs imposed on them by their own and 
other states' choices. Since these strategic motivations are absent in a 
unitary system, national support for particular laws will depend upon 
whether a unitary or a federal structure prevails. 

Despite its potential imjKirtance, the impact of federalism on the 
strategic position of voters has been largely ignored by .social scien¬ 
tists. In his excellent review of the literature, for example, William 
Riker (1975) argues that federalism has no impKtrtant influence on 
substantive national tiutcomes, suggesting that it does no more than 
delay the passage of national legislation opposed by a substantial 
minority of state legislatures.’ Although Riker’s conclusion is partly an 
empirical proposition, it is also based on an implicit mtxlel of how 
government systems behave. Riker argues (1975, pp. 155-56) that the 
underlying distribution of tastes in the population determines politi¬ 
cal outcomes. Since lederalism per se has no impact on tastes, it will 
not, therefore, affect national (X)litical choices in the long run. The 
present papet challenges the.se theoretical underpinnings. 

Since 1 am concerned not with an exhaustive taxonomy but with the 
leiationship between |)olitical power and political structure. I con¬ 
centrate on simple models. The system 1 discuss has only two 
"layers"—national and state.“ States have fixed boundaries and cover 
the nation's entire geographical area so that every citizen lives in only 
one slate. 1 will contrast the legislative choices of a unitary govern¬ 
ment with those of a "hierarchical” federal system where higher-level 
governments can always preempt the legislative choices of lower-level 
governments. If the superior government has taken no affirmative 
action, however, the statutes c»f inferior governments are binding.^ I 


‘ Rikrr (197.')) pr()p(>s<-> an iniagiiiarc rxprriim-nl in wliuh nialc hc-d pain, fil uniian 
and federal sysienic are examined to diwtiver d there are ini}>oriani public |x)Ikc 
dillereiue.'i, tie hyjMithesi/es ihai public ixiluy hiiIiiii each pair will lie "rc-niarkabh 
similar regaidless of lederalism" (p H-l). Fuilhermnre. he argues llral m the tinned 
Slates lederalism delayed iialiuiial regulation ol busiite.vs (p 154) and helped pet- 
peiuale rac ist acts (pp 1.54-.56) hut had no long-run inipai I on the c harat ter of national 
legislation. 

’ Foi an attempt to )ustily a thix-e-layered system, see Ylvisaker (1959) With the 
exception of Wechsler (I9t)5). Posnei (1977), and Wintei (1977). legal eoininentators 
have all but ignorexi the issues di.siussed in this pa}ierand have instead comenirated on 
the relationship In’twecn state and federal courts. For a recent artwle in this tradition, 
se- Clover and Alcinikoll (1977) 

1 his model should he contrasted with others which have a strict divisioit of lunc- 
ttons between htgh- and low-level governments. For exatnple, see Wheare (1953, pp. 
32-33). Recently, coitccrn for a strk t divtsion of authority has given wav to scholarship 
which rcx'ognizcs the importance of tnleractions between levels of government. See, for 
example, Giodzins (1960. 1966) and Ela/ar (1962). 
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assume that both state and national governments are direct democ¬ 
racies where the only political actors are individuals and where all 
decisions arc made by majority vote. Political issues are separable and 
well defined, and they appear to voters as simple dichotomous choices 
between maintenance of the status quo and a change.* The analysis 
concentrates on legislative choices. I do not discuss administrative or 
judicial issues or the possibility that the national government will seek 
to administer programs through state government agencies. 

Section 11 shows that federalism "matters” in a political system 
where capital can move but people cannot. Section III considers the 
impact of permitting interstate migration of voters, and Section IV 
shows that federalism can matter even in the long run when states 
respond to the legislative choices of other states. 


II. Federalism Does Matter 

The major difference between a unitary system and a hierarchical 
federal sii ucture is the characterization of the status quo. Lety = the 
status quo in a unitary system, and let .v = a proposed, exogenously 
defined, legislative change. Suppose, for example, that in the .status 
quo casino gambling is illegal. The federal government, however, 
proposes to make such gambling legal and to levy a lax on the 
profits. 

In a hierarchical federal system, some states have passed legislation 
that is similar u>x. Others have a .status (|uo that is .similar to v- Tet us 
call the former .v-type states and the latter y-Iyf>c states. C'.ontinuing 
the example, *-type states jx-rmit casino gambling and levy a tax on 
earnings, while casino gambling is illegal iny-iype stales. Assume that 
if all states are .v-type states, then the state laws taken together would 
have the same impact as v. Similarly, a.ssume that if no states have 
pa.s.sed the .v-type law, the .status quo is the same as in a unitary system. 
Thus, in my example the state gambling laws are duplicates of x and v, 
respectively. This assumption is not essential to the analysis, but it 
simplifies the expc)sition considerably. 

Because of interstate spillovers of costs and benefits, individuals' 
preferences over x and y will not nete.s.sarily be the same as their 
preferences for alternative state legal regimes. In the gambling 
example, some people may favor a .statewide gambling law and vote 


These issunipuons permit me u> look at the vote on a jingle issue and to avoid votinji 
ntajoriiy rule does not produce a determinant outcome (Sen 1970, p. 
3S). The actual relationship between state statutes and federal taw in the same area is of 
COTrse considerably more complex. In reality, the relationship is seldom olwious and 
le^ral courts are frequently called upon to sort out the overlapping authority of state 
and federal statutes (see Tribe 1978, pp. 378-86), 



DOES FEDERALISM MATTER? 


»55 

against x at the national level. A state’s residents gain when out-of- 
staters come to gamble and consume tourist services. The state would 
lose this advantage if gambling were legal in every state. In a unitary 
system, however, these same individuals might favor x over y since 
they would have no special advantage to protect. 

A voter’s preference for x over a system with a variety of state laws 
will often depend upon the number and location of x-type states. The 
economic advantages of permitting gambling in one’s state of resi¬ 
dence are larger the smaller the number of other states which also 
permit gambling. The benefits are also higher the more geograph¬ 
ically distant are the other jurisdictions with legalized gambling. 
Let gi = Z( if state i has passed an x-type law,g, = mj, if state i is ay-type 
state, and let H = total number of states. Then, the status quo in a 
federal system is a vector G = (g,,. . . , g„ . . . , g«) where g, = Zj or w,. 
Thus we can summarize the situation of those living in x-type states as 
z(G) and the situation of those in y-type states as w(G). 

In this portion of the analysis, I assume that migration across state 
lines is impossible, so that the only way individuals can affect the legal 
regime under which they live is to vote for new state or federal laws. 
Capital, however, is free to move between Jurisdictions. Given these 
assumptions, we can now compare the way voters with different 
preferences will cast their ballots in federal and unitary republics, 
i'urning first to a unitary system, individual preferences can be 
straightforwardly translated in a social choice. Law x passes if and only 
if; 


MxRy)>N{yRx). (1) 

where/? is the binary relation of “weak preference" (“at least as good 
as”). Then N{aRb) is the number of people for whom aR/> where /?> is 
the weak preference relation for individual j and R, is reflexive, 
transitive, and complete.® 

In a hierarchical federal system, under the assumption that only the 
current level of G is relevant, x passes at the national level if and only 
if: 


jV,[x/?z(G)] -I- A' 2 [x/?a>(G)J 3= AMz(G)/?x] + A,[«.(G)/?x]. (2) 

where N,(aRb) is the number of people living in x-type states for 
whoma/?^ and Nf(aRb) is the number of people living in y-type stales 
for whom aRjb. 

Expressions (1) and (2) need not yield the same resulus. Individuals 
who would prefer X toy need not also preferx to z(G) or w(G). If a 


* In Sen's terminology (1970, p. 9), R, is an ordering. The noution in (1) foUows Sen 
(1970, p. 71). 
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TABLE 1 



z{(,)Rx 

.xliwRi) 

U>((i)Rx 

xHy 

■f. 


II, 

(favor \) 

(positive spift-ins 

(lavoi x) 

(positive spill-ins 


lo X types) 


to y types) 

yRx 



■ R, 

(nc^alive spill-jiis 

(oppose v) 

(nej^alive spill-ins 

(oppose *) 

to \ types) 


10 y types) 



stale is able to export tlie costs of'a program, then its citizens are likely 
10 o[)pose preemptive federal laws even though they favor x-type 
legislation in their own state and would favor x over y in a unitary 
system. Similarly, if citizens can henelil from the legislative initiatives 
of other states then they may vote against .v although they have xRj 
with a unitarv government. The gambling example falls in the first 
categors. Public health or pollution control laws in neighboring states 
may produce the second voting pattern. In fact, individual prefer¬ 
ences may produce any of the possible rankings ofx.y, and i{(i) andx, 
and and one can thus divide the population into groups 

depending upon their preferences and the states in which they live. 
For a givett (i, let .1 (6) = .set of fieople in x-type states and let = 
everyone else. 

People living in .\-t vpe states have ) or )R\ or both. I'hose 

in set R (G ) have xRwRi ) or u'{G )Rx or both. In a unitary system voters 
have xRy or yRx or both. Table 1 is a matrix of the possible taste 
combinations. I he entties in the matrix are the sets of people in eat h 
taste class. Thus,•!(<';) = UA.,(G) U d df*) and «(f.’) = 

Bi(G) U B. 2 (G) UR:,{G} UB,(G).^ Jn expression (I), ,V(x/?v) = ,V(d, U 
At U B, U Bt). In expression (2), .V,(x/^^((';)J = .V,(d| U Ai) and 
)] = U B-i). Thus given fd the vote for x in a unitary 

system, R'ixRy), is greater than, equal to, or less than the vote in a 
hierarchical system,+ XtlxRw{G)l as: 

■Vf.T, U/T). (3) 

Expre.ssion (3) shows that federalism "matters." The size of the vote 
in favor of the national law may lx* higher or lower in a hierarchical 
federal system than in a unitary system. A law which passes in one 
system may lie defeated in another. J'he practical importance of this 
result, however, depends upon whether sets d.i, and dj, repre¬ 
sent common preference configurations. .Sets .4;, and B^i include indi- 

’ I he synifxil U stands lor che union ol secs, i.c., the set of people IjelonRiiiK lo eilhei 
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viduals who prefer x to as the national law in a unitary system but 
who each favor conditions in their own states over x." Sets A 2 and 
are the reverse ot and B 3 . They include individuals who would 
oppose* in a unitary system but favor preemptive legislation because 
it keeps them from losing at the expense of other citizens. 

If .sets 2 andfl 2 are large relative to sets/I,-) and /I..,, then a paradoxi¬ 
cal result is possible. Although federalism is often justified as a way to 
pre.serve diversity and prevent the centralization of power, (3) implies 
that the existence of low-level governments may cause some voters to 
favor central control of an activity that they would otherwise have 
wanted unregulated. Without constitutional limits, a majority of citi¬ 
zens may be so unhappy with the mixture of independent state 
choices under federalism that they favor a more powerful central 
government than they would in a unitary system. 

III. Migration and the Vote for National Laws 

When voters can move between jurisdictions in response to public 
policy changes, the vote for national preemptive laws will change. If 
migration is costless and if legal regimes are the only determinant of 
location, then, given G, national electoral support would fall in a 
hierarchical federal system. Voters are less likely to want naticjnal 
uniformity when they can move to congenial states. 

Reality, of couise, is somewhere between the extremes of a per¬ 
fectly fixed population and a perfectly mobile one. Furthermore, 
different types of people have different moving costs, and the relation 
between these costs will help determine the political power of various 
groups at the state level. Thus, mobile groups may impcjse costs on 
immobile groups as a price for not leaving the slate. Alternatively, 
undesirable groups may be induced to migrate by taxing or regulating 
them or by subsidizing their moving expenses. V'arialions in the cost 
of migration change the strategic environincni. Those who can move 
cheaply have an advantage over tho.se who cannot. Therefore, na¬ 
tional decisions which rec^uire uniformity will be strongly siippiorted 
by those who lo.se f rom the migration of others and opposed by those 
whose mobility permits them to gain at the expense of less mobile 
voters. 

The possibility of migration has implications for national programs 
which seek to redistribute income from high-income to low-income 
families. Economic analyses of multiple government systems (e.g., 
Olson 1969; Oates 1972) conclude that interjurisdictional mobilitv 

" ll al.wi, ol course, includes those who are indifrercnl between any of these alterna¬ 
tives. Only people who arc indilTereni to all three alternatives are in berth A, and A, or 
in Ixrth B, and B,. 
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makes serious redistribution impossible at the state level. Progressive 
tax and spending policies must be carried out by preemptive laws at 
the national level. Interstate migration, however, also lowers the 
prospects for passage of national redistributive laws. If voters arc 
mobile, support for a national preemptive law which redistributes 
income from the rich to the poor may be less than the support such 
legislation would obtain in a unitary system or in a federal system with 
no migration. Many who would support a particular progressive tax 
and spending program in a unitary system may oppose it in hierarchi¬ 
cal federalism if the law preempts state “beggar-my-neighbor” laws 
which benefit these citizens. Selfishness will dominate altruism if the 
opportunity cost of altruism is too high. 


IV. Does Federalism Matter in the Long Run? 

The result in (3) applies when the number ofx-type and v-type states 
is held (onstant (i.e., fora given G). One student of federalism (Riker 
1975), however, argues that in the long run a federal system and a 
unitarv one will converge to the same pattern of national laws. The 
view' that federalism docs not matter in the long run is based either on 
a hypothesis about how tastes change over time" or on a model that 
makes particular assumptions about the way votes change in response 
to new information and new circumstances (i.e., changes in G). The 
cases where lecleralism detes not matter are contingent upon these 
assumptions. 1 his sectiott shows that it is jxtssible to impose other 
plausible conditions on tastes, opjjortunities, and government struc¬ 
ture and produce an equilibrium where x- and y-type states coexist in 
spite of majority support for x ovei v. 

.My anahsis ol stale legislative choices challenges the idea ihat stales 
are "lighthouses" that show the way to the federal government by 
enacting innovative laws. In my model, a law may spread to many 
states, not because it has Ix-en “tc.sted” and found useful,'" but rather 

” Rikfi's (urns on the overndinx iinporianrc «l individual ia.slcs in drtei- 

inining poliiual ounonics He uses the examjiie ol while racism. "As long as whites 
stronglv preter racist insiiiiitions, one ran cxpccl insiiiuiions to Ijr rarist regardless of 
whether ihe couiurs is tederal or uniiary. But wlien the prefcrciKC fcri racist institu¬ 
tions weakens, then lederalisin hel|>s laiisin by rendering difTicuil the cnrorcemem ol 
an anti-racisi polics on the itiiitonis of white rat tsts. So we can say that the hcncheiaries 
of federalism gel only marginal fienefiis on polity, but marginal or not, they are 
undoubtedly real" <1975. p 136). 

'* Sec Walker 11 1971. 1973), By his choice of words. Walker appears to believe 

that early adopters ol laws are piogrrssivr. He calls them "pioneers" (1969. p. 881). A 
glance at Walker's list of laws should warn anyone against this inference. Many involvi 
the licensing of tKcupaiions suth as liarbers or real estate brokers. Others simpl' 
mandate the establishmerii of slate agencies, many of which are required as a conditioi 
lor receiving federal grants. For a different penpeclive on the same tasue whic 
emphasizes politicians' incentives to take risks, see Rosc-Ackerman (1980). 
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because voters in ^-type states want to avoid damage at the expense of 
others. Similarly, a national law may eventually pass in a federal 
system, not because a new initiative has been tested in the states, but 
rather because voters want to override the costs of spillovers and 
inconsistent laws. 

Since a fully general analysis would be difficult to interpret, I 
concentrate on three special cases that capture the essential features 
of many actual situations. In these examples, 1 assume that tastes do 
not change over time, that voters have similar initial information, and 
that benefits that spill over to people in one group of states are costs to 
{jeople in the other. In case 1, voters inx-type states receive positive 
spill-ins from out-of-state residents, and voters in y-type states bear 
costs. In case 2, voters in x-type states receive negative spill-ins from 
y-type states, and voters iny-type states benefit at the others' expense. 
Finally, in case 3, voters’ beliefs in “states' rights” may conflict with 
their substantive position on other issues. 


Case 1: Positive Spill-ins to x-Type States 

Examples of case 1 are casino gambling or state-run lotteries in states 
where voters are concerned only with lax revenues and jobs. Taxes 
imposed on products sold to out-of-state consumers also fall into case 
1. In these situations, dj and B, are empty sets." Thus (3) becomes 
1 A'(Bj). 

Since an x-type law leads to positive spill-ins, it is plausible to 
suppose that over time more and more states passx-iype laws (i.e., the 
set C changes). A few states institute lotteries, for example, and 
eventually many more follow their example. If this happens, jxrople 
in setB now join set A. .Many who favored a national lottery (i.e., those 
in and Bj) may now oppose it as a way of preserving their newly 
actjuired positive spillovers (i.e.. they move and/'f 4 , respectivelv). 

As more states pass x-tvpe law's, however, the level of net positive 
spillovers to each x-type state falls. In the case ol casino gambling and 
lotteries, a slate’s c<»nsumers are less likely to gamble in other slates if 
their own state has an x-iype law.'* In addition, negative spillovers to 
y-type slates increase as more and more .states export costs to them. 
The decline in net benefits and the increase in net costs as G changes 

" Voters in At favor preemption but oppose x in a unitary svnem. Voters in A, 
oppose prcempiKin but favor policy x in a unitary system. Both of these preference 
patterns seem to be implausible if x-type states benefit at the expense of s-tvpe stales. 
I his case would not hold if some people think that Kambfing brings costs (i.e.. crowds, 
ctirruption). 

“ Similarly, tax incentives to encourage industry to locate in a jurisdiction provide 
few benefiii to taxpayers if most states provide equivatcni incentives. For an attempt to 
show the costs of interstate competition for business investment, see Jacobs (1979). 
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will raise the vote for a national preemptive law in individual x- and 
v-type states. Thus, a federal structure generates two influences that 
operate in oppwisite directions. A person whose state institutes a lot¬ 
tery is more likely to oppose federal preemption. However, as the 
numher of states with lotteries increases, the benefits of multiplicity 
fall. If the first factor dominates the second, federalism will continue 
to matter even in the long run. 


Cm/' 2: Posttwr Spilloi’m to y-Type States 

F.xaniples ol case 2 are minimum wage laws'^ or fair labor statutes that 
increase job opportunities in stales without such laws and state pollu¬ 
tion control laws where water and ait cross slate lines. Since in case 2 
federalism permitsy-type states to gain at the expense ofx-iype states, 

and B., are implau.sible preference patterns. Thus (3) becomes 
MB,) S A'(.T). 

The majorit) ol voters iitA-ty|re stales have decided that, in spile of 
the (osts, the law is worth having. Suppose that as lime passes, voters 
in \-type states learn from the experience of those who have x-iype 
(e.g., minimum wage) laws. If the benefits are greater than expected, 
then eventually more states pass minimum wage laws. People whose 
states enact such laws are now more likely to favor fedeial preemp¬ 
tion. They no longei obtain the benefits of spillovers from other 
states, and they might like to prevent other slates from benefiting at 
their expense. Ilowevei , as the number of .v-type states incieases (G 
changes), the costs imposed on each x-iypc* state fall and the lx*nefits to 
each y-lype state ri.se. I his shift in costs and benefits should redutc 
support for a preemptive law in individual x- antiy-type stales. If ihe 
second (change m G) effect outweighs the first (change from B to .4) 
effect, a national minimum wage law may not pass in a federal system 
even iliough many states have passed their own laws and N(xRy) > 
N(yRx). Historical research, however, has sire.ssed the cases where 
state initiatives led eventually to a federal statute,'^ leaving un- 
analyzed the many situations where the adoption of a law by many 
states has ncjt been followed by a preemptive federal initiative.'''' 

Rikcr (i96‘l, p 146) uses the exaniplr of minimuin wage laws that differ across 
slates. "There is then much likelihcKHf of capital flow from the high-wage liMalities to 
the low-wage lcK:ahiies for all those indu.slries in which labor represents a high propor¬ 
tion of the cost. Aside from the iinposilion ol a nationally uniform minimum wage, the 
only way that high-wage localities may rounlei this lapiul How is by reducing the 
minimum wage level." 

” See Fine (1956. pp. 3,53-85), who disemscs the examples of anlitru.st laws, pure 
ff>od and drug laws, proleitivc latwr legislation, social security, and welfare. 

Many of the laws studied by Walker (1969) have spread across many states but fiavc 
never been adopted nationalb. It is possible that .some ol these statutes might have 
maforily support in a unitary system. 
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Case 3; States’ Rights 

In this case, some people put their belief in states’ rights above their 
position on x (e.g., the abolition of slavery) and are in/fg and 83 . Thus, 
no one favors imposing x on all states if they alst) oppose x in a unitary 
system (i.e., and 83 are empty sets). Therefore, (3) becomes NiAj 
U 83) > 0. 

The vole for x in a federal system is certain to be less than the vote 
in a unitary system, and federalism may continue to matter so long as 
preferences remain constant.Assume, however, that people with a 
substantive preference for x over y favor slates’ rights only if the 
results of independent state choices arc not too far away from their 
position on x. I’hen, it is easy to tell a simple story where an unantici¬ 
pated change leads to convergence of a federal and a unitary system. If 
the number of y-type states (e.g., slave states) increases exogenously, 
people who are opposed to slavery may now choose federal preemp¬ 
tion over states’ rights (i.e., some of those in A 3 and 83 move to A, and 
8 ^). Therefore, x may now pass at the national level. An exogenous 
change that seemed to favor y leads in fact to its repudiation by the 
nation. Thus, it may not be necessary to assume a growth in Northern 
antislavery sentiment to explain the evolution of pre-Civil War poli¬ 
tics. As slave!y spread across the South after the invention of the 
cotton gin, even people whose antipathy to slavery was constant might 
have supported a national policy of abolition if they thought that 
more and more states would permit slavery (i.e.. they might shift f rom 


t«i svninirli y, wo (iiuld iiuludo a fourth case that appears to ho of loss oinpirkal 
im[>onantc In this final case, people frelicvc in iinifoimiiy or in a stronn (oiiiral 
^ovenimoni and are willing l>) support a national law even when the\ would oppose the 
law in a unitary sysioiii- I'hus.d, and fliarr einpiv sots. (3) Itoiomcs 0 < .Vt.t, U Bj). and 
iho vote fora is always largoi in a fodoial sv.stoni than in a unilars svsiem If v is a status 
ijtio whu'li porniils the private market to o|iorale with a iiiinimum of gosernment 
Intel lei eiue, then tase 4 is illusiraled by fuiMiiess managers who favor laisser-faire 
(yBa) but if fared with a mixture of differing Mate laws would rather have a uniform 
fedeial regulalort statute Iteeaiise tl |>crmil.s them to redure <o.sts Employers who .sell 
itieir pnxiuels iti many states oftcti ftl this prefeieme |>attrrn. They would rather not 
Ix’ tegulated at all. hut being regulated in some stales is worse than a uniform national 
Maiidard. For example, ss-lters ol iKitiled mineral water are lieginning to laic a vaneiy 
of labeling laws marled by dilfereni stales Arrordiiig loButineu Wrrk "Most bottlers 
say they would not oppose rcasotiable. uniform regulations. . Industry exeiutives 
shudder at the pros^ieri of trying to comply with iiilrs that ruuld vary widely among 
states ’ ("Mineral Water Caiuld Drown in Regulauon." flu.tinei,i Wrrk (June I 1. 1979]) 
■State trucking regulations arc sometimes iiicoiisisieiil and rosily for truckers Thus, 
state regulations of the mudguards retjuireil on iniersiale trucks were inconsistent, and 
an Illinois law was overturned by the Supreme Caiuri a,s mierfciing with interstate 
commerce (Bibb v. ,Navajo Fieight lanes. Inr., .^.W IKS. 520 11959], discussed in Tnbe 
[1978. p. 3391). Similarly, the Supreme Court ruled that Wisconsin's prohibition against 
twin trailers on interstate highways was an unconstitutional interference with interstate 
commerce ("Breaking a Bottleneck in Long-Haul Trutking." flu-uiwo HWk [March 7. 
1978]). 
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Ag to A,). When slavery looked as if it might expand into the western 
states, people who had favored a federal solution might begin to 
support abolition.” 


V. Conclusions 

Political economists have generally recognized that realistic political 
systems must include some interjurisdictional spillovers as a cost of 
providing citizens with a choice of public service levels. Intergov¬ 
ernmental grants may reduce the.se interjurisdictional costs (Breton 
1965) but will not eliminate them. Because both the dispersion of 
tastes across the population and the level of externalities differ for 
different public services, some analysts have argued that a federal 
system is the best way to accommodate these conflicting tendencies 
(e.g., Oates 1972). In making this recommendation, however, these 
authors fail to make clear an important idealization inherent in their 
analysis. They assume that it is possible to assign functions unambigu¬ 
ously to levels of governments so that constitutional structure only has 
an impact on spillovers and on the position of minorities. In fact, it 
will often be impossible to assign responsibilities neatly to a particular 
political level. Indeed, this overlap is the characteristic feature of 
contemporary “cooperative” federalism. Given this fact, the present 
paper has shown that, when authority is divided, the choices of 
lower-level governments can have important consequences for the 
decisions of higher-level governments. Even when the central gov¬ 
ernment has the power to preempt state and local laws, its democratic 
choices will depend upon the strategic position of citizens living under 
alternative state legal regimes. The essential difference between a 
hierarchical federalism and a unitary system is the difference in the 
status quo. This difference affects the vote on national legislation and 
the bargaining power of individuals. Individuals may vote against the 
extension of a state law to the nation as a whole even though they 

” Potter (1976), in a tiisiory of the period from 1848 to 1861. stresses the difficult 
tradeoffs faced bv many people. He writes chat “the problem for Americans, who, in 
the age ol Lincoln, wanted slaves to be free was not simply that Southerners wanted the 
opposite, but that they themselves cherished a conflicting value: they wanted ihe 
Clonstituiioii, which proiected slavery to be honored, and the Union, which was a 
fellowship with slaveholders to be pre.served” (pp, 44-45). The Northern public “placed 
their aniislavery feelings in a context of state action, accepting personal responsibility 
for slavery within their own particular states" (p, 46). In conlrasting the position of 
Stephen Douglas in 18.'>4 with that of his opponents. Potter shows how these trade-offs 
were resolved in different people. “Oou^as was a vigorous believer in the democrauc 
principle of local autonomy, but his opponenu were equally vigorous believers in the 
moral primacy of freedom. . . . Douglas cared more about the Union than about the 
eradkadon of slavery and would never push the slavery issue to a point where it 
imposed too much strain upon the Union. Many andslavery men thought the Union 
hardly worth preserving so long as it had slavery in it" (pp. 172-73). 
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would favor the law in a system with only one government. They may 
wish to extend a state law to all citizens although they would oppose 
the law in a unitary system. 

The analysis suggests a promising area for future empirical re- 
search. Congressional votes on particular issues might be associated 
with existing state laws,'’ and the timing of federal passage of laws in 
several areas could be related to the patterns of state adoptions of 
similar laws. Empirical work is also needed on specific policy areas. 
For example, one might study the impact of local power over schools 
on state and federal education policy or see how federal efforts to 
reduce poverty have been conditioned by prior state and local efforts. 

My discussion of federalism also suggests that, at the point of 
constitutional choice, people are more likely to support a federal 
system with strong lower-level governments, the fewer the strategic 
possibilities open to individual states. Strong low-level governments 
will be attractive if people are grouped geographically by their taste 
for public services and if it is difficult to impose costs on other 
jurisdictions. This implies that many people would support constitu¬ 
tional constraints on state governments that limit the states’ strategic 
behavior while retaining many of the benefits of variety and ex¬ 
perimentation. 1 would also expect that, if interjurisdirtionai migra¬ 
tion is possible, then those who favor aid to currently di.sadvantaged 
groups at the expense of mobile capital and labor resources will favor 
a strong central government.'* It is a commonplace in economic 
analyses of federalism to note that low-level governments cannot 
carry out redistributive policies (e.g., Oates 1972). My point, however, 
goes beyond this conclusion to the observation that if states try to gain 
at the expense of other states, then interstate redistribution can occur 
that bears little relation to anyone's notion of s<K:ial Justice and may- 
end up making all households worse off. The end result depends 
upon each state’s strategic position, that is. its abiliiv to exjxirt taxes 
and import benefits. 
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The il<'ma»d didrihutioti is ciiiplou'd as a novel anti fxiwerful basis for 
:inal\/in>; nionopol> piovision of |)ricc-ex( liitlable public goods (and 
(OilId be used to anab/e other market structures or, in some circuin- 
siatKos, private goods provision). For (iinitorin) per-unit, all-or- 
iione, ivvo-pari, and multipart pricing, we identify: characteristics of 
leveiuie (uiutioiis, relative profitability, and operational procedures 
for seU'cniig pri<e-oiiipui levels. Uiidei all these strategies, rationing 
of some (onsumers hy out/wl is rccpiired, and a {lositivc prit e-ouipni 
relationship mav arise. In general the revenue ranking of pricing 
strategies is sensitive to the distribution of demand and, though 
mulli|iart piking laii be ex[x*cted to be revenue dominant, uniform 
per-iinii piking eiiieiges as a surprisingly lobiisl strategy even Ix*- 
foie operational (omplexilv is fully taken into account. 


Recent contributions to the economics literature rcHect a growing 
interest in private provision of commodities which possess one 

Work on this paper was coniplcicd while both authors were in the Department of 
Economics, Monash L’niversity. Ihe notion of the demand distribution, to our knowl¬ 
edge, was fii-si suggested in earlier unpublished work ol the first-mentioned author. We 
have benefited from comments of participants in a Monash workshop, a Virginia 
Polytechnic Institute seminar, and the 1978 Public Choice meetings. John Head, Vew- 
Kwang Ng, Geoffr^ Brennan, and DavKi Eriedinan provided useful observations, but 
our greatest debt is to William Oakland, whose extensive and perceptive comments 
have been of considerable assistance. The authors nonetheless accept the conventional 
attribution of errors to themselves. 

{Jtnnul PMniral hmmmj, IVSI. >01 SV. no II 
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characteristic of a public good (“jointness,” or “nonrivalness in con¬ 
sumption”) but not the other (“impossibility of exclusion”); Exclusion 
is assumed to be commercially feasible, so that anyone unwilling to 
pay the required price can be denied access to any or ail of the units 
produced.' This important class of commodities can be termed 
price-excludable public goods or (reflecting a characteristic they retain 
rather than one they lose) joint goods. Television and radio transmis¬ 
sions and the services provided by transportation facilities (rivers, 
roads, airports, ships, buses, and airplanes), entertainment fa¬ 
cilities (theaters, galleries, museums), recreational facilities (national 
parks, football stadiums), and information serve as examples, of vary¬ 
ing degrees of purity;* All are important in modern societies, and 
virtually all have been suspected of being subopiimally provided by 
the private sector. 

Competitive models dominate the published literature, and Samu- 
elson’s (1954, 1955) seminal analysis of pure public gocxls has clearly 
influenced the questions asked and the methodology employed. 
Analysis of the behavior of and interaction between individual profit- 
seeking entrepreneurs has been neglected in favor of somewhat 
speculative predictions of the likely characteristics of market equilib¬ 
rium so that efficiency issues can be immediately examined. This is 
true, to some extent, even of the important contribution of Oakland 
(1974), in which it is suggested that competition will generate an 
equilibrium with unfamiliar characteristics in the joint goods context, 
with consumers facing an increasing step function of prices for ho¬ 
mogeneous consumption units.’ 

In contrast, we present an analysis of joint gtxxls provision with the 
individual profit-maximizing entrepreneur as its focal point. How¬ 
ever, we depart from the analogous private gtxxls methodology in 
one important respect: The familiar notion of the aggregate demand 
curve is replaced by the demand dtstnbution as a novel, powerful, and 
operationally meaningful building block, reflecting the fact that the 
number of consumers of particular output units plays a critical role in 
joint goods analysis even under simple pricing strategies. L'sing this 

' See. in particular. Buchanan (1967). ITiompson (1968). Deinseir (1970). Oaltland 
(1974), Au.ster (1977), lee (1977), ami Brennan and VVabli (1979): and also working 
papers by Brennan and WaJ.sh (1978) and Borns (1979). 

‘ Pure joint goods (like pure public goods) would permit an indeliniteh large number 
of consumers. The examples cited (except perhaps informatxm) lall well shon of this 
but, as long as they remain uncongested, can still be analyzed as it thev were "pure" (like 
"local" public goods). By treating costs as varying with mtensitv of consumption, cxir 
approach could also be applied to the impute cases. 

* We accept Oakland's description of equilibrium, but his analvsis of its ailainroent is 
essentially circular. The process of estaUishmcnt of equilibrium must be substantially 
different from that in the private goods case. In fao, it may lie (hat the process is 
analogous to the seriatim introduction of "new” goods 
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basis, our central contribution is a general exposition of the neglected 
case of monopoly provision of joint goods, and for a variety of pricing 
strategies we are able to identify characteristics of revenue functions, 
relative profitability, and operational procedures for selecting price 
and output levels. Nonetheless, our framework can also provide a 
basis for analysis of competitive joint goods provision and for analyz¬ 
ing marketing strategies lor private goods, particularly where constant 
marginal costs prevail. 

In Section I we introduce the demand distribution, provide a 
geometric interpretation of it, and .set out basic assumptions and 
definitions. .Sections II and HI examine the characteristics of a range 
of feasible marketing strategies (applied uniformly across individuals) 
from per-unit pricing to sophisticated multipart pricing procedures, 
each of which has been discussed in the private goods literature and 
has a basis in common busines.s practice.'* Section IV contains an 
illustrative analvsis using a specific distribution,il form, while Section 
V brings together the main results with final observations. 

I. Framework of the Analysis 

For monopoly piovision of private goods under uniform per-unit 
pricing, information about aggregate demand provides an adequate 
basis for decision making. Since each output unit is exclusively con¬ 
sumed by a single individual, the profit-maximizing price will never 
be less than marginal production cost, and hence the firm will always 
produce as many units as are demanded by all individuals at that 
price. I'he aggregate demand curve thus completely specifies the 
relationship between per-unit price and both aggregate consumption 
and production, directly determining the revenue-output relation¬ 
ship. However, even for a private good producer, the information 
revealed by the aggregate demand curve is not suf ficient when dis¬ 
criminatory pricing schemes are being considered; for these pricing 
strategies, information is required on the “composition" of demand 
for the product. 

In contrast, for monopoly production of price-excludable public 
goods, information on aggregate demand is inadequate even under 
uniform per-unit pricing. Since each production unit can be fully and 
equally consumed by all individuals, output need never exceed that 

* Dupuit ([1844] 1952) was an acknowledged authority on price thscriinmation and 
influenced the adoption, by the nineteenth-century French public .sector, of quite 
sophisticated pricing piocedures. More recent literature includes general analyses by 
Gabor (1955-56), Puu (1964), Ng and Wcisser (1974). and Yamey (1974), and contri- 
butionson quantity discounting bj Buchanan (1952-53) and Berglund (1964). In many 
cases the analysis has been applied to goods with significant jointness elements. 
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required to satisfy the highest demand individual at any price. 
Moreover, the per-unit price faced by each individual can be less than 
marginal production cost since many units are jointly consumed, but 
this would necessitate the rationing of some high-demand individuals 
by output rather than by price. Consequently, not only does the 
conventional aggregate demand curve not define the relationship 
between price and output for joint goods, in general it need not even 
define the relationship between price and aggregate consumption. 
Operationally, given his knowledge of production costs, the producer 
of a joint good will be concerned to identify the maximum revenue 
obtainable from (various) given output levels, and this critically de¬ 
pends on the composition of demand. Specifically, he will be in¬ 
terested in the number of individuals who would purchase (at least) 
a certain quantity when confronted with a particular (revenue-maxi¬ 
mizing) price, since this determines his marginal revenue. Com¬ 
pared with his private goods counterpart (who can work simply with 
output and price), the joint good producer must consider output, 
revenue-maximizing price, the number of consumers of the mar¬ 
ginal production unit. Accordingly, we have adopted as the funda¬ 
mental building block olOur analysis a construct which takes account 
of the critical distributional characteristics of demand, is based on 
variables that may Ite empirically observable in the normal process of 
operation, and has a basis in conventional demand theory. This con¬ 
struct we term the ‘‘distribution of demand" or, more succinctly, the 
demand distrUmliou. 


The Nature uf the Denuind Distnhutwn 

A point on the surface of ihe demand distribution defines the number 
of individuals, «, who would each consume at least q units of output if 
the joint good was made available at a per-unit price of Formally 
(adopting the conventional private gotnl assumptions that we are 
dealing with large numl>ers of individuals and that prices and quan¬ 
tities are divisible) we assume that the distribution may be represented 
by a well-defined function, n = n{f>jj), with the properties 


dn dn 
dp ’ dq 


< 0 , 


( 1 ) 


so that, by the implicit function theorem, the inverse function, p = 
p(n 4 j), is al.so well defined. 

It is appropriate that we should briefly discuss the basis for these 


* Note csp. that « tl<x?s 1101 relcr 10 ihc number who wirulrl consume .some pan of 
output g, but rather 10 those who would consume 11 all, 
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Ki(.. 1.—Crf)S 5 section through demand cliMribuiion with price constant at p'* 

specific assumptions. In figure 1 we show, for a particular per-unit 
price, how the number of individuals consuming each unit of output 
is functionally determined by the distribution of demand. For n here 
to be a well-defined function of q clearly we must have the demands of 
individuals arranged in order of magnitude; but, more than this, we 
require that there never be a significant number of individuals with 
identical demands (at any uniform price). 1'hus, not only have we 
ruled out the possibility of a significant number of identical individual 
demand curves but also (perhaps more interestingly), while accepting 
that many such curves may iiitcrseri, we fiave ruled out the possibility 
that a significant number will intersect at identical values oiq (and p). 
The strict inequality assumed in equation (1) also involves further 
considerations. First, it implies that variables which inHuence the 
pattern of demand (e.g., tastes, income, and wealth) are assumed to be 
distributed continuously among the population. Were this not the 
case, significant changes in q could lie ass(x:iaied with zero changes in 
n and a well-defined inverse function,/i(n,^), may not exist. Second, 
besides these implications regarding the relationship Ijetween n and q 
(with/i constant), equation (1) also implies the assumption that indi¬ 
vidual demand curves (defined as q, = qtlp]) are downward sloping." 

* Williain Oakland has suggeste<l a neat demonstraliun of how the d'niributional 
function, n(p^), might be derived formally (mm individual demand behavior. Let the 
inverse demand relations of a subset of individuaJ.s be giyen by 6 with the shift 

parameter distrihuied arjording to/(a) over (a4)- Then, for given p and q, n is 
determined by n(a) = / ^ Moreover, the intersecting case is accommodated 

immediately by considering any number of different classes of such demand relations. 
E.g., with two classes, p = p(qA) and p - q>{qj>), we would have separate density 
functions for a and b and define n ^ n(a) + n(b). 
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The sum of all these restrictions yields the demand distribution as 
defined above, a geometric representation of which is provided in 
figure 2. Here the boundary values P, Q, and -V define, respeaively, 
the maximum price that would be paid for the initial unit of output, 
the maximum quantity that would be demanded by any individual at 
zero price, and the total number of individuals. Also shown are the 
solids whose volumes indicate the revenues obtained from alternative 
per-unit pricing arrangements associated with a particular output 
level, rhese revenue volumes reflect the fact that, for a joint good, 
each unit produced may be consumed by all individuals willing to pay 
the price, p, and are especially relevant to the analysis undertaken in 
Section II. 


Thf AU-or-Nolhing Distribution 

The incorporation into our analysis of the (desirable) feature that 
individual demand curves may intersect has important consequences 
for the treatment of all-or-nothing procedures. With respect to the 
perspective provided in figure 1, the presence of such intersections 
means that the order in which individuals' demand curves are ar¬ 
ranged must vary according to the (constant) price level considered if 
ordering with respiect to magnitude is to be preserved. Thus, in 
general it may not be assumed that a cross section through tlie distri¬ 
bution with n held constant will indicate the demand relation of a 
particular individual (although this will be so in the nonintersecting 
case). Even if we make the conventional assumption of zero income 
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effects so that p(n,q) indicates the marginal evaluation of different 
consumption units, the integral of this inverse function, given n, 
cannot be assumed to measure the “willingness-to-pay” of a particular 
individual. For this reason, and because u.sers of all-or-nothing pro¬ 
cedures will generate market observations on variables different from 
those relevant to the per-unit pricing case, the analysis of these strate¬ 
gies (presented in Section III) has been undertaken with respiect to an 
alternative (hut related) distributional construct, the all-or-nothing dis- 
tnhution. A point on the surface of this distribution defines the num¬ 
ber of individuals, n, who would purchase a bundle of q units of the 
good at an average (per-unit) charge C rather than go without the 
good altogether. We assume, analogously to the demand distribution, 
that the all-or-nothing flisirihution may be repiesented by a well- 
defined function, n ~ n{('.,q), with the properties 


()n 

m: ' 


dn 

dq 


< 0 , 


( 2 ) 


again ensuring that the inverse lumtion.C = (i(n,q), is well defined. 


Some Useful Definitions 

Finally, later disiussion is considerablv assisted by referente to par¬ 
ticular concepts which may be formally defined ;ii this stage. Fhe 
definitions made here with respect to the demand distribution may 
also be applied to the all-oi-nothing disti ibution. 

Deitmiion I; A level of output 9 will lie lenned r//irieii( with respect 
to (a partunlar pruing pnuedure) if under that procedure; Riq) > Ii(q*) 
^ q* < q (where R denotes total revenue). 

Deuntiion 2: A regular distribuUon of demand has the property; 
d'^R(q„)ldp^ < 0 (where = p[ti,qo\ ■ «[p,c/„|). 

DEKiNirioN 3: A weakly restricted distribution of demand has the 
property that; dp*[q„)ldqo < 0 (where is that price which 

maximizes /fl^ol). 

Definitions 2 and ate restrictions on the nature ol the demand 
distribution, the former reejuiring concavity of the revenue function 
relating to consumption solely in respect of the t/Bth unit of Output.^ 
Some of the more complex operational results derived in the paper 
require these plausible restrictions to hold, and in lhe.se ca.ses the 

’ Definition 2 reflects the conventional siinplihcatinn adopted m private gtxxl analysis 
that niatginal revenue functions are strictly downward sloping. I'he reslrK tion implied 
by dehnition 3 is lent plausibility by the observations that it will always be true if cross 
sections through the distribution lor i/ constant arc (mathematically) similar and that 
even where this is not so the overall trend in p*(if) must be negative. 
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results have been presented as propositions with proofs clearly 
identified in the text. 

II. Uniform Per-Unit Pricing 

We have previously emphasized the important distinction between 
output and aggregate consumption in the joint goods context. Since, 
operationally, the entrepreneur will be concerned with the profitabil¬ 
ity of various levels of output and because his production cost condi¬ 
tions need have no unusual features, our analysis of alternative mar¬ 
keting strategies can focus on his search for the revenue-maximizing 
price (and/or charge) associated with each potential output level and, 
more generally, on the characteristics of the revenue functions he 
faces. Initially, we suppose that the joint good producer seeks to 
maximize profit while setting a price which is uniform across both 
individuals and output units. 

Baste Charartensties of the Uniform Per-Unit Prue 

I'he unusual nature of his pricing decision is made transparent by 
first supposing that the entrepreneur behaves analogously to his pri¬ 
vate good counterpart, allowing all individuals to consume as much as 
they wish at the prevailing per unit price. Since, for joint goods, 
output wouhl be then determined by the highest individual con¬ 
sumption level demanded at each price, this strategy’ tan be alter¬ 
natively represented by the assumption that at any output level the 
producer selects the maximum price at which anyone is willing to 
consume the entire output. This prtKedure we call Maximum Uniform 
Pnring (MUP): In figure 2, for output q it inyolves setting price 
p,, at which price the highest demand individual consumes the en¬ 
tire output, and a total of n, individuals consume at least part of it. 

I'he revenue yieUled by this pricing procedure, for any output level 
q„, is given by: 

«((y„)/MUP - f p„ ■ n(p„.q)dq, (3) 

where/i„ = /»{(), 90 ) and is constant throughout the integration. Differ¬ 
entiation of (3) with respect to 90 yields the marginal revenue (MR) 
function r" 

MR(9u)/MLJP = •i(po^q)dp 

r* d I 

" Discussion of the appropriale algrlnaic methods f«>i derivation ot tins expression is 
to be found in Courani (1960), p. 220. 
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Note, however, that the last term in this expression, representing the 
effect of a change in output, price held constant, in the limit ap¬ 
proaches ^ero because n(pQ,q(,) itself tends to zero. Thus, MR( 9 o)/MUP 
can be written as; 

MR(yo)/MUP = (4a) 

This marginal revenue function has a number of interesting 
properties (identified in the illustrated example in fig. 4 below), but 
more importantly it immediately suggests that for joint goods the 
optimal (i.e., revenue-maximizing) pricing strategy differs from MUP 
(the private goods analogue). Any level of output which \% efficient xtrith 
respect to MUP must have MR( 9 )/MUP > 0, and it follows directly from 
(4) and (4a) that, at any such output, revenue would be increased by 
lowering price (with output held constant). This is clearly illustrated 
in figure 2, where for output ijr, lowering price from p^ (i.e.,plq]/MLJP) 
to p^ increases total revenue: revenue declines from the original 
consumption units, but this is more than offset by the fact that each 
output unit is now more intensively consumed. Thus, t/ie Optimal 
Unijorm Price, p(q)/OUP, mil he less than the Maximum Uniform Price, 
plqjlMUP, at (in\ output lex’el where marginal cost is positive. Moreover, 
some mdividuals (all those for whom qi{p\ > q) will be rationed by output 
lohile others (for whom qi[p\ ^q)are rationed by price, and this is indeed a 
general chaiactet istic of optimal pricing for joint goods which does 
not ap[)ly for private gocKls.* 


Some (ieneral Ptoperties oj Rei/enue Functions under OUP 

.Some insights into more general properties of the relevant revenue 
functions are straightforward. If OUP is charged for an output level 
qt), we must have 

d[/((9„)/OUPJ 

--= 0. (5) 

It follows trisiallv, either from arguments similar to (3) and (4) above 
or from geometric considerations, that: 

MR(<?„)/OUP = [/r(?o)/OUPl • n{p(q„)iQ\J?,q„]. ( 6 ) 

“ t he possitilc need to ration by output for ))cr-unil pricing was first recognized in 
Brennan and Walsh (1978). Strictly speaking, rationing applies only if at the profit- 
maximizing ouiput MO > 0 (.sec sutnection, "Some (icneral Properties of Revenue 
Functions under OUP," following). Moreover, while lorp > MC > 0 rationing cannot 
sensibly be consideretl to apply, the range of output levels for which this occurs is 
ne^tgible when brge numbers of consumers are involved. It should be observed, also, 
that the use of continuous mathematical methods makes it appropriate to regard 
nlp(qt)lOVF,qt] > 0 as indicative of rationing even for the special case of 1 > 
nip(q,)tOVPaft] > <>■ 
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That is, under OUP marginal revenue (with respect to output) will be equal to 
the revenue obtained solely in respect of the final (i.e., least consumed) unit 
of output. Regarding the characteristics of the function MR(^)/OUP, 
it clearly can never exceed max [p(n^) ■ n(p,q)\ which, given the 
properties of the demand distribution (1) above, must decrease as 
output increases. Since MR{0)/OUP = max [p(n,0) • n{p,0)] marginal 
revenue under OUP must obtain its maximum value at zero output, 
although it may sometimes increase with output at some nonzero 
production levels.'® 

The results (5) and (6) also yield insight into the characteristics of 
OUP when output is selected to yield the maximum possible revenue. 
Since this situation must involve/»(^)/OUP > 0 but MR(^)/OUP = 0 it 
follows from (6) that there will be no rationing. In this case the OUP 
solution has MUP characteristics and, since (5) always holds for OUP, 
is in fact identical to the revenue-maximizing solution for MUP. 


Output and Price Determination under OUP 

Since profit-maximization requires the equality of marginal cost and 
marginal revenue, we have begun by deriving and identifying the 
properties of the marginal revenue function appropriate to OUP, 
properties which (with respect to output) turn out to be similar to 
those observed in the private good ca.se. The actual process of output 
determination under OUP is not unduly complicated. Once the Op¬ 
timal Uniform Price has been (experimentally) determined for a 
particular output level it follows directly from (6), providing con¬ 
sumption of the final unit of output is known or can be estimated, that 
marginal revenue is also determined. However, the decision to in¬ 
crease or decrease output must be accompanied bv a decision re¬ 
garding price adjustment, and here the user of OUP is confronted, in 
complete contrast to the conventional (private goods) wisdom, by the 
possibility that the profit-maximizing price may rise as the profit- 
maximizing output increases, ceteris paribus. In order to show this we 
first derive a general result: 

Proposiiion 1: The Optimal Uniform Price for a given output 
level, /»(^)/OUP, may generally be expected to lie in the range 
Ixjunded by the previous such price,/^(^ - dq)IO\JV, and the price 
which maximizes revenue solely in respect of the t^th unit of output, 
p*(q). This will always be so if the demand distribution is regular. 

Proof; If a distribution is regular, for a given q and dq, the revenue 

The operationaJity of exprc.uion (6) does, of ct)urse, rwjiiire ihai ii is possible lo 
identify how many individuals consume ihe entire mitput. Lack of this information in 
no way invalidates the results derived for total and marginal revenue functions 
throughout the paper, but it must be recognized that praokal detcrinination of MR 
often may be less straightforward than is implied by the simplicity of the stated results. 
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expression p(n,q) ■ n{p,q)dq is a concave function of price, as there¬ 
fore must be the sum of any such expressions. The revenue obtained 
from per-unit pricing q units of output may be expressed as 
fl!'^‘‘'^’p(n,q) ■ n{p,q)dq + p(n,q) ■ n(p,q)ilq, which is therefore the sum 
of two concave functions of price. As sucli its maximum must be 
attained at a price hounded by tho.se prices which maximize each of its 
parts,/)(^ - /iq)IO\]? and p*{q). Even if the functions are not globally 
concave with respect to price, since they are continuous they must be 
locally concave at their maxima. Where the maxima are not at 
significantly different prices, as well as in those cases where con¬ 
vexities in the revenue functions are not severe, we might also expect 
p{q)/0\JP to be bounded as suggested. 0 

This conclusion, which is useful in the derivation of later results, 
also has some operational content. It suggests that in the absence of 
information about the trend of p*{q), larger output expansions could 
initiallv be atcoitiftanied by a price dcctmtc. and ior small increases in 
output (since p*\q -t- -» p*lq\) perhaps price initially should be .set 

between/t((/)/OLjP and p*(q). More immediately, however, it leads to a 
fundamental observation: 

Proi'osi riON ‘J: The Optimal Uniform Price,/>(^)/OUP, may .some¬ 
times increase as out put increases. However, this will never wcur if 
the demand distribution is rrf^ilar and nwakh reslmtrd. 

Proof: fhe first part follows directly from the possibility (illus¬ 
trated in hg. If, whet ep*n] applies strictiy to the initial unit of output 
and hence is equal toptlJ/OUP) that p{q - dq)/OV? may tie le.ss than 
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p*(q). For the second part, given a weakly restricted distribution, so 
that /?*(9 — dq) > p*(q), and regularity, it follows from proposition 1 
that 

/>(0)/OUP = p*((i) > p*((lq) 

:.p(0)/OVP > p(dq)IOVP > p*{dq) > p*(2dq) 

:.p(q - dqVOUP > p{q)IO\iP > p*(q). □ 

I’he phenomenon of (optimal) price rising when output is increased 
(i.e., for reductions in marginal cost) appears to be unique to the joint 
goods context. Moreover, even in the case of regular and weakly 
restricted distributions, the more conventional fall in price (as output 
rises) may be very small compared with that expected for a private 
good." Other unusual phenomena, though perhaps to be given no 
more attention than peculiarities which can arise in traditional private 
goods cases, may al.so occur. For example, the optimal number of 
individuals to ration may (fjerversely) increase as output increases. 


,iu Ovenunii of Results fat Uniform Per-Unit Pricing 

Except in the special case where marginal cost is zero. Optimal Uni¬ 
form Pricing of joint goods for any output involves a price lower than 
tfie Maximum Uniform Price for that output, and, consequently, 
some individuals will fie rationed bv output. Marginal revenue for 
OUP can fie defined straightforwardly as the revenue tibtained soleh 
in respect of the last unit of output; the characteristics of the .MR 
function are similar to those applying to private gtxxls; and pre¬ 
scriptions regarding price and output determination can be obtained 
with only slightly more difficulty than in the private goods case. 
However, the Optimal Uniform Price may increa.se as output in- 
cieases, and, even if this result does not arise, the price reduction 
associated with output increases may lx? small relative to those which 
would occur if the good was private. 


III. More Complex Pricing Strategies 

Observation of common business practice suggests that in the course 
of their marketing operations hrms can generate information which 

" Brennan and Walsh (I97H) isolate a result in wlik li prxe irwreases with ouipiit, l>ut 
one w.iirh applies to specitie ixiints (artualb “gaps") in the piicc-oiilpiil relationship 
and which appears to emerge fmm the small-nuinliers case thev <leal with. Our anaivsis, 
however, using large numbers, conhrms that the whole trend of prkes mas be upward 
over some output ranges. Our other result—that price falls mav be small—is illustrated 
by the linear distribution example in Sertitm IV since Pt'l > pt^l/Ol'P > PfS. where P is 
the price intercept of the highe.st demand iiKiividual. 
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All-or-Nothing Prtcing 

All-or-nothing type pricing schemes are exemplified by “season ticket 
only” offers for a series of operas or Shakespeare plays and flat-rate 
bus or subway fares. We assume that the same all-or-nothing offer 
confronts all individuals for a particular output bundle offered by the 
producer, though it is convenient to express this offer in terms of an 
average (per-unit et|uivalent) charge which must be paid for all units 
in the bundle. 'I'hc Optimal (revenue-maximizing) AII-or-Noihing 
Charge (OAN) will be such that the marginal consumer pays his average 
valuation for the units in the bundle and the OAN strategy inherently involves 
rationing by output. 

It is interesting to note that, in the special case where individual 
demand curves do not intersect, there is an obvious symmetry be¬ 
tween the all-or-nothing and per-unit pricing procedures; as is dis¬ 
cussed more fully below, in that case the revenue volumes (and re¬ 
lated algebraic expressicjns) may be similarly defined, except that the 
roles of p and n are reversed. However, to avoid these restrictions on 
the demand distribution we initially focus attention on the all-or- 
nothing distribution defined in Section I. Here, for a given output, j#, 
revenue is defined by; 

fffqo) = qt, ■ C(n,qo) ■ n. (7) 

where C is the (per-unit) all-or-nothing charge for all units in that 
output bundle. The optimal value of n and associated charge C may 




where dn* is the change in n when q is increased by one unit with C 
held constant (at its previous optimal level)." 

Operationally, marginal revenue thus defined may be obtained in 
just two steps: First, ItKate the optimal per-unit charge for a given 
output level (which gives us n • C[n,qJ), and second, note the change 
in the number of consumers when output is increased by one unit and 
the charge (per-unit) is held constant (which yields the second part of 
eq. [9]). Evidently, MR(q)IOAN u not much more difficult to determine than 
MR(q)IOUP. 

Of more immediate intere.st, however, is the revenue yield of all- 
or-nothing charges relative to OUP. Insight may be obtained by 
limiting attention to the case where individual demand curs’es do not 
intersect. In these circumstances, /((^ol/OUP involves choosing p to 
maximize• n(p,q)dq whileAN involves choosing n to maxi¬ 
mize /*«n • p{n,q)dq (in the nonintersecting case pln,q] being, for 
given n, the nth individual’s inverse demand function). The natural 
symmetry between the two prtKedures leads direc tly to two observa¬ 
tions. First, for any distribution which is symmetrical in p and n (i.e., 
so that if p = a,n = b satisfies n = so mustp = b and n = a) for 

some scale of measurement of these variables, the revenues must be 
identical; second, the existence of some distribution for which R(q)/ 
OAN > /iL( 9 )/OUP implies the existence of a distribution yielding a 
contrary result. Thus, OA.W may he superior or inferior to OUP in renenue 
terms depending upon the characteristics of the demand distribution. More¬ 
over, the general properties of the two strategies may be quite similar. 
For example, by arguments directly analogous to those used in propo¬ 
sitions 1 and 2 it follows that the optimal i<alue of C may increase as output 
increases, but not if the all-or-nothing distribution is both regular and weahly 
restricted. It is also possible that the number consuming a given bundle. 
n( 9 )/OAN, may sometimes increase as output increases." 

“ See again Courant (1960). As an approximauon, ihc linear form MR(o)/OAN “ 
MC/2 may be useful in estimating whether the unit output expansion should be tried. 

” This can be seen easily for a distribution with nonintersecting demands (but no 
other restrictions), since then OAN analysis can be conducted with the bask demand 
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Given that selection of the Optimal AIl-or-Nothing Charge may be 
only slightly more difficult than selection of the Optimal Uniform 
Price, there is virtually no theoretical basis for choosing between the 
two procedures. This should cause little surprise because the degree 
of complexity underlying the maximization problem faced by the 
entrepreneur is identical in both cases. Alternatively, insight into the 
ambiguity in revenue ranking may be obtained from the realization 
that C{q)OAN ma\ he S p(q)/()UP depending upon the distribution and 
(compared with a uniform per-unit price equal toC) that OAN involves 
the additional consumption of {lower-valued) units by some individuals while 
excluding all (higher-r'alued) consumption by others.'* 


Two-Part Pricing 

The characteristic feature of Two-Part Pricing (TPP) is the combined 
use of an entrance (or license) fee and a uniform frer-unit price—a 
combination which is commonly used in the marketing of recreational 
and entertainment services. The essence of the TPP case can be 
captured in the (bllow'ing; 

Proi'osi iion 3; fhe introduction of a license lee in conjunction 
with uniform per-unit |)riting may reduce revenue. However, if the 
demand distribution is both regular and weakly restricted and p(q)l 
OUP is used as the per-unit price, revenue is necessarily increased. 

Prooi; With uniform per-unit pricing, more individuals consume 
the “initial ' unit of output than any other unit. The marginal intro¬ 
duction of a license fee may be viewed as raising the price of the initial 
output unit wiihcrui affecting the price or consumption of later units. 
The revenue effect will, therefore, be 

d[p(n,0) ■ ri(p,0)] 
dp 

and even (or a regular distribution this may Ire positive or negative 
depending upon whether the original per-unit price was below or 
above p*{{)) (the price which maximizes revenue from the hist unit 
alone). It follows directly from proposition 2, however, that if OUP is 


distnbuiion but with f> and n reversed. Whatever holds lor OLT', an analogous tesult 
holds for OAN I bis siinilaniy Ijetween OAN and OUP, moreover, suggests an 
analogue to MUP which might be termed “simple all-or-noihing pricing" (SAN). Koi 
output y, it involves setting a charge so that all who have positive marginal evaluations at 
that output are perniittecl to consume, the marginal consuinci paving his total evalua¬ 
tion. H(q)/SAN = /<(y)/ML'P for nonintersecting demands with the distribulion sym¬ 
metrical in p and n 

'* Indeed, for the linear distribution used in .Section IV we have both that C(y)/OAN 
= p(q)/OUP and that the additional and excluded consumptions are identical. There¬ 
fore, R(q)/OAN = R(q)/OVP. 
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used and the distribution is both regular and weakly restricted, the 
revenue effect must be positive. □ 

That this proposition has two distinct components is quite inten¬ 
tional, since together they shed light on a trade-off which gives rise to 
the existence of an interesting possibility. From the first part of the 
proposition, if a given output level has a uniform price greater than 
p*(0), the introduction of an entrance fee will cause a greater loss of 
revenue from now-excluded consumers than is gained from entrance 
payments by the remaining consumers. From the second part of the 
proposition, the two effects yield a net revenue gain for a given 
output level for any uniform price lower than />*(0). Insight into the 
distinction between the two cases may be aided by regarding the first 
as involving a relatively high per-unit price and small number of 
consumers and the .second as involving the reverse.'* However, in 
lowering the per-unit price to make an entrance fee profitable (in an 
absolute sense) the departure from the uniform p>er-unit price p(q}l 
OLIP may be sufficiently large to involve a greater loss of revenue 
than can ever be captured by the fee. Thus, for some distrihuttons it may 
tinier he profitable (relative to OUP) to use an entrance jee in conjunction with 
uniform per-unit pricing {i.e., TPP). 

Notwithstanding this interesting possibility, proposition 3 does sug¬ 
gest that under plausible conditions some form of two-part pricing 
will be superior to OUP, and our conceptual experiment of intro¬ 
ducing a marginal entrance fee suggests there is no great difficulty in 
gaining .some extra revenue. Indeed, it for any given output level the 
revenue-maximizing license fee and per-unit price could be deter¬ 
mined, there would be little difficulty in hxating the profit-maxi¬ 
mizing output level, since we must have: 

MK(q)/TPP = p ■ n(p,q), (10) 

w'here p is the relevant per-unit price for output level q. Phis unsur¬ 
prising result follows from arguments similar to those used to derive 
MR(^)/OLiP alxive and, indeed, it is an interesting properts of many 
marketing strategies that the marginal revenue function is defined b\ 
the revenue obtained solely in respect of the final unit of output. Fhe 
lea.son is straightforward. The optimal setting of the parameters for 
any such strategy implies that marginal changes in those parameters 
will not change the revenue associated with any given output level. If 
a marginal increase in output results in only an incremental change in 
the price parameter applying to final production units, then revenue 
with respect to the original output level will be unchanged, and any 


.Strictly speaking, allowance should also be made for the way n changes asp changes 
in the two situations. 
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change in overall revenue must be simply that amount obtained solely from the 
final output unit. 

Other results (related to those for OUP) follow almost immediately 
from (10), Except where marginal costs are zero, TPP will involve 
rationing of some individuals by output; and the per-unit price component may 
increase with output but it will not do so if the demand distribution is 
both regular and weakly restricted. In these latter circumstances it can 
also be established that the per-unit price element of TPP is less than 
p{q)fO\JP.'^ However, even allowing for this last insight and despite 
the simplicity of the marginal revenue function, the entrepreneur 
would still need to undertake innumerable experimental variations in 
both the license fee and per-unit price components at each output 
level. It remniris unclear, therefore, whether TPP dominates OUP (and, a 
f ortiori, OAN) in terms of overall (operational) profitability even when it does 
so in terms of potential revertue-racsing capacity. 


Multipart Pricing 

We now suppose that the producer considers confronting consumers 
with a .schedule of prices in which different units (or bundles of units) 
attiact a different price (though the same price .schedule is applied to 
all (onsurncrs). This strategy might be exemplified (though perhaps 
.somewhat imperfectly) by “quantity discount" offers for joint goods, 
such as are applied to a .series of orchestral concerts, operas, or plays. 

Obviously, the Optimal Multipart Pricing procedure (MPP) will 
involve pricing each unit separately at the price, p*(q), which maxi¬ 
mizes revenue obtained solely in respect of that unit (although, as 
we observe later, for operational reasons the producer may offer 
different sized bundles of units with a conip>osile charge, or average 
per-unit equivalent, equal to the sum of the revenue-maximizing 
separate prices for the units it contains). The price-output relation¬ 
ship is, therefore, immediately defined and, as with other strategies, 
price may increase imth output over some ranges. However, in all cases the 
overall trend must involve prices decreasing with output, and for a 
weakly restricted distribution (this restriction directly relating top*[ 9 ]) 
the price-output schedule would necessarily be negatively sloped, with 
higher prices charged for the most intensively used (“lower level”) 
units of output. 


'* The reader may wish lo confirm why such arguments do not apply to the MRf^)/ 
O.AN function defined with respect lo parameters of the all-or-nothing distribution. 

" Given dCtdq, dtJdn < 0, the per-unit price component of TPP will be that which 
maximizes revenue in respect of the last (q — q*) units, q* being the minimum con¬ 
sumption of any (nonexcluded) individual. From propositions 1 and 2 this price must 
be less than p(q)IOVP given the stated restrictions. 



PRICE-EXCLUDABLE PUBLIC GOODS 


183 

For the MPP strategy, marginal revenue is trivially defined as the 
maximum revenue obtainable solely in respect of'the^th unit, {p*{q) 

• n[p*{q), 9 ]}, and, given the properties of the demand distribution in 
(1) above, MR( 9 )/MPP must decrease as output increases. Equally 
clearly, R(q)/MPP must exceed R(q)IOUP, except in the special case where 
p*(q) = p*(0)for all q, in which case the strategies are identical in all respects. 

This superiority of MPP might have been expected to extend 
straightforwardly to comparisons with other relatively simple strate¬ 
gies, such as OAN. Interestingly enough, no such exact result can be 
obtained unless the restriction is made that individual demand curves 
do not intersect. In these special circumstances, the marginal con¬ 
sumer of a bundle of units under OAN necessarily has the lowest 
marginal valuation of each and every unit in the bundle. The revenue 
obtained under OAN would, therefore, be equal to that obtained 
from a form of multipart pricing which makes the units available at 
the separate prices,/>„(^), given by this marginal individual’s demand 
curve. However, except where (quite Jortmtously) those prices are also the 
optimal p*(q) this OAN revenue must be less than that obtained under MPP. 
In more general circumstances where demand curves do intersect, it 
can no longer be presumed that all individuals who would consume 
the entire bundle at the OAN composite charge would still do so if the 
units were priced separately at pm(q)- Even though it is also true that 
some individuals who would not purchase the bundle at the composite 
charge would undertake some consumption at these separate prices, it 
remains possible that R(q)IO\N may exceed the revenue obtained 
from the multipart pricing arrangement usingSince we do not 
know by how much /^(^l/MPP exceeds this latter arrangement, we 
cannot be certain that it in fact exceeds /?(^)/OAN. .^nd this uncer¬ 
tainty extends to comparisons between MPP and any strategy involv¬ 
ing a lump-sum component, including TPP, It appears that the general 
expectation that MPP will be the rn'enue-dominaiU strategy, while intuitively 
plausible, can only be established in a probabilistic sense. Nonetheless (and 
notwithstanding some operational difficulties referred to below) MPP 
might be regarded as closest to a theoretical “best” procedure in 
revenue terms when interpersonal discrimination is ruled out. 

Two further aspects of the MPP strategy might be noted. First, the 
notion of (a schedule oQ prices related to “intensity of use" immedi¬ 
ately suggests a comparison between the MPP solution and Oakland’s 
(1974) competitive solution. Although for Oakland’s case the price 
schedule is unambiguously positively related to output while our 
monopoly counterpart may characteristically involve a negative re¬ 
lationship between price and output, the similarities are more than 
merely superficial. For a given demand distribution and marginal cost 
schedule, the Oakland competitive output and the monopoly output 
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under MPP musi be identical, since both solutions must be character¬ 
ized by marginal cost being equated to the “total price” (n ■ p) which 
maximizes revenue with respect to the last unit of output and, in both 
cases, this will in general be associated with rationing by output.'* (Of 
course for Oakland no above-normal profit exists on any unit, and 
consumption by inframarginal consumers is greater for his solution 
than under MPP because inframarginal units bear a lower price in the 
competitive case.) 

Second, however, given that the characteristic MPP solution in¬ 
volves a negatively sloped price schedule, the operational status of 
MPP differs markedly from that of Oakland’s solution. For MPP we 
must assume that producers can acquire information on the relevant 
revenue-maximizing prices, can prevent resale (or exchange) of units, 
and can prevent consumers from “reversing intensity” (i.e., purchas¬ 
ing lowei-priced units without first purchasing the higher-priced in- 
framargiiial units). Resale possibilities may be of little concern in the 
joint good context given the service nature of most of the examples. 
However, preventing reversals of intensity requires either that the 
producer can completely meter individual purchases or that he can 
pi e.sent consumers with a choice between bundles of units (containing 
1,2, 3, and so on, up toe/ units) each with a compo.site charge equal to 
the sum of the levenue-maximizing prices for the separate units it 
contains (a strategy we might call MPP*). However, for MPP* to l>e 
made operational, producers must be able to determine the optimal 
set of conifKisite charges (and hence the relevant p*lq]), and this may 
be a great deal more complex than isolating the p*(q) wheti intensity 
reversals do not occur. In general, firms would initially have to reduce 
output to zero and proceed sequentially—first determining the rev¬ 
enue-maximizing charge, p*(l), for one unit of output, then pro¬ 
ducing a second unit and determining the optimal composite charge 
for a two-unit bundle (i.e., (X:*[2] = /;*[1J + /i*[‘2]), and so on.'* If 
re.sale possibilities or pioblems of overcoming intensity reversals 

Oakland's oucpiii is die maximal quaiilitr, q, for which k pk(q) - MCi (ranking 
individuals A = 1, , . . A so lhai^,iyj cs the highcsi inverse demand). II * • pi,(q) > MC 
(for some A > 1) when q is such lhat p,iq) - MC, ouipui is expanded and so on. This is 
equivalent lo increasing output whenever the maximum revenue from the marginal 
production unit exceeds MC and, unless y is maximal where/),(^) = MC, at least the 
highest demand individual is rationed by output. A rcicree suggested lhat the equality 
of output for MPP and competition is analogous to the private gocxl result for perfect 
discrimination and perleet compeiiiioii. Phis may fie potentially misleading: .MPP is not 
perfect discrimination, and in general the MPP (ami hence competitive) output is not 
efficient. 

'* Thou^ this may be a protracted experiment and fairly costly in forgonc-revenue 
terms (particularly if fixed costs arc high), if the revenue-maximizing charge declines, 
firms may be able to minimize their losses by allowing consumption of luriner units at 
the per-unit price/>*(1) while experimenting to hnd the relevant charges for bundles of 
two or more units. Similar but more complex procedures arc required if the optimal 
charge may increase over some output ranges. 
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create insurmountable problems, then the producer will be obliged to 
set prices such thatp(^) « p{<f + dq) for all q, and the most that can be 
said is that for a weakly restricted distribution, the optimal separate 
prices are in fact uniform and therefore coincident with OUP. 

Summary of Results 

The more complex, but uniformly applied, pricing strategies con¬ 
sidered here retain two basic characteristics of optimal uniform |>er- 
unit pricing; All generally involve rationing by output, and all may 
generate a positive price-output relationship over some output 
ranges. For all the strategies, moreover, the relevant marginal reve¬ 
nue functions can be defined with little or no more difhculty than 
under OUP. Optimal All-or-Noihing pricing (OAN) is operationally 
little more complex than OUP, but there are no strong theoretical 
grounds for choosing between OAN and OUP in revenue terms. 
Selection of the Optimal Two-Part Price ( TPP) is more difficult than 
selection of the OUP (and OAN), and its revenue dominance is 
assured only for restricted (though plausible) forms of the demand 
distribution. Optimal Multipart Pricing (MPP) is revenue dominant 
over OUP, hut its dominance over OAN and TPP cannot be univer¬ 
sally guaranteed, and its position as the ’‘theoretical best" strategy can 
only be established in probabili.stic terms. The MPP output level is, for 
given cost and demand conditions, klentical to Oakland's competitive 
output, but for operational reasons MPP may l>e ol somewhat limited 
commercial applicability. 


IV. Illustrative Analysis with a Specific Distributional Form 

Both to illustrate the methodology used in the general analysis and to 
(onsider .some of the welfare projjerties of monopoh provision, we 
now analyze the results that emerge when a specific demand distri¬ 
bution is u.sed. In choosing a distributional form, account has been 
taken of three factors. First, there would be heuristic gains from 
adopting a framework which could perform an expositional role 
similar to that of linear demand curves in traditional demand analysis. 
Second, the chosen distributional form should exhibit the more fun¬ 
damental characteristics of distributions likely to be met in practice. 
Third, the framework should not be devoid of oiierational content. 
Consideration of these factors leads more or less unequivocally to the 
choice of a distribution similar to that shown in figure 2 but for which 
the surface is linear." To enable the framework to include all-or- 


*“ On this, and tor a further application of results ol>iainal)lc I'rom a linear demand 
distribution, see Burns (1979). 
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nothing charges we also assume that demand curves do not intersect 
and that income effects may be regarded as negligible. 

Within this framework, the bounding values P.Q, andN completely 
define the distribution and may be treated as (empirically determina¬ 
ble) constants throughout the analysis which follows; consideration 
of alternative pricing procedures can be simplified considerably by 
drawing on the proportionality properties of similar triangles. In this 
respect it is convenient to define 

^ = = = ( 11 ) 

where p, q, and w represent specific values of price, output, and 
numbers of individuals. The actual derivation of the required reve¬ 
nue functions and pricing rules for a range of pricing strategies has 
been presented in some detail in Burns (1979), and the summary 
information contained here in table 1 and figure 4 is sufficient for 
present purposes. 

In table 1 seven alternative strategies are defined, the associated 
revenue functions and pricing rules being shown for each of these 
strategics. The revenue functions have been pre.sented in a form that 
makes self-evident the question of relative profitability (for given 
output levels) and illustrates certain results obtained earlier, such as 
the general dominance of OUP and OAN over MUP and SAN. The.se 
functions also provide a neat illustration of two further points. First, 
and unsurprisingly, revenue (for given output) is positively correlated 
w'ith the degree of complexity of the strategy adopted. Second, 
broadly speaking, schemes of comparable complexity should be ex¬ 
pected to yield similar revenues; for example, we can see that there is 
nothing to choose between OUP and ()AN, although nonlinear dis¬ 
tributions could be constructed (but probably not empirically deter¬ 
mined) to favor either of these procedures. It is also apparent from 
the revenue functions that the returns to increased complexity of 
strategy diminish very rapidly, particularly at lower output levels. 

Figure 4 is fundamental in illu.strating the basic methodology sug¬ 
gested in this paper. Entrepreneurial decision making concerning 
output and price determination relies heavily on information about 
the marginal revenue function of a chosen strategy and, of course, on 
the relationship of such a function to (marginal) cost behavior; figure 
4 contains the derived marginal revenue functions for each strategy. 
Also shown is a schedule indicating the sum of marginal valuations at 
different output levels: the intersection of this schedule with a margi¬ 
nal cost function would determine the optimally efficient output. 
Assuming, initially, constant marginal costs, the results for a linear 
distribution offer a clear warning that, over a substantial range of 
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(higher) marginal cost values, ilicrc may l)e negligihic differences in 
the outputs acfiieved hy (filfereiit protcclures of varying complexity 
and that the outputs attained by even the most complex prcKedures 
may be considerably less than the optimally efficient output. On the 
other hand, at lower levels of marginal cost the difference in output 
levels becomes more pronounced, indicative of the fact that more 
complex prcKcdures achieve their maximum possible revenues at 
higher output levels, but in general all procedures mote nearly ap¬ 
proach the optimally efhcient output level, especially the more 
sophisticated ones. Finally, noting that the monotonicity of MR(^)/ 
MPP is guaranteed while that of other marginal revenue f unctions is 
plausible,^' a situation of cfecreasing marginal costs will increase the 


K.g., Hitliough ,MR(y)/OUP may soinctiinrs inrreaie with output, ii» general trend 
must be downward (.see .Section II, “.Some (ieneral Properties of Revenue Functions 
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divergence in output levels, while increasing marginal costs would 
have the converse effect. 

In conclusion, we offer a cautionary remark about comparisons 
between privately achieved and optimally efficient output levels. Our 
results indicate (not surprisingly) that suboptimal output levels can be 
expected for all of the private procedures considered. While these 
results are relevant to the concern shown by a number of authors who 
have made such comparisons in other contexts (and have sometimes 
claimed that efficient outputs might be secured privately), it is clear 
that compari.sons of output levels alone are of little interest. In the 
case of the linear distribution, outputs under OUP and OAN would 
lie identical, but the efficiency implications of the two procedures are 
significantly different. Indeed, one of the insights generated by the 
approach advanced in this paper is that the replacement of a uniform 
per-unit pricing strategy with a (comparably complex) all-or-nothing 
arrangement will succeed only in substituting the consumption of 
lower-valued units of output by higher-demand individuals for the 
consumption of higher-valued units of tiutpui by lower-demand indi¬ 
viduals. In general, efficiency compari.sons must be based on the 
degree of consumption rationing (whether by price or by output) that 
occurs for all consumers vis-a-vis the completely efficient outcome in 
which all individuals would consume all units produced and output 
would be such that the sum of marginal valuations equals marginal 
cost. 


V. Concluding Remarks 

A major contribution of this paper is its demonstration that the 
complete deman<l distribution provides a novel and pow'erful basis 
lor analyzing private provision of price-excludable public (or joint) 
goods. For a monopoly producer of such goods we have identified, 
under surprisingly general conditunis, the characteristics of the reve¬ 
nue functions ass(x;iated with various pricing strategies, indicated the 
likely relative profitability of tho.se .strategies, and pic.scrified prtx^e- 
dures which might aid firms in isolating profit-maximizing price and 
output for any chosen strategy. Moreover, the specific example we 
provide, based on a linear ciemand distribution, not only serves to 
illustrate our general results, but also in practice mav constitute an 
important empirical basis for generating first approximations for 
outpu' and price levels. 


under OUP"). Moreover, given MPP as the ihcotctiral txrst strategy, MR(y )/MPP wtsuld 
lie the Ixiundary form toward which wher MR fiinclions tenti (as complexity of the 
pricing strategy >s increased), and MR(^)/MPP is unatnhigunusly negatively slojsed. 



igo JOURNAL OF POLITICAL ECONOMY 

Using this linear demand distribution, we have also sketched out 
some efficiency implications of our results, and, in this regard, an¬ 
other important application of our general approach is highlighted. 
For joint goods, overall efficiency depends on how output is “ra¬ 
tioned” among consumers as much as on the level of output per se. 
A framework containing information on the distribution of demands 
clearly is indispensable to a comprehensive analysis of the efficiency of 
private (or, indeed, public) provision of joint goods. In fact, our 
framework could be used profitably as a basis for a general analysis of 
Joint goods provision under compietitive conditions and other market 
or nonmarket structures. Moreover, the demand distribution ap¬ 
proach is equally essential to analysis of “discriminatory” pricing 
strategies for private goods, the necessary extension of the analysis 
presented here being particularly straightforward when constant 
marginal costs prevail. 
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Comments 


Recalculating the Scientific Tariff 

Bruce R. Boltiick 

Han'dffl /ot lnl/*nuJtunuil Develitpmmt 


In pan ‘2 of his (lassie 1960 papcT, “ I hc (lost of Protection and the 
Sdentiht Tariff." Harry (i. Johnson analyzed the second-best' prob¬ 
lem of using tariffs to promote “nontconomir" objectives commonly 
suggested as arguments for protection. Very simply. Johnson showed 
how to compute the balance between marginal Ixmefits and marginal 
costs using llarberger-ty|x* welfare triangles. He illu.siraied the meth¬ 
odology foi five selei ted noneconomic objectives: (a) a tariff to pro¬ 
mote self-sufficiency and independence; (/>) a tariff to promote di¬ 
versification. industrialization, or agriculturaliz.ation; (c) a tariff to 
promote a "way of life": (d) a tariff to increase military preparedness: 
and O') a bargaining tariff. 

Johnson assumed that world prices of importables are given, that 
domestic production of importables is positive before and after im¬ 
position of the tarif f , and that cross-elasticitv effects can lx neglected." 
Using linear demand and supply curves and free-trade values for 
market parameters, jolrnson computed the marginal cost of an ad 
valorem tariff (/) ttj be precisely 

MC = (Ft + fCr}. ( 1 ) 

where/-’ and U are the free-trade levels <rf ciomestic production and 
consumption, and c and t) are the compensated owtt-prke elasticities 

1 would like to cxpre.vi my great apprrilatioit lo Hiofcs.sor lalwiard lower for 
invalualrle discussions and suggest ions. 

' Slrictly speaking, the use of larifls tor such purposes is nol necessarily sretmd best. It 
may Ik- third Ivst it the "noneconoinK ” objective implies a divergence in factoi rnarkecs; 
or ii may be first best if a trade divergence Ls involved (sec t'-orden 1974, chap. 2). 

* Units of output are assumed to be defined so that the world price is unity This 
a-ssumpiion. along with the others listed in the text, serves to simplify computations. 

\Jounuil of PotUitai Erorurmy, I9A1, vol 09, ntf I] 
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of supply and demand. (I have dropped Johnson’s subscript t, indi¬ 
cating parameters for good *, in order to simplify the notation.) 

Given any specific tariff-related prdicy objective, the marginal 
benefits (MB) can be computed as a function of / and the parameters 
P, C, e, and ij. Johnson’s methodology then involves setting the 
ratio MCIMB (call thisZ) equal to unity and inferring the characteris¬ 
tics of the scientific tariff structure from the re.sulting expression.^ 
However, the condition Z = 1 is neither necessary nor sufficient for 
determining the scientific tariff unless the prohibitive tariff (which 
satisfies Z = 1 only by accident) can somehow be ruled f)ut of con¬ 
sideration as the second-best optimum. Furthermore, even excluding 
the prohibitive tariff from consideration, Z = 1 is sufficient only when 
it has a uniejue root in /, which will be true only for a special class of 
policy objectives (which define MB). Thus the scientific tariff can be 
inferred from the first-order conditions in only a limited set of 
circumstances—even if one adheres to the simplifying assumptions 
underlying Johnson’s analysis. 

These problems can be illustiated using Johnson s own case of 
"self-sufficiency,” which he chose to measure by “the proportion of 
consumptioti supplied from domestic production. ” This gives 

MB = + r,){\ - tr))-K (2) 


where fi is the pcjlicy weight attached to marginal units of the objec¬ 
tive. in numeraire units.^ 1 hus 


Z = 


MC 

MB 




V 

€ + 


-)(1 - / T ,)^ 


(3) 


which clearly is cubic in I and can therefore generate multiple roots* 
for the equation Z = 1. Note tcM> that the prohibitive tariff (for which 
= P) is at the level /„ satisfying /„ = (C - P)l(Cr\ + Pi). 

Figure 1 shows how MB and MC vary with t. Here MB(B\) 
maiginal fx’nefit fine for a low value of /3, in which case the scientific 
tariff (/„ where Z = 1) is unambiguously preferred to the prohibitive 
tariff (/„). For some policy weight ^ 2 , MB{Bf) will satisfy the property 


' S|>c( itically, from the total cliffcrriitial ol Z oiir can determine- the dl neiessarv to 
ollset dX (X = e, tj, P, or f.'), thereby characleri/iiig the diflerent es in / as the market 
parameter.s vary across goods or over lime. 

* I have introduced the parameter fi into Johnson's miMJel to make Affl commensu¬ 
rate with MC, in order to clarity the discussion. This p.<rameier is assumed 10 be 
(onsiant, purely as a simplifying devKe 

‘ John.son, in his (1960) article, avoided confronting this problem by using a linear 
approximation to (S) in his analysis. This devKe is [lermissiUe if and only if one can 
presume that the scientific tariff will be small, but that ptesumption cannot generally be 
valid since the whole point of the exercise is to/ind the level for ihe scienuhr tariff. 
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that ihe solid area equals the slashed area, so the policymaker is 
indifferent between a tariff of t,^ (where Z - 1 given C, P, e, and tj) 
and one which is prohibitive (3= <^). For any > /Sz a prohibitive tariff 
will always be preferred. Thus a small increase in /3 may cause a finite 
jump in the scientific tariff.* 

Similarly, a small change in any of the market parameters may 
cause a finite jump in the scientific tariff (by altering Af/(,A4C, and /p). 
For example, beginning with MC and MB(^ 2 ) as shown in figure 1, it 
IS clear from equation (3) that an increase in e will shift MB up relative 
to MC, causing the scientific tariff to jump from t,^ to tp (which it.self is 
reduced in magnitude since dtp/rfe < 0). The possibility of such discon¬ 
tinuities is completely absent from Johnson’s well-behaved but inap¬ 
propriate formula. Furthermore, any further increase in e will reduce 
the scientific (i.e., prohibitive) tariff, contrary to Johnson’s conclusion 
that “the implied scientific tariff structure entails tariff rates which are 
higher the higher the elasticity of supply . . .’’ (p. 348). 

In short, John.son’s methodology applies only to a limited set of 
circum.stances; more generally, the first-order conditions are neither 
necessary nor sufficient for identifying the “scientific tariff." 

* For example, if/S = 1, C = 10, f = 1, and ■»> = €= I, then t, = .82 while < .33 
(since must be to the left of the peak of the cubic 7. function, which occurs at 1/3 t)). 
Thus a slight change in the ptanner's preference for protection, or any of the market 
parameters, could conceivably cause the scientific tariff to leap from below 33 percent 
up to 82 percent. 
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Moreover, the structure of protection implied by the correct calcula¬ 
tion may differ dramatically from that implied by Johnson’s proce¬ 
dure. 
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Discount Rate and Wealth 


G. S. Laiimas 

lllinim Suite i'niveisity 


III a receni issiif ol this fournal, Mohabbat and Siinos (1978) provided 
esiimates of the rate of diseouni using Kendriek's (1976) series of total 
private wealth. Since Kendrick also provides estimates of the division 
ol total wealth In-tween human, 14’/,, and nonhuman, wealth, it 
should be of interest to investigate how the discount rales differ 
heiween the two coinponerils of wealth. 

In order to accomplish this, it is lU'ces.sary to divide the total income 
into two pans—one corre.s[)oiiding to the human wealth, f/,, and the 
other to the nonhuman wealth, Kendrick (1976, p. 22) and Chris¬ 
tensen and Jorgenson (197;4) provide an outline of the method for 
sue li a division ol the total income. Following tfiem, the time series for 
]\ and r„ were estimated, '{'he estimated series closely apptoximate 
ilie estimates for "lalior compensation" and “propen ty compensation" 
given by Kendrick and Christensen and Jorgenson. 

.-\s suggested in Mohabbat and Simos, the lime-senes estimates of 
the rate ol discount fot If'/, and I4'„ were computed using the varying 
(larameieT regression lechiiicjuc. I hese series are given in table 1. 

1 fie obvious inference that can tie drawn from the results given in 
table 1 is that the discount rate on If'/, is consistently higher than the 
rate on ir„. 1 his evidence is iriifrortant by it.self. In the ab.sencc ol 
direct empirical estimates, as provided here, economists had to make 
assumptions about the behavior of the two rates (see Pesek and .Saving 
1967). 

Additionally, since the major portion of W,, is education, the results 
provide further evidence on what Welch has called “one of the most 
important phenomena of our lime . . , that the rates of return to 
investments in schooling have failed to decline under the pressure of 
rapidly rising average educational levels” (Welch 1970, p. .64). 'The 
existence of a relatively higher rate on ff^/, confirms the conclusions 
about the behavior of the rale of return on education arrived at by 
Griliches (1970, 1977), Welch (1970). Becker (197.6), and others. 

\Juurnal of PoUtual vol 89 ik> 1] 

^ Iht l.nivrrMiy of Chudgo *f0 
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TABLE I 

Timf.-Series Eshmates or the Discount Rate on Human 
AND Nonmuman Wealth, United State;s, 1929 to 1969 


Year 

Discount Rate on 

Human Wealth 

DiMounl Rale on 
Nnnhuman Wealth 

1969 

.15 

.08 

1968 

.15 

.08 

1967 

.1.5 

.09 

1966 

.15 

.09 

1965 

.15 

09 

1964 

15 

.09 

1963 

.15 

.08 

1962 

16 

08 

1961 

16 

08 

1960 

.16 

08 

1959 

16 

.08 

19.58 

.16 

08 

1957 

17 

08 

19.56 

17 

08 

1955 

17 

.08 

1954 

17 

08 

19.53 

17 

08 

1952 

.17 

08 

1951 

.17 

08 

1950 

17 

08 

1949 

17 

08 

1948 

17 

08 

1947 

17 

08 

1946 

18 

08 

1945 

20 

.08 

1944 

.21 

08 

1943 

.21 

08 

1942 

20 

.08 

1941 

18 

.07 

1940 

16 

08 

1939 

16 

.08 

1938 

.16 

07 

1937 

16 

.07 

1936 

16 

08 

1935 

14 

07 

1934 

14 

06 

1933 

13 

.06 

1932 

.13 

.05 

1931 

14 

07 

1930 

.15 

08 

1929 

16 

.09 
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Money, Inflation, and the Bank of Canada: An Analysts of Canadian Monetary Policy 

from 1970 to Early 1975. By Thomas J. Courchf.ne. 

Montreal; C. D. Howe Research Institute, 1976. Pp. 290. 1.5.00. 

For 12 years, from 1950 to 1962, while other industrialized countries were 
operating a hxed exchange-rate regime under the Breiion Woods par value 
system, Canada was a maverick floater that provided economists with a labo¬ 
ratory in which the practical problems of implementing monetary and fiscal 
policy under a regime of flexible exchange rates could be observed and 
analyzed. Prior to the advent of greater exchange-rate flexibility among the 
developed countries in the early 1970s, the Canadian experience was fre¬ 
quently cited as an example of the degree to which, through the judicious use 
of monetary policy, a flexible exchange rate could be kept stable with a 
minimum of official intervention in the foreign exchange market. It is no 
coincidence that two Canadians, Robert A. Mundell and Harry G. Johnson, 
were major contributors to the early literature on the appropnate use of 
monetary and fiscal policy under alternative exchange-rate regimes. 

During the late 195()s and early 196()s. debate on the relative merits of hxed 
versus floating exchange rates was waged within Canada not as an abstract 
intellectual exercise, but as a ptjlicy issue of immediate practical interest to all 
Canadians, ('.anada returned to the fold of hxed exchange-rate countries in 
1962 and remained there for 8 years. W'hen she again chose to break from the 
par value system on June 1. 1970 she briefly appeared once more as tbe odd 
man out. By 1974, however, the accumulating pressures of widely divergent 
inflation rates among countries, continuing U.S. balance of payments dehciis, 
and rapid increases in the prices <tf basic ctimmtidities and energy had led to 
the collapse of the par value sy stem, leaving the larger industrial countries 
with little choice but to opt for the present system of “managed Boating" 
exchange rales. In this context, the experience of the Bank of f^nada in 
conducting monetary policy under alternative exchange-rate regimes is of 
considerable importance to students of monetary economics and policymak¬ 
ers alike. It is hardly surprising that, despite its relative youth (it was estab¬ 
lished in 1934), the Bank of Canada has been the subject not only of detailed 
research by academic economists, but also of lively and sometimes acrimoni¬ 
ous public debate. 

Thomas J. Ojurchene's careful study of the Bank of Canada during the 
cruaal and fascinating period 1970-75 is a welcome addition to a substantial 
literature which includes works by Neufeld (1955), Johnson and Winder 


L/MtffM/ ^ Fotaical Etmwmy. 1981. vol 89. no U 
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(1962), Reubci (1964). the Royal Commission on Banking and Finance 
(1964), and, perhaps the best known, Paul Wonnacott's study The Canadian 
DoUai, 1948-1962 (1965). I.ike these earlier works, Courchene’s book at¬ 
tempts to analyze and explain the policies of the Bank of Canada within the 
context of standard concepts of macroeconomics and monetary theory. But 
Coitrchene’s book is far from being a mere update of earlier research. His 
work differs in at least two ways. First, he seeks to explain his views not only to 
other econcmiists but also to interested laymen, a dc.sirc which is the natural 
reflection of the position occupied by the Bank of Canada at center stage in 
the policymaking theater. .Second, he approaches the analysis of monetary 
policy from an avowedly “monetarist” perspective. These considerations gov¬ 
ern the stnictiire of his work. 

The hook is divided into tour main sections. Part 1 is devoted to an 
elementarv elaboration of seveial important aspects of the monetarist theory 
of a small open economy under fixed and flexible exchange rates. Coiirchcne 
emphasizes the distinction between nominal and real stocks of money and 
desenhes the diffeicni forces that act to equate money demand and money 
supply under alternative exchange-rale regimes. He then goes on to describe 
the concept of the natural rate of unemployment and spells out its implica¬ 
tions for what inonetarv polic \ can and cannot do. Part 1 is completed by a 
brief description of alternative approaches to monetary theory, paiticulurly 
the neo-Keyncsian approach 

Part 2 of the hook treats the theory of central hanking in general and the 
Bank of Canada's ap|iroach in particular. Caiurchene first considers the vari¬ 
ous indicators of monetary siringenev or ease ihiit were u,sc*d in Canada 
during the 1970-75 period—nominal interest tales, the licjuid itsset ratio of 
commercial banks. Ml growth, etc. He argues that (he Bank of Canada's 
choice of policy instnimciiis during the period was a logical consequence ol its 
theory of iuiw the Canadian financial system operated and pariicularly ol the 
Idle of market imperlecticins in allowing the authorities to inHiience both “the 
cost of creilit' and "credit conditions" to some extent independently. 
Throughout, Courchene has c hosen the technique of compating actual Bank 
ofOanada policies with what would have been ideal fiom the point of view of 
monetarist theory. Such an approach is. of course, consistent with his objec¬ 
tive, hut it docs intcilve comprornises. Monetary economists of a neo- 
Keynesian bent will not be entirely satisfied with Colilchene's rather .spar.se 
treatment of the inqilualions ol the Keynesian model lor monelaiv policy. 
Courchene righlly argues that his puijxise is not to create a straw man out of 
the neo-Key nesian approac h, but it is still left to the reader to decide whether 
jxilicies that do not appear optimal when viewed from a striit motietanst 
perspective are more rea.sonable from ihe [mint of view of the mcKlel that the 
Canadian monetary autfiorities had in mind. 

Fortunately, this question turns out to be somewhat beside the point, lor 
the real meal of Courchene's book is to be found in pans .“i and 4, which 
present a very detailed description and analysis of Canadian monetary |xilicy 
during 1970-75. After an initial chapter covering the whole period, Cour- 
chene devotes a chapter to develczpments during each year. This method of 
organization, while ideal for a reference work, makes for rather heavy going 
when the hook is being read straight through because,- it results in excessive 
repetition. Nevertheless, Courchene’s achievement in these sections is impres¬ 
sive. He manages to present a careful and incisive analysis that integrates a 
great deal of statistical material on the economy's performance with the 
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iiuihoritie.s' own public explanations of their policies, drawn main!)' but not 
exclusively from Bank of Canada publications. As a result, the comparison of 
actual policy with the monetarist paradif^m becomes less important than the 
detailed description of events. 

Courchene's thesis is that the floating of the Canadian dollar in June 1970 
was caused by the Bank of Canada’s attempt to pursue a less inflationary 
policy than the rest of the world, a policy that could only succeed if the 
exchange rate was flexible. But Courchene argues that “when the rate floated 
and Canada was finally allowed the independence to pursue its own policies 
with respect to the behavior of prices, the concern over inflation was immedi¬ 
ately jettisoned in favor of ensuring that the exchange rate was set at an 
‘appropriate’ level” (p. 161). This change in emphasis, in Courchene’s view, 
ushered in a period of excessive monetary expansion that persisted from 
1971 until 197.5. All this is supported by the marshaling of an impressive 
array of facts and figures. One may choose to disagree with Courchene’s views 
about the monetary authorities' motives and intentions, but it is not easy to 
dispute the broad outlines of his argument. Along the way, the reader is 
treated to mimcroiis interesting details; the difficulties experienced by the 
authorities in using indicators and instruments based on the asset side of 
commercial liatik balance sheets; the subtle web of pioblems arising out of the 
fecferal governnieni's cash deheit and the use of Cianada Savings Boncf issues 
to finatice its needs; and reliance on the transfer of government deposits, 
rather than statidard open-market operations, to alter the level of bank 
reserves. 

On the basis of the evidence, few would dispute Courchene’s conclusion 
that rapid rates of monetaiv expansion fueled inflation in Canada in 
1971-75. But this ( riiu ism is somewhat tempered when the Bank of Canada’s 
policies are compared with the actital track record of central banks in other 
countries during the same period rather than with the monetarist paradigm. 
Kurthermore. by late 1975 the Bank of Canada, like several othei central 
banks, had amiouncc*cl its intention to plate greater emphasis on cpiantitative 
nionetarv targets, a fact recogiii/ed by Courtheiie in a postscript apparetiily 
inserietl as his bcKik went to press. Whatever the importance of these shifts in 
the thrust of monetary [kiIicv, Courchene s bcark will iindoubledlv Irecome a 
standard reference work for the 1970-7.5 period, and the thoroughness with 
which he has aiialy/ed monetary (nilitv in Canada will serve as a firm basis for 
the continuing debate on this subject. 

MAi.toi..vi Kmc.ht 

tiilnrialinuri/ Mmirtary hind 
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.4 Life jar Sound Money: Pet Jacob.t.wn. Hu Biography. By Erin E. jAf; 0 BSS 0 N. 
Oxiord; Clarendon Press, 1979. Pp. xxv + 428. £16. 

Erin )acobs.son has written a lively and readable life of her father, covering the 
manv facets of a remarkable career. She did not lack for material; 6 meters of 
corres[M)ndetKe; a diary < overing over 50 years in such detail that for the later 
years ii would hll .several volumes; .some 500 articles; innumerable .speeches 
and reiiorts, including the greater part of the Bank for International Settle¬ 
ments (HIS) Annual Reports for 25 years. This is drawn upon to paint a 
picture of the man and his many activities over the years, along with a 
necessarib brief account of the background to some of those activities. A 
(onduding tha|)tei deals with his economic thought. 

From the time he joined the League of Nations Secretarial in London in the 
spring of 1920, until his death in office as managing director of the IMF in 
196.1. Per Jacobs.son was almo.st continuously in the service of a key interna¬ 
tional orgatii/ation. ‘1 here was a break of 2!4 years in 1929-11 when he 
returned to .Sweden Theie fttr a time he was economic adviser to Ivar 
Kreuger, die .Match King, not long before the latter's suicide and the .sensa¬ 
tional collapse of Kreuger and Toll. But even in those years he spent mtiih of 
his time abroad on iniernatioiial business. Wherever he went as a wandering 
economit adviser he seemed to be in the thick of the action, with an exiraor- 
clinary knack of positioning himself where he could best observe, analyze, and 
mfltierice events. His influence was felt over nearly half a century, not only 
within the miernanonal oiganizaiions for which he worked, but among those 
responsible for economic and financial policy throughout the world. 

What was ifie secret of his influence as an economic adviser? Part of the 
answer lies ni his per.sonaliiv. Always a little larger than life, he was the rno.st 
clubbable of men, full of good stories, parables, anti telling fads, persuasive 
and witty, confident m simple remedies and in his power to expountl them, 
the world's onisianding salesman of sound finance. A man .so entertaining 
and heart warming was sure of a hearing. Added to that was an extremely 
well-stored mind- he icKik endless pains to master the factual background to 
financial problems, to know what was going on and put together whal he 
could learn from others about past experience and current developments. It 
was not for nothing that he built up an excellent daily summary of hnancial 
news at the BLS and immersed himself .so deeply in the preparation of its 
annual reports. When he had to suggest remedies he could start from a 
thorough diagno.sis and take full account of special circumstances without 
limiting himself to purely general propositions. 

His approach was neither that of the economic forecaster nor that of the 
pure theorist. His concern was to make sure that the right policy instruments, 
especially monetary irntruments, were brought into use and that they oper¬ 
ated in the right direction. Thus he was no enthusiast for fine tuning and was 
more given to laying emphasis on keeping costs under control than on the 
need for demand management. He never put together any extensive analysis 
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of the impact of monetary policy on output and prices but took for granted 
that it was at once the most powerful and most delicate of instruments 
available for maintaining internal and external balance. 

His use of economic theory was almost always at a comparatively elemen¬ 
tary level. But he could put his finger on the key issues and had a deep 
intuitive understanding of the complex interactions that govern economic 
development. He had courage, common sense, and conviction, so that he did 
not hesitate to repeat unpopular doctrine until it won acceptance. Above all, 
he had a very good track record. He had foreseen, well in advance, the fall in 
long-term interest rates in the thirties. He was never much impressed by the 
prediction of a prolonged dollar shortage and was one of the first u> stress the 
growth in dollar liabilities. He also realized very early—certainly by 1942— 
that the postwar problem was going to be one of coping with inflation rather 
than with unemployment. The emphasis of what he had to say moved in¬ 
creasingly, therefore, with the temper of the times. 

The danger that he foresaw was that cheap money and deficit spending 
would continue in circumstances to which they were quite inappropriate. 
They would issue inevitably in excess demand, external deficits, and appeals 
for aid from abroad when policies designed to secure “internal balance" 
would make such aid superfluous. Such policies should be designed to restore 
monetary discipline, and it was on the monetary component that he kept 
harping until discount rates were increased in the early fifties. The message 
that he sought to emphasize was the primacy of monetary policy in maintain¬ 
ing price stability' and the need simultaneously to limit public spending and 
preserve budgetary balance, if not year by year at least over a period of years, 
'I'his was the point at which tic t(K)k issue with Keynesian thinking, and it was 
also at the rout of his popularity with central bankers 

It hapixms also to be the point at which he and I parted company. He had 
no use for the Radcliffe Cominiitee RejKiti of which I was a signatory. He 
found convincing evidence of the importame of credit control in Germany's 
experience in 19.50-51, when an incrca.se in the discount rate from 4 to 6 
fiercent was followed by a gradual but pronounced swing ol the (ierman 
balance of payments into surplus. But tins still seems to me a special case in 
which, for obvious reasons, stock building ol imported materials had been 
carried to excess, and it was only a question of time liefore the steep upward 
trend in German exports predominated over speculative excesses. In general, 
the early postwar peruxl was dominated by the logistical difficulties involved 
in regaining prewar levels of production and trade, and cheap money was not 
a universally important handicap in the restoration of internal balance. 

However all that may be. there was no doubt about Jacobsson's priorities. 
He put price stability first; and since he shared neither Keynes's doubts about 
the self-regulating character of the economy nor Keynes's optimism about 
managing the level of activity with success, he was content to let employment 
IcKzk after itself, Iti some respects, therefore, he came close to the monetarist 
position. But he disagreed .strongly with Milton Friedman on monetary man¬ 
agement, insisting on the need for flexibility. He was also insistent on the need 
for what he called “a balance between prices and costs." It is not altogether 
clear what he meant by this, since one might suppose that the relationship 
between the two is governed by the state of demand. It would seem that what 
he was after was the danger that profit margins might be compressed by 
excessive wage claims or misplaced tax policies and the need to take account 
of cost and supply factors before deciding on the timing or scale of any boost 
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to demand. He envisaged llie use by governments of measures designed to 
influente costs direitly, including what has come to be called “incomes jxtl- 
icy,” so that here again he was by no means in agreement with present-day 
monetarists. 

Although he was doubtltil whether the iiiHiience of Keynes had been on 
balance healthy, his economic philosophy had much in common with 
Keynes's. He was, like Keynes, a believer in managemeitt; both the manage¬ 
ment of ntonev atid the mattagement of the economic system through (inan- 
tial mechanisms. He was no Iteliever in controls; hut, like Keynes, he saw the 
need to find a way of stahili/ing the wage level with the agreement of all 
parties concerned. Although he disliked deficit spending and public works, he 
was not prepared to condemn them in all circumsiames. Kven more than 
Keynes, he was c-ssentially a nineteenth-century liberal, again.st nalionali/.ation 
and government suhsidic's. m favor of a market economy and free trade, 
championing the small against the large, private enterprise against govern¬ 
ment control, 

I'he ideas lor which Jacobsson conietidecl gainc'd increasingly m inHuence 
over the postwar wot Id. ^'e^ he- nevcT succ'c-eded in committing them sy.stem- 
alicallv to paper m the form ol a treatise. Tins he regreiled deeply. His 
tonipulsive need to talk—"the daily spreading of the substance ol my 
gifts '—took precedeme over the nec-d to con.sinict a theoretical .scalfolding 
lot Ins ideas. But there is no doubt that he made the right choice and that his 
(cute lay m economic policymaking rather than m theoretical virtuosity. He 
left behind in his diary and his writings an ample lecotd of the events in 
which he tcxik part and ol the ideas governing his a|)praath to them. 

■Aik Cairncross 


Knrii‘'> Minitifiiy '/Vna/g/if A Sliiih «/ Il\ Drvelolxnrnt. By Don 1’aiink.in. 
Dui'ham, SC.- Duke f'mversiis I’ress. 11176. I'p. D'i.'i $yt,7:’>. 

It IS a trulv ama/iiig moi \. He was a leiiowiic-d leac hc-r, scholai, and writer, an 
inHiieiiiial commeiiiatoi on world tiflaiis, an ex|>eri .ilways consulted by his 
government, an intellectual aenve m partisan politics, a successful opcralot in 
husmess and (iiiaiu e, and much more. At the- age ol 47, he had jnsl published 
hiswicigiucw a iwo-volcimetreatisesetting loilli his ntalnreconcinsionscin 

the entire range of his scientific expcitise. 1 he cageily awaited work excitc-d 
great interest m his jirofc-ssion, but coiisiderahle criticism too. TTte author 
himself was dissatisfied. Lveii fK-fcne the ink was ctry, he was (hanging his 
mind and working on the secpiel. T his task bc-camc- a C()r[x)rate enterprise, 
engaging not only the critic al wisdom of his poet s bnl the loyal etithiisiasin of 
one of the most reniarkahle (oncentratioiis ol talented young .scholars in 
academic histcjiy. Seminars and lecluies cjuickly turned Iron) glosses on the 
completed work to draft chapters of the new. Over .5 years ol constant 
conversaiicvn and correspondence, the master, his young disciples, and 
selected critics, friendly and not so fnentlly, argued out the new dextrines. He 
and his disciples never doubted they were at last figuring out how the world 
really works, solving puzzles that had stumped iheii prcclecessors for a cen¬ 
tury. They did not rely on experimental finclings, statistical inference, fiistori- 
cal study, or mathematical inncjvaiion, Ixit on sheer logic and insight. Their 
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parochial confidence that (ruths worth knowing arose inside a circle of a 
40-mile radius would have been ridiculous arrogance had it not been so 
nearly justified. Finally, the new book appeared and wrought the revolution 
its makers anticipated, not only commanding the attention of the profession 
worldwide for decades to come, but profoundly influencing the policies and 
politics of nations. 

The two IxKiks were, of course, Keynes's Trralt\e on Monty (19,S0) ancf his 
General Theory (1936). The publication of The C.olleeud WrtUngt, of John Maynard 
Keynes by the Royal Economic Society makes available many details of the 
story, and more are acces.sible in unpublished documents among the Keynes 
Papers in the Marshall Library at Ciambridge. It is the gcMxi fortune of the 
profession that a great monetary theorist. Don Patiiikin, has undertaken 
active research in the history of monetary theory and in partic ular on Keynes. 
Ill the brief volume under review be traces Keynes's monetary and mac¬ 
roeconomic thought through his three major works, beginning with the 'Iran 
on Monelary Kelorm (1923), .stressing the story sketched above of the two later 
Ixioks. Keynes's multiple roles in .science and public life make for an engross¬ 
ing case study of the interactions of theoretic al developments with the events, 
problems, and [Kilic y conlroversic-s of the times. T he intellectual and jsersonal 
Ielaiionships of Keynes to his older colleagues (Pigou. Roliertson, Hawtres, 
llayek), to his young disciples at Cambridge' (Kahn, Joan Rcjbinson, .Austin 
Robinson, -Sraffa) and at (Kford (Harrod, Meade), and to others (including 
Kaldor, Leriier, Myrdal, Ohliii, Hicks) are a fascinating episode in ihe scxwl- 
ogy of scholarshij). Patinkin's concise narratives whet the appetite, and he 
refers tis to wotks by Winch, Moggridge. and Howson for lull meals. 

Here Patinkin's main emphasis is on theory. He c learly explains and com¬ 
pares pioposiiions of the thiee iKioks and related articles, and he seeks 
reasons lot the changes. Most leaders will Ik* particularly interested m Patin¬ 
kin's views of what Keynes "really meant" in the (ienerol Theory With admira¬ 
ble restraint, Patiiikin reftains from imeriireting 01 appraising Keynes from 
the hinclsighl of today's theory and knowledge, even his own. 

T he 7'mri Patiiikin finds fascinaiing on the national and international 
problems and j-Kilicies ol the era. but m monetary economics he finds it a 
roiitme recital of the (iambricige ceisioii of the (|uan(iiv theory. T he Tiratise 
he finds dull and mechanic al —".1 Kevnes out of c harac ter. a Kevnes attempt¬ 
ing to act (he Idle ol a Professoi, and a (iermaiiic one al lhai" (p. 24l. T he 
famous fundamental Equations are liiiidameni.ilb ffawed. and the ob|ec(iye 
of explaining output lliictuations is lioiind to elude a theory that is designed 
Id explain prices and profits lor given output but kicks ec|ualions for ouiput 
ileteimmation. It is this gap that led Keynes to the theory of eflecti\e de¬ 
mand, m Patinkin's view (be ccniral insight ol the GenernI 'Theory, for the 
General Theory and for Keynes's lifetime contribution Patinkin is a great 
enthusiast. In these days, when Kc'ynes's standing with the professKin and 
yy ith the general public is cyclically low, when the man's writings and influenc e 
are blamed for all the world's ills, siucfc'iils and young c'conomists will gain 
valuable (lerspc'clive fiom Patinkin's licKik. 

On the mam punts of interpretation I agic-e with Patinkin. (I) The preten¬ 
sion to establish an equilibrium witb involimtaiy unemployment, which puts 
off so many c-conomisis instilled with neoclassical instincts, should not lx* 
taken seriously. Keynes is leally describing an economy in di.si‘i)uilibrium, or 
iri a .succession of constrained teinpu ary c-quilibria, using comparative statics 
as (he analytical language ol the day. His strong conviction and his true 
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message arc that the automatic self-righting mechanisms of whole economies 
are slow, weak, and unreliable. Excess supply can persist for long periods, 
whether one calls the situation equilibrium or not. (2) Keynes did not assume 
either money wage rigidity or money illusion. He did contend that wages 
would adjust downward slowly—more slowly than prices could move upward. 
And he was skeptical of the employment effects of economy-wide wage 
reduction. As Patinkin say.s, he regarded wage and price deflation as the 
equivalent of monetary expansion. Neither might be capable of lowering 
interest rates enough to revive investment in depressed times. (3) Keynes 
emphasizes non]xjlicy disturbances to the economy, real as well as monetary. 
The "state of long-term expectation” by entrepreneurs and investors is a 
major determinant of effective demand. Since these expectations relate to 
essentially unknowable and nonprobabilistic future events, including the ex- 
[lectations of future entrepreneurs and investors, they are arbitrary and 
volatile. Keyne.s also stresses, as in the Trrattst, the importance of differences 
of view lietween savers and investors, or bulls and bears, or lenders and 
borrowers. (4) The Hicksian fS/LM apparatus captures beautifully the 
general equilibrium character of Keynes’s vision, but its conci.se formalism 
does not do justice to the above insights and others. (In early drafts of the 
book and in lectures Keynes wrote symbols for expectations or “slate of the 
news” as arguments of hmction.s, but they were not in tlte book for Hicks to 
find.) 

My differences frtiin Patinkiu's observations are minor. (1) He describes (p. 
811 the financial side of the (ieneral Theory as a three-assei model (money, 
hoiitls, equities), in contra.st lo the two-as.sei model (money, equities) of the 
IrenU^e. Thi.s .seems to be stretching things, li seem.s lo me that the Grnnal 
Thi’oiy really has only two assets (money, everything else). Anyway, there is 
only one endogenous interest rale, that to whieh the estimated marginal 
efficiency of (.ipiial i.s if) Im- equated by variation of investment—a .stock-flow 
fonfnsion, as Leiner observetl early on. In treating the porilolio choite 
between etpniics and fixed-money-value assets and in tonsidering the way the 
maikei brings logelher "hulls" and “bears,” 1 think the Treatise is supcrioi, 
dealing with inoie mlercsiing and important matters than speculation about 
Ixincl interest rales. (2) Patinkin notes with favor (p. 36) that the Treatise 
abandoned the quaniity-thetiry presumption, folittwed in the Tract, that mon¬ 
etary mea.snres first alter the quantity of money and only then, via this 
medium, alfccl spending, profil.s, prices, and output. In the Treatise central 
bank operations change interest rates and thereby affect all the variables, 
inchidtng the tuianlity of money, siniuliancously. The General Theory appears 
lo be a .step backward in this respect. (3) Patinkin allows himself to wonder (p. 

110) about Keynes's omission of wealth, in distinction from capital gains, as a 
determinant of consumption. Keynes in fact does recognize life-cycle effects, 
and he would pioperlv not wish to count planned or expected accumulations 
of savings as another determinant. But why does he not recognize real gains 
or losses on money balances? The an.swer is not clear. Perhaps it is because 
none of Keynes's theoretical work contemplates drastic or continuing one-way 
price change. As in other monetary theory of the day, the quaesitum was the 
tleteimination of the equilibrium value of money, that is, the price level, not 
the inflation rate or any other dynamic price path. Given this mind-.sei, 
Keynes might naturally regard short-run price variation as too transient and 
reversible to enter consumers' reckoning of wealth. The same mind-set must 
be the reasetn that Keynes did not see inflation as a way of lowering real 
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interest rates even when nominal rates were dose to their floor. (4) Greater 
stress might be placed on Keynes's uncritical acceptance of the neoclassical 
competitive model. By assuming that firms are price takers in auction markets 
rather than price setters in monopolistic competition or oligopoly, he made it 
harder to sustain his vision of persistent disequilibrium, with failures of 
coordination, communication, and adjustment. Imperfect competition was 
the other revolution in economics in the 1930s; one of its sites was Keynes’s 
Cambridge, and two of its agents, Joan Robinson and Srafia, were in his 
group. Yet for some mysterious reason the two revolutions were never 
meshed. 

Keynes planned, Patinkin reports, to write a Ixxik of “Footnotes to the 
General Theory," but was never able to do so. The fourth book, Patinkin feels, 
would have been one of continuation and clarification of the third. Keynes 
had no reason to rethink the General Theory as he had the Treatne. 

Which of the several Keynesian traditions today is authentic? Any or all? 
Patinkin does not enter this dangerous terrain. We can hope he will tell us on 
some other occasion. Meanwhile we can hxik forward to more books that 
combine, as this one does, fascinating narrative, economic history, history of 
thought, and lucid exposition of difhciilt theory. 

James Tobin 

Yale i’nniersity 


StMlies in Manoeronomic Theory. Volume I: Employment and tnfintion. By Rdmi'nd 

S. PlIKI.HS. 

New York: Academic Piess, 1979. Pp. xii + 418. SlO.fiO. 

This vciluinc reprints 17 papers b\ Phelps that were originalh published 
between 196.5 and 1978, together with one previously unpiiblislied pa|>er 
from 1959 and some newly wrillcn material in the lomi of seven brief 
commentaries on the origins and aims of the papers and a synthetic intro¬ 
ductory e.s.say laying out a theory of price, wage, and emplovmcni fluciiia- 
lioiis. Phelps's work is deep, insightful, and tec hmcally challenging. .Many of 
the reprinted pajiers are well known and have been influential in the rapid 
development of niacroetonomic analysis in recent years. 

My candidate for the most significant pa|X‘r in the \olnme is "Phillips 
(airves. Inflation Expectations, and Optimal Em|)lovment over rime"' (1967). 
In this pa[ier Phelps introduced the nalmal-rate hypothesis and the .associ¬ 
ated acceleration hy[X)ihesis into macrcxfconomic analysis, although he mcxl- 
estly reports that he learned these ideas from his mentors William Felbicraiid 
Henry Wallich. The natural-rate liy|x>thesis probably has proven to be the 
most important new idea in macroeconomics since 1936. 

Milton Friedman's appaieiuly independent invention of the natural-rate 
hypothesis in his 1967 presidential address to the American Economic Asscx i- 
ation represents a remarkable coiiicidence in the history of the subject. In¬ 
terestingly, however, Phelps and Friedman in their original presentations 
drew radically different lessotis from the natural-rate hyfxithesis. For Fried- 
niaii, the natural-rate hyjxithesis provided another rcicson for believing that 
activist stabilization jxilicy offers little potential benefit to offset its inheient 
risks. For Phelps, the natural-rate hyjiothesis provided a new and welcomed 
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challenge lo the technical virtuosity of students ol’optimal control techniques, 
a leartion that reflects an essential pari of his implied view, discussed further 
below, of the relevant e of economic analysis for cctinomic policy. 

The coiuenls of thi.s volume mo.stly are concerned with two problems: the 
develo|mient of positive mcxlels of economic behavior that can account for 
observed macroeconomic fluctuations, .ind the development of normative 
criteria for macroeconomic policy bicsed on a dynamic theory of optimal 
inflation. With regard to the former problem, the newly written introductory 
essav attempts to tonsolidate a theory of macroeconomic Hiictuations that, as 
fhelps pots it, IS “developed piecemeal” in the reprinted papers. 

In mv view, this ess;i\ fails lo confirm Phelps's claim that there now exists a 
satislactoi) theory of employment fluctuations that explains why nominal 
disturbances apparentlv t reate a temporary divergence between actual and 
expel ted inflation rales and thereby affect real variables in aildition to prices 
and wages. Recent sears have seen progiess, lo which Phelps’s work has 
contributed, in clarifving the objeciives foi such a theory, but have not seen 
the attainment of these objectives by either Phelps or other theorists. Both of 
the main existing approaches lo this problem—the non-market-tlearing 
moilels. exemplified in this volume by the intiodiictoiy essay and the paper 
written |oiritl\ with Joliti laylor, ".Stabilizing Powers of Monetary Policy 
under Rational F.xpectation.s" (1977), and the incomplete-information mod¬ 
els. developi'd most full) In Robert I.ticas and Robert Burro —seem to rely 
either explicitly or implicitly on contrived and unconvincing impediments to 
Waliasian oiitionies. 

Ill Phelps's new work, specifically, the key assertion is that costs of informa¬ 
tion transmission c ntise delays in market-clearing prit e and wage adjiisimenis, 
( his idea, howevei, receives neither formal theoretical deveiopmeni nor 
eiiipiiical confirmation. Phelps’s sulisiantial contributions to the ongoing 
work oil a tlieory of mac roei onoinic flucluntions actually have largely in¬ 
volved (|ualificaiion and enilR'Ilishnieiit of the essential clarification embodied 
in tfic iiatiiral-rate hvpotfiosis—namely, that a Phillips curve consistent with 
neoclassical eccinoitnc postulates would relate divergencies ol uuemploymeni 
from Its et|ciilihnum, that is, ‘‘natural.” level to the difference iK-twcen the 
inflation rale and some mettsure ol the- expected inflation rale, ralfier lhan 
simpi) lo tlie mllaiion rate. 

One acconiplistiinenl of the newly written mairnal is lo cleai up am¬ 
biguities and me onsisiciic les in some of the reprimetl papers with regard to 
market-clearing assumptions. As indicated above. Phelps now prefers the 
ncin-market-clearing approach, and the new mlroduclory essay, iii conirasl lo 
some of his earlier work, assumes that both labor and product markets fail to 
clear. His mam defense ol the non-market-tlearing approach is lliai he 
regards the alternative incomplele-inlormaiion approach as unrealistic, be¬ 
cause when combmed with assumptions about rational rxfx?ctatioiis it implies 
that only unsysteniaiic components of monetary policy aficet real variahles. 
Phelps, however, discusses no empirical evidence that supports rejection ol 
this hypothesis. Actually, the existing empirical analysis .seems lo he iiiconclu- 
sive. Vloreover, here as elsewhere, bis diHus.sion ol the existing literature is 
impressionistic and does not include either demiled examination of alterna¬ 
tive mixlels or even specific references. 

Phelps al.so refers to "the patent unrealism of the view [implicit in the 
inrompleie-inlormation approachl that unemployment is wholly voluntary" 
(p. 27). However, as many authors, ineJuding myself, have pointed out, this 
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crilicism is either semantic or wrong. Specirually, the recently developed view 
ol labor market transactions as involving implicit contractual arrangements, 
which Phelps acknowledges to be a useful contribution, explains layoffs and 
other apparent evidence of nonwage rationing of employment without in¬ 
voking a failuie of labor markets to clear. 

Phelps alludes in a few places to what seems to me to be the most telling and 
fundamental prohlem with the inomiplete-information approach, incorpo¬ 
rating rational expectations and market clearing, to modeling macroeconomic 
fluctuations. I'liis prohlem is that the ttssumptions of this theory apparently 
imply that perceived monetaiy disturbances would not affect real variables, a 
hy(iothcsis that seems inconsistent with the apparent relation between pub¬ 
lished monetary data and measures of output and employment. 1 o gel 
around this prohlem, the Lucas and Barro formulalions of this theory im¬ 
plicitly employ the tontrivance that rational individuals disregard current 
monetary data. 

With regard to the implicitly contractual view ol labor market transactions 
referred to above, Phelps emphasi/es a variant, develoixvl in a short paper 
written jointly with (iuillermo f'.alvo. “Kmploynienl (ionlingent Wage (ion- 
ti at ts" (1977), that assumes that employers know more about reali/.ed stales of 
naliiie than do ihcit workers. 1 he significante ol this assumption, however, 
seems ((uestionahle becatise, tonirarv to what Labo anti Phelps assume, 
employers m tompeiiiive labor markets probably would not find it worthwhile 
It) expltiit suth an informattoiial athantage. In any event, even with the 
LaKo-Phelps assumption ahoiil firm behavioi, the 1 oniractiial mcKlel does not 
imfilv that employment fluctuations shoultl l)e relatetl to published moneiars 
tlata. Ihiloriunately, Ciaho and Phelfts obsture this jtoini by implicitly miro- 
duemg anothet t|uestionable assumption, that the nioneiars authoriiv has 
supeiitir initirmation about teal and velotily shocks. 

In addition to mat r<K*<onomit fluclualitms. the other main ftxiis ol this 
volume IS, as indicated alxive, a thetuy tif optimal inflation. Phelps is m- 
lerestetl m |)rescribing inflation {xilicv Ixiih in thectiniext of a determtnisitc 
steatly state and in slot hastic anti non-steads-stale lonlexts m wliith optimal 
ctmtrol lechnitjues hetonie relcsain. 1 he analysis of these issues is com¬ 
prehensive and introfltites many subtleties anti complexities. Kxamples of 
fundamental insights arc the Mewing of the problem ol optimal inflation as 
involving the thoice ol an optimal lax mix and as being |)arallel to the 
pioblein ol tipiimal growth. Importantly. Phelps clearly distinguishes anti 
explains the resjieclive roles ol empiiital knowledge anti elhiial prstulaies in 
|irescriiiing policy. 

It IS wtirlh recalling in this tontexi ih.tt Phelps rejects models of mac- 
roeionomit flucttiaiions that assume market tlearing and rational ex|XHia- 
tions. .Suth motlels imply that feedback control is irrelevant. In his more 
ret enl papei s, however. Phelps takes careful account of how the inirtxiuclion 
of rational expectations in plate of adaptive expectations in non-markel- 
clearing models raditally changes the control problem. In "Disinflation with¬ 
out Retession: Adaptive (;uide[xrsls and Mimeiarv Policy" (1978). he devises 
a subtle counlerinHationary plan involving "« dyriamir monrtnn propam which 
IS talculaieci to maintain full einploymeni in.sofar as the average wage ap¬ 
proximates its warranted path, and the insliliilitm of some fonn of indirativr 
wage planning to guide and promtile tlie growtli of wages along that warranted 
path" (p. 256). In advix ating this plan, he also carefullv assesses the questions 
about economic liehavior that make ns success less than certain. 
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Phelps’s analysis of optimal macroeconomic policy involves challenging 
imelleciual exercise and undoubtedly enhances understanding of important 
economic relations. His discus.sions of policy issues, however, both in this 
volume and in other writings, also illustrate the confusion prevailing among 
us academic economists about the relevance of normative economic analysis 
for actual policymaking in real economies. In his critique, “The 1972 Report 
of the President’s Council of Economic Advisers [CEA]; Economics and 
Government,” published in the September 1972 issue of the Ameriran Eco- 
nomtr Hnnnv (vol. 62, pp. 533-39) and referred to but not reprinted in the 
present volume, Phelps observes in a tone of anger and frustration “that 
economic analysis does not occupy a strategic place in the .selection of eco¬ 
nomic policy” (p. 538). But he does not seem to be willing to entertain the 
possibility that the explanations for this observation may be systematic and 
immutable. 

In critici/mg the CEA, Phelps describes the so-called game plan of 1969-71 
as involving both deception and self-deception, and he suggests that these 
symptoms are chronic characteristics of governmental inability to make effec¬ 
tive u.se of scientilic knowledge. In “Stopover Monetarism: Supply and De¬ 
mand Factors in the 1972-74 Inflation" (1975), he attributes the record of 
ptKir monetary ptdicy to inadequate quantitative knowledge about the econ¬ 
omy. In his 1972 book. Inflation Policy and Unemployment Theory (New York: 
Norton), he writes of "the hope, however unwarranted, of (the cost-benefit 
approach to inflation planning] having an early policy impact" (p. ix). 
Moreover, with regard to alternative inflation policies, he acknowledges “the 
exiraotdinat > diflicitkies of estimating costs and benefits” (p. xvii), as well as 
the fear that "il the lid is taken off the inHatioti question . . . the public will 
siniplv bungle its inflation decisions" (p. xv). At the end of this discussion, 
however, he succumbs to apparently wishful thinking when he writes, ”1 
believe there could be a significant gam from adopting a less ritualistic, more 
’calculating’ apptoach to inflation policy” (p. xvi) and hnally offers the advice 
to sink or swim when he writes, “Getting hold of out democratic policy¬ 
making potentialities is as muth the |x>iiii as the modest though valuable 
economic gain that can be expected to result” (p. xvi). 

It is interesting that, in “Inflation Planning Reconsidered” (1978), Phelps 
discusses precautionary considerations that lake account of the political in¬ 
ability to lake appropriate action. But, he al.so displays unrealistic, ivory-tower 
faith in the importance of logic and ideas. For example, he apparently 
regards such constructs as “the social utility function,” “the Rawlsian 
maximum criienon," and “the intersection of weighted individual prefer¬ 
ences and technically feasible choices” to be potentially operational and to 
provide a meaningful basis for actual policymaking. In a newly written com¬ 
mentary, he admits to being amazed that the natural-rate hypothesis, a 
straightforward implication of neoclassical theorizing, turned out to be so 
contrczversial. In “Commodity-Supply Shock and Full-Employment Monetary 
Policy” (1978), he fancifully attributes a decision not to increase the money 
stock to accommodate supply shocks to Federal Reserve acceptance in 1974 of 
the latest, and at that date still unpublished, theoretical work implying that 
nonaccornmodaiion is harmless. 

Phelps clearly does not appreciate the correct perception of many laymen 
that much professional economic advice is based on academic fads. In the 
newly written material he slates, "It may be that, as a methodological device, 
the piostulate of rational expectations will prove to be irresistible—whatever 
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our doubts over its substantive reality” (p. 28). But does he really expect the 
public to take seriously prescriptions based on nothing more than profes¬ 
sional prejudice? And how are laymen to distinguish solid advice from un¬ 
tested hypotheses? My guess is that attempts to apply the latest intellectual 
speculation to policy questions are subversive to efforts to enhance the role of 
fundamental economic analysis in policymaking. 


Brown Univmity 


Herschel I. Grossman 





Interpreting Economic Time Series 


Thomas J. Sargent 

University of Minnesota and Federal Reserve Bank of Minneapolis 


This paptT explores some of the implications for econometric prac¬ 
tice ol the principle that people's observed behavior will change 
when their constraints change In dynamic contexts, a proper 
definition of people’s constraints includes among them laws of mo¬ 
tion that describe the evolution of the taxes they must pav and the 
prices of the goods that they buy and sell. Changes in agents' per¬ 
ceptions of these laws of motion (or constraints) will in general 
produce changes in the schedules that describe the choices they 
make as a function of the information that ihe\ possess. Until very 
recently, received dynatnic econometric practice ignored this princi¬ 
ple. 'I'he practice of dynamic econometrics should be changed so that 
it is cotisistcnt with the principle that people's rules of choice are 
infhiencc'd by their constraints, {'his is a substantial undertaking and 
involves major adjustments in the wavs that we formulate, estimate, 
atid sitnuinte econometric mcKlels. 


Introduction 

This pafX,T explores some of the implications for econometric practice 
of a single principle from economic theory. This principle is that 
people's observed behavior will change when their constraints change. 
In dynamic contexts, a proper definition of [jeople's constraints in- 


Thc views exjiresscd here are solely mine and do not necessanly rejireseni the views 
of the Federal Reserve Bank of Minneapolis or the Federal Reserve System. Many ol 
my thoughts on the subject of this p.ij>er have liecn heavily influenced by luiinerous 
discussions with Lars Peter Hansen and Rolicrt F. L.ucas. Jr. Tfie observauons on 
Bayesian methcxls are in large jiarl those ol Hansen. Ian Bain made manv heljiful 
comments on an earlier draft. This pajier is the text for the Mary F.liralieth Morgan 
Pn/e lecture, given at the linivetsily ol Chicago m November 1979. 


{Journal of tolUKol t.ronomn. I9H]. vol no 2) 

(S) 19BI bv The Umv^nitv ol ChKa^o (M>S2 5M)H/K|/89»2 0(K>7|0I 
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eludes among them laws of motion that describe the evolution of the 
taxes they must pay and the prices of the goods that they buy and sell. 
Changes in agents’ perceptions of these laws of motion (or con¬ 
straints) will in general produce changes in the schedules that describe 
the choices they make as a function of the information that they 
possess. Until very recently, received dynamic econometric practice 
ignored this principle and routinely deduced policy conclusions by 
assuming that people’s rules of choice would not vary, for example, 
with the government’s choices of laws of motion for variables such as 
tax rates, government purchases, and so on. These variables are 
supposed to have their effects precisely because they influence the 
constraints of soitie private agents. 

The practice of dynamic econometrics should be changed so that it 
is consistent with the principle that people’s rules of choice are 
inffnenced by their constraints. This is a substantial undertaking and 
involves major adjustments in the ways that we formulate, estimate, 
and simulate econometric models. Foremost, we need a stricter 
dehmtiori of the class of parameters that can be regarded as “struc¬ 
tural.” The body of doctrine associated with the “simultaneous 
equations ’ model in econometrics properly directs the attention of the 
researcher beyond reduced-form parameters to the parameters of 
“structural equations,” which presumably describe those aspects of 
the befia\ ior of jteople that prevail across a range of hypothetical 
environments. Estimates of the parameters of structural equations are 
needed in order to analyze an interesting class of polity interventions. 
Most often, however, included in a prominent way among the 
“structural equations” have been equations describing the rules of 
choice for private agents. Consumption functions, investment sched¬ 
ules, demand functions for assets, and agricultural supply functions 
are all examples of such rules of choice. In dynamic settings, regard¬ 
ing the parameters of these rules of choice as structural or invariant 
under interventions violates our simple principle from economic 
theory. 

This paper de.scribes methods for interpreting economic time series 
in a manner consistent with the principle that people’s constraints 
influence their behavior. For the most part, I shall restrict things so 
that the dynamic economic theory is of the equilibrium variety, with 
optimizing agents and cleared markets. However, many of the princi¬ 
ples described here will pertain to other types of dynamic economic 
theories, such as “disequilibrium” models with optimizing agents. The 
line of work I shall describe has diverse antecedents, of which major 
ones are contributions of Muth (1960, 1961), Nerlove (1967), Lucas 
and Prescott (1971), Telser and Graves (1971), and Lucas (1972i>, 
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1976).' The works of Granger (1969) and Sims (1972) have provided 
key technical econometric foundations. 

The basic idea is to interpret a collection of economic time series as 
resulting from the choices of private agents interacting in markets 
assumed to be organized along well-specified lines. The private agents 
are assumed to face nontrivial dynamic and stochastic optimization 
problems. This is an attractive assumption because the solutions of 
such problems are known to imply that the chosen variables (e.g., 
stocks of factors of production or financial assets) can exhibit serial 
correlation and cross-serial correlation. Since time series of economic 
data usually have the properties of high own-serial correlation and 
various patterns of cross-serial correlation, it seems that there is pio- 
tential for specifying dynamic preferences, technologies, constraints, 
and rules of the market game that roughly reproduce the serial 
correlation and cross-correlation patterns in a given collection of time 
series measuring market outcomes. If this can be done in such a 
fashion that the free parameters of preferences, technologies, and 
constraints are identifiable econometricalJy, it is then possible to 
interpret the collection of time series as the outcome of a well- 
specified dynamic, stochastic equilibrium model. This paper is in¬ 
tended as a nontechnical summary of some of the econometric and 
theoretical issues involved in interpreting data in this way. 

But why should anybody want to interpret time-series data as rep¬ 
resenting the results of interactions of private agents’ optimizing 
choices? The answer is not that this way of modeling is aesthetically 
pleasing, although it is, nor that modeling in this way guarantees an 
analysis that implies no role for government intervention, which it 
does not. The reason for interpreting time series in this way is practi¬ 
cal; potentially it offers the analyst the ability to predict how agents’ 
behavior and the random behavior of market-determined variables 
will each change when there are policy interventions or other changes 
in the environment that alter some of the agents’ dynamic constraints. 
There is a general presumption that private agents’ behavior and the 
random behavior of market outcomes both will change whenever 
agents’ constraints change, as when policy interventions or other 
changes in the environment occur. The most that can be hoped for is 
that the parameters of agents’ preferences and technologies will not 
change in the face of such changes in the environment. If the dynamic 


‘ Examples of work in the general line arc Holt ei al. (1960). Craine (1975). Crawlord 
(1975), Gcwckc (1977), Sargent (1977, 1978). Blanco (1978), Hansen and Sargent 
(1979, 1980a. 19806), Taylor (1979, 1980). Hunt/tnger (1979). Kennan (1979). Meesc 
(1979), and Nerlove, Grether, and Carvalho (1979). The philosophy of this work ts 
reviewed by Lucas and Sargent (1978. 1980). 
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econometric model is formulated explicitly in terms of the parameters 
of preferences, technologies, and constraints, it will in principle be 
possible for the analyst to predict the effects on observed behavior of 
changes in the stochastic environment. 

Past dynamic econometric studies should usually be regarded as 
having been directed at providing ways of summarizing the observed 
behavior of interrelated variables, without attempting to infer the 
objectives, opportunities, and constraints of the agents whose deci¬ 
sions determine those variables. Most existing studies can be viewed, 
at best, as hai ing estimated parameters of agents’ decision rules for 
setting chosen variables as functions of the information they possess. 
Most ol the better .studies of consumption, investment, asset demand, 
and agricultural supply functions must be interpreted as having 
e.stimated such decision rules. Dynamic economic theory implies that 
these decision rules cannot be expected to remain invariant in the face 
of policy inters entions that take the Form of changes in some of the 
constraints facing agents. This means that there is a theoretical pre¬ 
sumption that historical econometric estimates of such decision rules 
will provide poor predictions about behavior in a hypothetically new 
environment. I his was Lucas’s (1976) critique of econometric policy 
evaluation procedures as they existed in 1973. 

Some readers of Lticas (1978) have interpreted the message as a call 
to evaluate policies by using existing econometric models differently,^ 
ffowever, one implication of Lucas’s argument, and of dynamic eco¬ 
nomic theory generally, is that the formulation, identification, and 
estimation of the models must each be approached in substantially 
new and different ways. Most existing models simply cannot be saved 
by simulating them a little more shrewdly.''* 


* papers by Anderson (1979) and Mislikm (1979) seem ai least partly motivated 
by this interpretation 

^ I he set of ideas I distuss m this paper has |K*rhaps received most notoriety in the 
context of rnacroecononiit examples^ In particular, substantial attention has been 
devoted to the sample et<momies of Lucas (I97J^) and Sargent and Wallace (1973) in 
which those systematic nonnculralities that come from imputing persistently siibopii- 
mal expectations to agents were shown to disappear when the hyjx>thesis of rational 
expectations was imposed on agents. C^rudcly pul. certain classes of systematic mone¬ 
tary policies, in particular tho.se which operate solely via deception, were rendered 
imjx>tent in the Lucas and Sargent and Wallace examples Since the publitaiion of these 
papers, many papers have been published that have described .setups in which the 
cbc)ice of systematic policy mailers, even when rational expectations prevail These 
papers u.sually invoke a source of nonneutrahty not based on deception, of which there 
are many in standard macroeconomic theory. Pa}>ers of this class have often been 
interpreted as providing a defense of *‘pre-rational expectations’* activist fiohcies along 
lines that were produced by calculating optimal controls for Keynesian econometric 
models (A the style of the late 1960s. In fact, no such defense is implied, partly because 
the methods by which optimal controls for government policy variables are calculated 
are very different in all rational expectations models from the prcKcdures that were 
applied to pre-rational expectations models, but also because the ways in which 
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Formulating and estimating “rational expectations” models and 
dynamic equilibrium models of economic time series involves a variety 
of important conceptual and econometric issues, some of which 1 try 
to summarize in this paper. Among the issues to be treated are the 
following; 

i) Identification criteria. — Prior identifying information of the Cowles 
Commission variety, that is, mainly exclusion restrictions, plays a 
much smaller role in dynamic equilibrium models. Nonlinear cross¬ 
equation restrictions implied by dynamic theory are used extensively. 
This shift involves important modifications of past ways of thinking 
about identification and estimation. 

ii) Models of error terms. —The dynamic equilibrium modeling strat¬ 
egy virtually forces the researcher to think about the sources and 
interpretations of the error terms in the stochastic equations that he 
fits. The explicitly stochastic nature of the theorizing makes it difficult 
to “tack on” error terms after the theoriz.ing is done, a usual proce¬ 
dure in the past. 

iii) The role of Granger causality. —Granger causality turns out to be a 
critical concept in the formulation of dynamic economic models, as it 
is coincident with the condition for the appearance as an information 
variable in an agent’s decision rule of a variable not otherwise in the 
agent’s criterion function or constraints. 

iv) Bayesian analy.iis .—Bayesian econometric techniques provide a 
means of mixing prior theoretical information about parameters with 
information from the data. Such procedures are widely used by 
applied lime-series econometricians, although often no formal Bayes¬ 
ian justification is given. Dynamic economic theory provides no 
justification for one widely imposed class of prior restrictions which 
tan be viewed as restrictions directly on decision rules. Instead, 
dynamic economic theory suggests that prior information about 
agent.s’ criterion funtlions and constraints is what should be used in 
estimation. This feature of dynamic economic theory has implications 
for the proper implementation both of formal Bayesian procedures 
and of le.ss formal procedures for constraining parameter estimates. 

1 shall organize my discussion around an example, namely, a 


(•cojiomelric esuinates are Ui be construcied lor rational expettations models, with or 
without neiitralilie.s. dilfer substantially from the methods applied to ihc Keynesian 
models of the 1960s The mam point of the Eutas (1972o) and Sargent and Wallace 
(197,'i) examples is that substituting the assumption ol rational expetialions for 
"adaptive” expectations makes a critical difference for the methods both bv which we 
should evaluate and optimally choose government policies. That same message is pres¬ 
ent in the papers ol Fischer (1977), Phelps and Taylor (1977). and Hall (1978), even if 
superficially the differences in some qualitative features of the optimal policies under 
the two assumptions on expectations mav have seemed less dramatic than in Sargent 
and Wallace's example or I.ucas's. 
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linear-quadratic version of Lucas and Prescott’s (1971) mode) of in¬ 
vestment under uncertainty. I shall use this example for discussing 
the econometric implications of dynamic equilibrium models. I have 
adopted a linear-quadratic setup because it simplifies both the 
theoretical and econometric discussions, while illustrating many of the 
salient methodological implications of dynamic decision theory. 
Linear-quadratic optimum problems deliver difference equations that 
are linear in the variables and .so match up nicely with much existing 
dynamic econometric theory. The reader familiar with Lucas and 
Prescott (1971) will recogni/e how the example can be generalized to 
incorporate more general specifications for the technologies, prefer¬ 
ences, and constraints. That increased generality would make the 
econometric implications harder to extract than with the present setup, 
without altering the basic message.'* 


Investment under Uncertainty 

1 his paper describes a linear-quadratic version of Lucas and Pres- 
(oit’s model of investment and u.ses it as a vehicle for expositing a 
variety of conccjitual and econometric issues. The model describes 
the mutual determination over time of the capital stock, output, and 
market price of a single industry. The model tan be generalized to 
handle multiple factors of production at the cost of what are really 
only technical complications. Similarly, the model could also be 
generalized to incorporate a set of industries, like the corn and hog 
industries, with interacting dynamics. Finally, 1 mention that it is 
straightforward to modify the model to incorporate much richer 
dynamics by generalizing the nature of the adjustment costs. 


■' Using llie mclhods of di-voiinicd dynamic programming (e.g.. Blackwell 196.5), 
theoretical re.siilt.s c.stahlishing existence and uniqueness of eqiiilibiia and various 
qualitative features <il ifie equilibria can often l>e obtained for "weak" or "general" 
assumptions, such as that utility is concave, constraint sets are convex and monotone in 
shift variables, and so on. Lucas and Prestott (1971) and Lucas (1978) give interesting 
illustrations of these methods. These techniques were also used by Sargent (198()/i) to 
make some general observatitins on interpreting time-series correlations between To¬ 
bin’s 7 variable and the aggicgale rate of investment. However, for applied work, it is 
necessary to be able to calculate equilibria as a function of the free paramcteis of 
preferences and constraints, and it is highly desirable if the equilibria ran be calculated 
easily. While ftir general iunclional forms 11 is in principle (xissible to talciilate equilib¬ 
ria of recursive competitive mrxlels using a contraction map|>ing, in practice such 
methods are presently too expensive to use in empirical work. For this rea.son, for 
empirical work it is jjrescntly necessary to choose functional lorms lor which equilibria 
can be calculated either analytically or very quickly. Linear-quadratic specificaUons are 
one of the few such choices of convenient functional forms available. (Various versions 
of logamhmic specification are also sometimes tractable, e.g., Merton [1971).) A valu¬ 
able treatment of recursive competitive equilibrium models with general specifications 
of functional forms is Prescott and Mehra (1980). 
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I define the following variables; 

yt = output of the representative firm; 

n = number of firms in the industry, assumed constant over time; 
Y, = ny, = total output of industry; 

Pi = price of output; 

Di, = a (p, X 1) vector of random variables appearing in the indus¬ 
try demand schedule,/), > 1; 

D^t = a (p — p,) X 1 vector of random variables which help predict 
future values of the collection of variables D,f, p > p,; 

jy _ ^\t 1 . 

J 

Wt = rental rate on capital; 

Wt = a {q X 1) vector whose first element is u',; the remaining 
elements of IV, are variables that help predict future w,'s; 

и, = a random shock to demand; 

e, = a random shock in the production function; 

к, = stock of capital of the representative firm; and 
K, = nk, = total capital st<x;k in industry. 

The subscript t indexes the date to which the variable corresponds. 

1 further define the following polynomials in the lag operator L : 

ru 

«„{/.)= 1 - 


where 6^, is a scalar; 


6„(L) = /„ 


n> 

V 




where S/,j is a /) X p matrix and is the p x p identity matrix; 

'"it 


where 6^,^ is ^ (q x q) matrix and /, is the {q x q) identity matrix; and 
8,(Z.) = 1 - 

where is a scalar.* 

The industry consists of n identical competitive firms, each of which 
uses a single factor of production, capital, to produce a single output. 


“ I shall impose the condition that the zeroes of 6,(i). del 6n(t) and del S^tr) each 
exceed unity in modulus. Actually, a weaker condition would suflice, namely, that the 
zeroes ol these polynomials each exceed in miKluliis, where /3 is the discount factor 
introduced below. These conditions on the zeroes are rej^ularitv conditions that assure 
that the infinite series calculated in eejq (14) and (19) converjfe 
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Output of the representative firm y, is governed by 

y, =/i,+ n->e,. [>0, (1) 

where k, is the representative firm’s stock of capital at t, and e, is a 
random error in the technology. The firm knows {e,, . . . }, but 

does not know with certainty future values of the shock e,. The error 
€/ is known to follow the r,th-order Markov process 

SAL)e, = VU ( 2 ) 

where f'f is a “fundamental” white-noise error term for e,.® The firm is 
assumed to know 8,(/.) and with certainty. 

The demand curve for output is given by’ 

P, - d.,do,/1,>0, (3) 

where /),, is a (p, x 1) vector of “demand shifters,” is a (1 x p,) 
vector of constants, and //, is a random shock to the demand curve. 
The random term u, obeys the r„th-order Markov process 

8„(L)ti, = Vt, (4) 

where PJ' is a fundamental white noise for I he {p, x 1) vector of 
detnand shifters D,, consists of the first/>, rows of the/; x 1 vector D,, 
which follows the r„th-order vector autoregressive process 

= ( 5 ) 


w'here I'j' is a (/; x 1) vector white noise that is fundamental lor the 
process/f,. The representative firm is assumed to know 6„(h), 8/j(Z-), 
do. d,, dj, and the second moments of F“ and V'/ with certainty. 

At time t, total output is given by 


Y, = ny, = /A', + 


( 6 ) 


The repre.sentative firm’s problem is to choose a contingency plan 
for k/tj to maximire the criterion 


A., 


P,y, - w,k, 




( 7 ) 


' An (n X 1) \eclor white noise vf is said to be fundamental for an (n x 1) vector 
process if the vector of one-stc|>-ahead linear least-squares errors in predicting x, 
from past x's ran be written as a linear combination of n components of vf. 

’’ Since a simple static demand function is posited, all of the interesting dynamics of 
the model come from its supply side. Specifying a demand schedule with interesting 
dynamics would complicate tlie presentation but not alter the basic mc.ssages of our 
example. Telser and Graves (1971) analyze dynamic optimi/ation problems in which 
much of the interesting dynamics come from a demand curve that is s[)ecified. Sargent 
(1979, chap. 16) analyzes a model of the labor market in which the dynamics are 
influenced by nontrivial dynamic optimi7.auon problems solved by both suppliers and 
demanders. 
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subject to ^0 given. In 0),E, is the mathematical expectation operator, 
conditional on information known to the firm at time I. This informa¬ 
tion set will shortly be specified precisely. In (7), d is a positive con¬ 
stant. The term (d/2)(A,+, — k,)^ is intended to represent the notion 
that there are costs internal to the firm of adjusting the capital stock 
and that these rise at an increasing rate with the absolute value of the 
change in capital. We assume that the rental on capital iv, is the first 
element of the x 1) vector random process W, that obeys the 
ru,th-order vector autoregression 

S^(L)W,=^Vr, ( 8 ) 


where VJ" is a x 1) vector white noise that is fundamental for W,. 
The firm is supposed to know 8^{L) and the second-moment matrix 
of Vf with certainty. 

At time /, the firm chooses given the information that it has 
available at t. However, the maximization problem (7) is not yet well 
posed, since we have not completely spelled out the dynamic con¬ 
straints with respect to which the maximization is supposed to occur. 
To complete the problem (7), we begin by substituting (fh, n *€;) 
lory,, and (do - A u,) for P, to get 



(^<1 


(A„ - AJK, - A,e, + A-iDt, + u,)(fk, + n 'e,) 
- w,k, - , - A,)* . 


(9) 


In order that the problem of maximizing (9) with respect to a 
contingency plan for {A,+j} be well posed, it is necessary to attribute to 
the firm precise views about the laws of motion of the random vari¬ 
ables that it cannot toniiol, but whose values influence the best choice 
of its own stocks of capital. For problem (9), these uncontrollable 
variables about which the representative firm cares are K,, D„, e,. 

and it’i. The firm cares alxiut the present and future behavior of the 
variables (A.',, e,) because they influence the present and future 

behavior of the market price through the demand relationship/', = 
A a - AJKi - A,e, + A.,D„ A- u,. 1 he firm cares about the evolution of 
the rental process le, becau.se it influences its costs. We have already 
completely described our assumptions about the firm’s views of the 
laws of motion of//,,, «,, and w,, namely, that the firm knows the 

Markov laws (4), (5), (2), and (8) thai govern them, and at time t knows 

. . . , M„w, 1 , ...,€„ . . . , and W„ W,..,.To complete 

the specification requires that we specify the firm’s views about the 
evolution of the aggregate capital suh k K,. We assume that the repre¬ 
sentative firm believes that the aggregate capital stock evolves ac¬ 
cording to the law 
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A',+, = //« + H^(L)W, + Ho(L)D, + H.(L)€, 
+ H^(L)ut + H^K,. 
where H„ and are scalars and 

''UJ-l 

HAL) = where f/ u,j is (I x 7 ); 

j»o 


( 10 ) 


r/j-l 

Hn(L) = where //dj is (1 x/>): 

J=0 

Hf{L) ~ ^ where //,j is a scalar; and 

ru- 1 

HJL) = where H^} is a scalar. 

J “0 

The representative hrtn is assumed to know all of the parameters of 
the linear law of motion (10) with certainty. The reason that we have 
chosen the form ( 10 ) as the firm’s perceived law of motion for/C will 
shortly become apparent. 

With these specifications, the maximization of (9) is now well posed. 
Summarizing the setup, we have that the representative firm 
maximizes 


E 


0 



(»0 


{An - A JK, - /l,e, + A2D,, + u,){Jli, + n ’e,) 


(^(+1 



(9) 


subject to the laws of motion* 

- Hn + HAL)W, + H,AL)D, + HAL)€, 


+ HAL)u, + H,K„ 

8AL)W, = Vr, (8) 

8„(/Jm, = V^ (4) 

8AL)D, = V[\ (5) 

8.(L)e,= V;, (2) 


and subject to the information set at time t,* 

" It would be straightforward to modify this setup to assume that the {tV, u, «, D} 
pr(x;e.sse5 are each finite order mixed moving average, autoregressive processes. For 
the detaih, .see Hansen and Sargent (1979). 

• These variables completely characterize the “state" vector for the firm's problem. 
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{^<1 W,_J, . . . , Wt-r„+it D/, D/_J.D(_rj,+|, 


f(-l> • ■ • > E(-r(+l» • • • > *^(-r|,+ l}' 

The firm maximizes (9), taking the laws of motion (8), (4), (5), (2), and 
(10) as given and beyond its control. The firm is assumed to behave 
competitively and to act as if it has no control over the aggregate 
capital stock/f. This is a reasonable assumption ifn is large. The firm 
is assumed to know the S’s and H's with certainty and to know the first 
and second moments of the V'/s.*® We further restrict the problem so 
that the solution is a linear contingency plan.'* For this to be true, it is 
sufficient that the least-squares predictors of future W, D, e, and u's be 
linear in the conditioning variables. This will be true ifF', V“, Ff, and 
Vf obey normal probability laws. Alternatively, the analyst can simply 
assume that the industry is operating under optimal linear rules. In 
either rase, the solution of the representative firm’s problem is a 
linear contingency plan of the form'* 


= Ao + h^{L)W, + h„{L)D, + h,(L)e, 
-I- hy{L)ui -f hiKi + h^kt, 


( 11 ) 


where /»„, k,, and Aj are scalars, and 


• W • 

A„(L) = 


where hu» is (1 x q)\ 


rp-l 


A„(L)= 


J -0 


where X p)\ 


A.(f.) = x h,jU, where is a scalar; and 

j-O 


r„-l 

A„(L) = X A^jf.', where Auj is a scalar. 
j=o 

The A’s of (11) arc in general functions both of the parameters in the 
criterion function (9), the parameters of 5,^, 5„, and 8, appearing in 


We have in mind that the firm actually has observations on values of A’. It', D, t. and u 
for all dates! and earlier. It turns out that the firm’s derisions are optimally a function 
only of the information set listed in the text. 

It is assumed that each of Vf. f'“. E}’, and I'J is orthogonal to the informauon set 

{W,.„ u,_^ 0,_„ 5 » 1). 

" This is because we want the stochastic difference equations describing the behavior 
of the system to be linear and thereby to be readily susceptible to econometric analysis. 

” That the solution to the problem is of this form follows from linear optimal control 
theory (see Kushner 1971, chap. 9; Bertsekas 1976, chap. 3; Sargent 1979, chap. 14; 
or Kwakernaak and Sivan 1972). 
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AVi = //« + HAL)W, + HAL)D, + HAL)€, 

+ H„(L)u, + HiKi, 
where H„ and H , are scalars and 

r„.-I 

j =0 
r/;-| 

//„(/.) = ^ 

HAD = 

J-O 

The representati\e tirrn is assumed to know all of the parameters of 
the linear law of motion (10) with certainly. The reason that we have 
(hosen the form (10) as the firm’s perceived law of motion for A will 
shortly become apparent. 

With ilu'se speciheations, the maximization of (9) is now well posed. 
.Summarizing the setup, we have that the representative firm 
maximizes 


where/y„j is (1 x q); 
where is (I x p); 
where H^j is a scalar; and 

where is a scalar. 


f *-0 


(d() AJKi Af€t djZ)], + + n 'c() 


- w,k, - — (k,+, - k,)^ 


(9) 


subject to the laws of motion" 


AL)Wt + H,ADD, + HAI.U, 

(10) 

)u, + HyK,, 

K(L)w, = vr, 

(S) 

S,(A)u, = Vf, 

(4) 

8r,(L)D, = V'’, 

(5) 

8,(LU,==V„ 

(2) 


and subject to the information set at time I,® 


* It would be siraighdorward to niodify this setup to assume that the {W, u, <, D) 
processes arc each finite order mixed moving average, autoregressive processes. For 
the details, see Hansen and .Sargent (1979). 

* These variables completely characterize the "state” vector (or the firm’s problem. 
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{Ki, kt, Wi, Wj_|, . . . , Df, Dt-i, . . . , Z)i_rp+i, 

€|. £(-11 • • • t £«-r,+ t» •^r* > •^(-ry+l}- 


The firm maximizes (9), taking the laws of motion ( 8 ), (4), (5), (2), and 
(10) as given and beyond its control. The firm is assumed to behave 
competitively and to act as if it has no control over the aggregate 
capital stock K. This is a reasonable assumption if n is large. The firm 
is assumed to know the 8 ’s and H's with certainty and to know the first 
and second moments of the V',’s.‘® We further restrict the problem so 
that the solution is a linear contingency plan.” For this to be true, it is 
sufficient that the least-squares predictors of future W, D, t, and u’s be 
linear in the conditioning variables. This will be true if F 5 , F *, Ff, and 
FJ^ obey normal probability laws. Alternatively, the analyst can simply 
assume that the industry is operating under optimal linear rules. In 
either case, the solution of the representative firm’s problem is a 
linear contingency plan of the form'* 


"■ 8(j + hu,{L)Wi hj){L)Di -t- h,(L)€, 

-f- h^(L)tii + hfKf + h^k/. 


where ho, h,, and A 2 are scalars, and 


ruj-l 


/!„(/.) = ^ho-tU, 

r/;-I 

hi>(L) - ^ h[,,lJ, 


J-O 


where hu,j is (1 x g): 
where hoj is (1 X p); 


n-' 

where is a scalar; and 

J-O 

r,j-1 

h„(L ) = ^ where is a scalar, 

j-o 

The /I’s of (11) arc in general functions both of the parameters in the 
criterion function (9), (he parameters of 8„, S„. and 8 , apjjearing in 


Wc have m mind that the firm actually hasolMcnations on values ot K .kJV. D.t, and u 
tor all dates ( and earlier. It turns out that the firm’s decisions arc optimally a funcuon 
only of the information set listed in the text. 

It is assumed that each ol Ky, I’l', and V', is onhopinal to the information set 
(W,.„ u,_,. D,.,. €,^„ .( » 1>. 

" This is because we want the stochasuc difference equations describing the behavior 
of the system to be linear and thereby to be readily susceptible to econometnc analysis. 

" That the sohiDon to the problem is ol this form follows from linear optimal control 
theory (see Kushncr 1971, chap 9, Bertsekas 1976, chap. 3; Sargent 1979, chap. M, 
or Kwakcmaak and Sivan 1972). 
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(8), (4), (5), and (2), and the H's of the perceived law of motion for 
capital (10). The mapping giving the h's as functions of these other 
parameters is defined implicitly by standard formulas in linear opti¬ 
mal control theory, as exposited, for example, by Kwakernaak and 
Sivan (1972) and Bertsekas (1976). For present purposes, it is enough 
to note the existence of this mapping without exploring its nature in 
detail. The economic content of the mapping frtim the S’s, H’s, and 
objective hinction parameters to the k parameters of the firm's deci¬ 
sion rule is easy to understand, since it captures the notion that the 
firm’s rule of choice de|>ends on both its objective and its perceived 
constraints (10). (8), (4), (5), and (2). 

Multiplying both sides of the firm’s decision rule (11) by n and using 
K, — nk, gives 

/c,+ , = «/i„ + nh,^.(L)W, + nhij(L)D, -h nh,(L)et 

( 12 ) 

+ nhu(L)u, + (rih, + h2)K,. 

Et|uation (12) is the actual law of motion for aggregate capital that 
results from the behavior of the representative firm. The represen¬ 
tative firm’s optimization problem in effect induces a mapping from 
the firm’s perceived law of motion for aggregate capital (10) to the 
actual law of motion (12). For each possible particular perceived law 
of motion of the form (10), there is an implied law of motion for 
aggregate capital of the form (12). The notion of rational expectations 
is that the representative firm’s perceptions of (10) arc correct. In 
effect, a rational expectations equilibrium is a fixed point of the 
mapping that the representative firm’s optimization problem induces 
from (10) to (12). Formally, we define a rational expectations equilibrium 
as a perceived law of motion (10) and an implied actual law of motion 
(12) which are identically equal. In a rational expectations equilib¬ 
rium, firms’ perceptions about the law of motion for aggregate capital 
turn out to be confirmed by the aggregate of the choices made by 
firms. Upon comparing (10) with (12) it is evident that necessary and 
sufficient conditions for a rational expectations equilibrium are 

H„ = nhg. 

H„.{L) = nh„.(L), 

H^(L) = nh„{L), 

HAL) = nhAL), 

HAL) = nhAL), 

Hi = (nhi -r hA. 

Implicit in the above definition of a rational expectations equilibrium 
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are the following elements: (a) market clearing, (b) optimization of the 
firm’s expected present value, and (c) correct perceptions on the part 
of firms of the laws of motion of variables affecting their present value 
but beyond their control. 

We begin our analysis of the model by briefly describing aspiects of 
the optimization problem solved by the firm. Among the first-order 
necessary conditions for the maximization of (9) is the following 
system of stochastic “Euler equations,” which are derived by differ¬ 
entiating (9) with respect to A, for t = 1,2,...: 

pdk,+i - d{] + P)k, + dk,-i = p-iv, 

or 



In addition to the system of Euler equations, a transversality condition 
is among the first-order necessary conditions. The transversality con¬ 
dition can be derived by methods described in Sargent (1979). The 
transversality condition for the present problem in effect requires 
that the solution possess the property 


lim = 0. 

Using the lag operator, the preceding E.ulcr equation can be re¬ 
written as‘^ 




-f 1 Z. -t- -U/J 


^(+1 ~ 




Using the factorization 


the above Euler equation can be written as 

('-j'-)"-I"''-7'’- 

Noting that [1 - (l/j3)L] = - PL~') and operating on both 

sides of the above equation with [-/3“'Z,(1 - gives the 

solution'^ 


For a discussion ol the use ot lag operators in the present context, see Sargent 
(1979, chaps. 9 and 14). 

” In effect, the transversality condition compels us to solve the unstable rtxit forward 
in this manner. 
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(1 - L)k,^y 


1 - /3Z.-' 


Wt + 


1 - /3L'‘ 


Pt. 


or, equivalently, 

» oc 

(1 - L)k,^, = 

1*0 •s'O 


(14) 


It can be verified that (14) satisfies both the Euler equations and the 
transversality condition. Equation (14) would give the appropriate 
rule for setting A,+ 1 il the firm had perfect foresight about the entire 
future paths of the rental w, and the output price f*,. When the firm 
does not have perfect foresight, the correct decision rule can be 
derived by replacing the future values on the right side of (14) with 
the col tesponding mathematical expectations conditional on infor¬ 
mation the firm does have. This leads to the decision rule'® 

(1 - Z.)/.',,, = -d '0 y -i- ^fd ' f (15) 

J-(l (=-!> 

Heie fl, IS defined a.s the information set il, = {W„ W,_ .. u,.,, 

.... D,, D,-i, C/.i. K,). The conditional mathematical 

expectations are assumed to be computed using the laws of motion 
(10), (8), (4), (5), and (2) ior K, W,u,D, and c, respectively, as well as 
the demand relationship P, = - T,(/X', + e,) -F ■¥ u„ which is 

used to deduce the law of motion for P,. Once these conditional 
mathematical expectations are explicitly calculated in terms of the 
parameters of (10), (8), (4), (5), (2), and the demand curve (3), they 
can be substituted into equation (15) to deduce the optimum decision 
rule (11) for the representative firm. The deci.sion rule (11) is linear in 
all of the information variables that appear on the right side. How¬ 
ever, as the above method of calculating the parameters h of the 
deci.sion rule (11) suggests, the parameters h are themselves compli¬ 
cated nonlinear functions of the underlying parameters of the model: 
the parameters Ag,A,,A 2 of the demand curve, the parameters f and d 
of the technology, and the parameters 6„(E), 5,(Z.), b„ (L), and bi,(L) 
of the law's of motion of the random processes given from outside the 
model."* The A’s are also nonlinear functions of the Ws of the law of 


.As noted above, we shall want the relevant conditional exjiectations to be linear. So 
we shall regaid the £(• j 11,) that appears in (15) and elsewhere as wide-sense condi¬ 
tional expectations, that is. linear least-.squares predictors. This amounts to restricting 
the firm to linear deci.sion rules, as desired. 

'* The parameters fl and n also belong in this list ol underlying parameters of the 
mcxlel. 1 shall usually delete these two parameters Irom subsequent listings of the 
model's underlying parameters, though they should be understood. In some apfilica- 
tions, the analyst may want to specify counterparts of /3 and n completely a priori, in 
which case they would not be included among the free parameters of the m^el over 
which the likelihood function or other measure of "fit" is to be maximi'/cd. 
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motion of aggregate capital (10), which are not given from outside but 
are to be determined from the analysis. The nature of these non- 
linearities has been characterized by Hansen and Sargent (lOSOt) and 
will be alluded to further below. 

Equation (15), which was derived by purely formal manipulations, 
has the virtue of indicating clearly that the firm has an incentive to 
forecast future realizations of the rental w and the output price P. As 
a result, any state variables that the firm sees at t, and that help predict 
either future P's or future w’s, will appear in the firm’s decision rule 
for ^,+ 1 , given by equation (II). That the h's of (11) are nonlinear 
functions of the parameters {Ao.A,, A„/,d, 8„, 6(, S,,, b,„H ^,H 

Hg, Ho, and H stems from the nonlinear way in which the condi¬ 
tional mathematical expectations of future w’s and P’s are functions of 
these parameters. 

In practice, to compute a rational expectations equilibrium it is not 
necessary ever to calculate the right side of (15). Indeed, il is never 
necessary explicitly to calculate the h’s that determine the decision 
rule (11) of the representative firm. Instead, the H’s of the equilib¬ 
rium law of motion for the industry can be calculated directly as 
follows.” First, multiply both sides of equation (13) by n, then use AC, = 
nk, and collect all terms in K on the left side to get 

l3dK,+, - [d(l + /3) + AJ^fin]K, f d/C,-, = ^nw, 

(16) 

- ^nfAo + Atl3fn€t - plnA^Du - fifnu,. 


It is of some interest that (16) is itself the Euler equation for the "social 
planning" problem of maximizing'* 


Eo'^P' 


d„(/K, + f,)~YA,(fK, + t,y + (/K, + €,)A,D„ 


(17) 


+ (/AC, + €,)u, 


“ - y W'd (AC,+, - AC, )2 , 


"The (ollowiiiK argument in the text provides .a way of distovcring Tinas and 
Preseolt's (1971) method of LaUulatiiig the rational expectations ec|inlibriiim hs for¬ 
mulating a ficlitioiis sixial planning problem that reproduces the e(|uilibrium It is 
worth remarking that Kydiand and Prescott (1977) describe a recursive metluxf of 
calculating a linear rational expectations equilibrium that is applicable to our problem 
and IS distinct from the Lucas-Presiott method u|m>ii which the discussion in the text is 
based. Kydiand and Prescott’s method successfully computes the equilibrium even in 
instances in which the Lucas-Prcscott method breaks down. These instances occur, e.g.. 
in which there is feedback fnim the industry-wide aggregate capital stock K to H' or D, 
as would occur if lagged A't appeared as stares in the Markov law lor II’ or D. In such 
instances, Lucas and Prescott's scKial planning problem faib to reproduce the rational 
expectations equilibrium essentially because the hctitious planner ukes into account the 
externality that the feedback from fC to W or P constitutes. 

"This was emphasized in a more general coiuext by Lucas and ITescott (1971). 
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subject to the laws ol motion ( 8 ), (4), (5). and (2) for w„ u„ D,„ and e,*® 
and subject to Kf, given. 

The term in brackets is the area under the demand curve, since 

- A,x + + u,)dx = + Y,u,. 

Thus (17) is the discounted area under the demand curve minus the 
total costs of production. Dividing each side of (16) by ^d, the Euler 
equation can be written 

Aj.. -{1 + A + + 

= ( 18 ) 

-d >/nA,n„- 

It (an easily be proved that there exists a X such that 

[1 - (1 + /3 ' + AJ-nd ')L + fi 'IJ] = [1 - (\j 8 )-’/.](l - XL), 

where | A | < MV Using - (\/3)“'/,(l - Xfil."') = fl - (Xf3) we 
have that the Eulei etjiiation (18) tan be written as 

- iXfi) '1(1 - XfiL-'){l - XL)K„, = + j'l’f 




AJjl 

d 






«(■ 


A solution of the Euler etjuation that afso .salishes the transversality 
tondition for the social planning problem is 


(1 - XL)K„, 


+ X(3rifA 
d 


^ ( \ -X^) 


nXp 

d 


I - XfiL ' 


w, 


A JnX(3 

d 1 - XpL 


L' InX^d'L-' 

^ e, -T- r “7 Tzrr~~ o^„ 


I - A/3E 


( 19 ) 


+ 


fnXpd'L' 
1 - A/3L ‘ 


Mr- 


Retail, for example, that (I - XfiL ') 'w, = Sjlo (Xf3)^u',^.j. Then it tan 
Ire recognized that ecjuation (19) is the perfect foresight solution of 


It ran also he proved that the transversality condition for (17) imposes the same 
condition on the solution as does the transversality rondilion of the representative firm. 

“This follows directly from the crbservaticrn that if z,, is a rcro ol f 1 - (1 )9"‘ -f- 

AJhid' ')z + /8"'z®], then so is )9z;' 
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the planning problem that the rational expectations compietitive equi¬ 
librium implicitly solves. Thus, equation (19) expresses the aggregate 
capital stock as a linear function of K, and all future values of w,, 
Cl, D,„ and u,. 

By using the methods of Hansen and Sargent (19806, esp. appendix 
A), equation (19) can be converted to the “realizable" law for K that 
satisfies the Euler equations and transvensality conditions, and which 
expresses K,+i as a function only of information known at time t.*' 
This involves replacing the terms w,+,, D„+j, and u,+t in (19) by the 

corresponding mathematical expectations conditioned on fi,. The 
resulting equilibrium law of motion for K can be shown to be 


where 


= H„ + + Hu(L)D, + //.(/-)€, 

-t- H^(L)u, + H,K„ 


dO-m' 


//^(/,) = ■ 

a 


= X, 

L-'[l - S^Kpr'SAL)] 
1 - X/3L ‘ 


L-'[l - 


Hn(L) = +Jn\l3d-'A,<t,„ 
H^(L) = +d '/TiXfi 


1 - X/3I-’ 

/.-■[/ - 5„(X/3)-'6o(E)] 


1 - X/3L-' 


L 'U - 6„(Xff)-S„(L)] 
I - KPL ' 


( 20 ) 


( 21 ) 


Here <6„. is a 1 x q vector with 1 in the first position, followed by (q — 
1 ) zeroes, and <6/, is d fi, x ft matrix with a (p, x p,) identity matrix as 
the lirst/i, columns and zeroes elsewhere. Notice that le, = and 

/)i, = <Pi)Di. It is convenient at this point to recall the laws of motion 
assumed for w,, u,, e,, and /),,, namely, 

= ( 8 ) 

SJL)u, = r“. (4) 


Note that <■<) (19) saii.slics the liiM-ordcr iutcssar> conditions tor the optimi/.ation 
problem hut gives the planner texr mtit h informaUon (it is "antrcipative” or "nonrealiz- 
ablc”). The correct solution to the pioblem taking the infoimation set available to the 
planner into account is the solution ol the first-order necessary conditions that expresses 
Kn, as a (unction only ol inlorinaiion that the planner jxissrsses at time I. Such a 
solution is said to lx- "rcali/able" or "nonanuciyiative " 
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8„(L)D, = y?, (5) 

«.(/>) 6 , = yi ( 2 ) 

Equation (20) expresses the equilibrium law for the industry-wide 
capital stock Ar,+ , as a linear function of and current and past values 
of W, D, f, and u. Current and past values of W appear in (20) because 
they help predict future values of the rental rate w,, while current and 
past values of/), c, and u appear because they are used by agents to 
predict the future course of the market price P. The numbers of 
lagged values of IC, D, c. and u in (20) arer„, — l.r^ — 1, r, - 1, andr„ 
- 1, as expressions (21) can be used to show.“ Thus, the numbers of 
lagged values of these “information variables” W, D, e, and u in (20) 
are entirely inherited from the specifications of the actual laws of 
motion for W, D, e, and // in (8), (5), (2), and (4). 

Notice that the appearance of w, and D„ in the objective function of 
the representative firm (9) (or equivalently in the objective function of 
the fiititious stKial planner [17]) gives rise to the appearance in (20) of 
the entire blocks of variables W, and D, that help predict w and /)„, 
respectively. Thus any variables that help predict w and /)„, and 
which agents have information on, belong in the equilibrium law of 
motion for industry-wide capital. The property that the remaining 
variables in W (or D) help predict future values of u> (or D,) is said to 
be the property that the remaining variables in IT (or D ) (irangn rau.se 
w (or /),). The notion of Granger causality thus turns out to be 
coincident with the criterion for whether random variables that do 
not themselves appear in the agent’s criterion function nevertheless 
end up in the equilibrium law of motion or decision rule, essentially 
because they appear in the agents’ constraints as information variables 
that help predict variables that do appear in the criterion function. It 
is mainly for this reason that the concept of Granger causality has 
played an important role in work with rational expectations models.*® 


Bv expantiing the polyntirtiial in L. it is possible to show that 


/.-'I/ 


' X Z (W ’-’Ss 


1 - Kfil 


V 


( 22 ) 


where 8(/.) = / - 2,' ^Notice that the polynomial on the left side of (22) is one sided 
in noiincgative jxiwers of /., despite the appearance of L ', and that it is a polynomial of 
order (r - I), as asserted m the text. The formula (22) can be derived by mimicking the 
pnaedures used in Hansen and Sargent (lOSOu). The same mathematiial techniques 
used by Hansen and -Sargent (l98Cb) to derive expressions like (21) or (22) were 
independently utilized by Futia (1979) to compute linear rational expectations equilib- 
na. Also, without knowing of Hansen and Sargent’s work, John Kcnnan independently 
denved formulas similar to (22) in a personal letter to me. 

“ From this point of view, it is irrelevant whether Orangcr causality is consistent with 
one’s notion of what "true" causality is. Sims (1972) has described the relationship of the 
concept of Granger causality to that of strict econometric exogeneity. That a random 
process y fail to Granger cause x is a necessary condition for x to be strictly econometri- 
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Equations (20) and (21) reveal explicitly how the parameters of the 
equilibrium law of motion for industry-wide capital are themselves 
nonlinear functions of the underlying parameters {Ao,A,,A 2 ,f, d, 
n, Su,(L), 8u(L), 8t(L), 8d(L)}- The nonlinearity has two sources. First, 
there is the fact that \ is a nonlinear function of/3 and (A^fhid^') via 
the factorization defining [1 — (\^)''L](1 - KL) = [1 - (1 -f- /3"' + 
AyPnd^' )L + Second, given X, the formulas for//„.(L), W„(L), 

H,{L), and Hd(L) in (21) are nonlinear in the parameters of 8^{L), 
S„(L), 8f(L), and 8o{L). Nonlinear cross-equation restrictions of the 
kind illustrated by (20) and (21) are the hallmark of rational expecta¬ 
tions models. Such cross-equation restrictions are largely absent from 
“pre-rational expectations” dynamic econometric models.*^ The 
presence of these restrictions impinges on a variety of fundamental 
econometric and conceptual issues, including identihcation, the 
analysis of interventions, models of “error terms,” and the role of 
“prior information.” 1 now turn to discussing each of ihe.se issues, 
using (20) and (21) as an instrument. 


Analysis of Interventions 

At this point, it is useful to remind ourselves of the principal reason 
that an economist might want to construct a dynamic econometric 
model of an industry along the lines of our example. It is to be able to 
make quantitative predictions about the effects on the industry that 
various hypothetical interventions or “changes in the environment” 
will have. In the present context, a hypothetical “intervention” or 
"change in the environment" means a change in one of the polyno¬ 
mials 8 ^(L), 8 u[L), 81 ,(L), or S,(/-) that describe, respiectively, the 
stochastic processes lor W, u, D. and € that impinge on the market.” 
Several interesting examples of such interventions can be given, in¬ 
cluding the following: 

a) Suppose that there is a specific tax imposed on sales of the 


cally exogenous with respect toy. Kor this reason, the concept of Granger causality is 
also useful in designing specification tests. For a discussion of the relationship between 
Granger causality and econometric exogeneity in the context of linear rauonal expecta¬ 
tions models, see Hansen and Sargent (1980a). 

” Thus Fisher wrote; "In practice, except for such covanance restrictions [across 
disturbances in distinct structural equationsl. restnetions which relate the parameters 
of one equation to those of one or more others are extremely rare. There is no reason 
in principle why such cases cannot occur, however, and it may be worthwhile devoting a 
very short discussion to them" (1966, p. 176). 

“ By now, this is a routine and uncontroversial definition of an intervenuon. Appli¬ 
cations of the techniques ol optimal control theory to the calculation of mac¬ 
roeconometric and microeconometnc policy resjionse functions employ preasely this 
concept of intervention (see, e.g., Chow 1973; KareWen, Muench, and Wallace 1973; 
Arzac and Wilkinson 1979; and Taylor 1979). 
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product. Such a specific tax can be modeled as a component of 
(A^D,,). Since the behavior of the tax through time will be described 
by an element of the vector Markov law = V',’, changes in the 

rule for setting the specific tax amount to changes in one of the rows 
of 8„(L). 

h) Suppose that there is a specific tax on the use of the factor of 
production. This tax can be modeled as an addition to the rental Wf A 
change m the rule for setting this tax can be modeled as a change in a 
row of 

f) Suppose there is a change in the structure of the process gov¬ 
erning the “pretax” jiart of the rental. Again this can be modeled as a 
change in one row of 6 „.(f.). With a little imagination, the effects of a 
change in the organi/ation of the industry^** sujiplying the factor 
might be modeled in this way. 

The model leading to (20) and (21) provides a way of jjredicting 
quantitatuely the effects of such changes, once agents have caught on 
to them. I he effect of interventions in the scn.se described heie is to 
change the function (20) describing the evolution of industry capital 
in a way predicted by the formulas given in (21). Since interventions 
of rliis class change the law of motion ( 20 ), it is necessary to have 
analytic methods which use the cross-equation restrictions ( 21 ) to 
predict how the H's of the K-law of motion (20) will change if there Is 
a hyi>othetical intervention o|)erating on one or more of the 8 ’s. 

In order to evaluate policy interventions in this way, it is essential 
that the //’s of ( 20 ) should not be viewed as being among the free 
parameters of the model. Instead, the model's free parameters are to 
be regarded as the deejier parameters {A„, A,,A.^,/, d, fi„,, 6 „, 6 ,, S;;}. 
The researcher needs to know the.se parameters in order to be able to 
use the formulas ( 21 ) to predict the consequences of hypothetical 
changes in the functions 

From the dynamic economic theory leading to (20) and (21), it is 

E,g , if it becomes a cartel when beltire il had been t oni|)ctiljvc or nonctxipeialive 
in some way. 

" A technical qualirication needs to be added at this point. In order to have a model 
capable of predicting effects of interventions acting on the S’s, one can sometimes get by 
without having uniquely identified the parameters {A, f, d}. What the researcher must 
identify are the parameters of the characteristic polynomial of the Euler equation (16), 
namely the parameters d‘a, <t>i in {pd - [d(l + 0) + A J*np]L + dIA} = ()80, + d><J- + 
where 4>, = d and ~<t>„ = ld(l + ^) + A/*n)8]. The theory assumes that the 0/s will 
be invariant with respect to interventHiiis on the S’s. If the researcher tan uniquely 
identify the 0y’5, he can proceed with econometric polity evaluation, even if he cannot 
uniquely identify all of (A,,/, d). In some setups, the parameters of the characteristic 
polynomials of the Euler liquations are identihed even though only an equivalence class 
of the counterparts of (A,, /, d) is identified. This is enough (or econometric policy 
evaluation to proceed. This problem is discussed by Hansen and Sargent (I980S). It is 
technically related to the "inverse optimal control" problem (see Mosca and Zappa 
1979). 
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evident that a given numerical version of (20), estimated from histori¬ 
cal data, cannot be used to evaluate the consequences of arbitrary 
input sequences for { W,}, {D,}, {e,}, and {u,}. That is, a fixed law of 
motion of the form (20) with given numerical values for the H’s 
cannot be used to investigate the conse<)uence of arbitrarily specihed 
numerical sequences for the W, D, e, and w's. In effect, a particular 
version of (20) can be expected to hold up only for IV, D, e, and u 
sequences drawn from a restricted domain: namely, sequences obey¬ 
ing the probability laws (8), (4), (5), and (2).*" 

However, until Lucas wrote in 1976, evaluating the effects of inter¬ 
ventions in this inappropriate way was the accepted procedure in both 
the macroeconometric and the microeconometric literatures. Re¬ 
grettably, to this day it remains the procedure used in the over¬ 
whelming majority of analyses of policy interventions. It should be 
emphasized once again that from the viewpoint of the dynamic deci¬ 
sion theory de.scribed above, the question of how agents will respond 
to “arbitrary setjuences” of "forcing variables” IV, u, e, and D is not 
well posed. In effect, unless the researcher sjiecihes precisely the 
perceived laws of motion for the “forcing variables," he has not 
specified the constraints subject to which decision makers are thought 
to be acting.^” 

Thus, in order to be able to evaluate interventions operating on the 
S’s, it is necessary to formulate and estimate the model in terms of the 
parameters of preferences {A,,. A,, Ai), technology (/ and d), and the 
constraints (the 8's). The argument in favor of formulating and es¬ 
timating the dynamic model at the level of the deep parameters {/f®, 
A,, A 2 , d, /, S,,,, 6,, 6//} IS in much the same spirit as the usual 

justification lor estimating "striictiira]” parameters rather than 
reduced-form parameters. As Marschak (1953) argued, the re¬ 
searcher wants to estimate tho.se objects which will permit him to 
analyze an interesting class of change.s in the environment. Dynamic 
economic theory has foiced us to reexamine whether objects Jong 
thought to be "structural,” including the parameters of decision rules 
such as consumption, investment, and portfolio balance schedules, 
are correctly taken to be invariant with respect to changes in the 
environment. Once agents' behavior is modeled in terms of genuinely 
dynamic optimization problems, it becomes apparent that the param¬ 
eters of observed decision rules should not be viewed as structural (see 

“ This me-ssage i.s at least implicit in the work bv I-iicas and Prescott (1971). (Gordon 
and Hynes (1970) made the argument in an infoimal way. Lucas (1976) forcefully 
brought the me.s.sagr to the atlenlion of marroeconomists. 

" However, some economists conltniie to argue that existing macroeconomelnc 
models can be used to predict the effects of such arbitrary sequences (see Friedman 
1978). 
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Muth 1961; Lucas and Prescott 1971; Merton 1971; and Lucas 
1972a). 

The Neglect of Learning 

At this point it is worthwhile to discuss a modification of the preceding 
kind of setup which several economists have apparently had in 
mind.'*“ For this purpose it is sufficient to consider the problem of 
maximizing the social welfare criterion subject to the given laws of 
motion (8), (4), (5), and (2) for fV„ u„ D,, and e,. By relabeling and 
reinterpreting the variables, we can think of this as a choice problem 
faced by a single private agent. In posing this problem, it was aiisumed 
that the agent solving the problem knows the true values of the 
parameters of the objective function (17) and the true values of the 
polynomials in the lag operator 8,(L), 6a(L), Sn(L), and 8„.(L). The 
observation has been made that this setup fails to incorporate a model 
of how the agent optimally learns about the S’s from observations on 
past realizations of the forcing variables e, «. D, and W. Presumably, if 
the agent has only finite histories of observations on e, u, D, and IV at 
his disposal, then at each point in time he is uncertain about the 
parameters of the polynomials 8. Why not modify the preceding setup 
to include uncertainty about the 8’s and a model of optimal learning 
about the S’s? There seem to be three reasons why such extensions 
have not as yet successfully been incorporated into rational expecta¬ 
tions models. 

The first is a.s follows. A general model of optimal learning about 
the 6’s i.s readily available in the “Kalman filter,” which can be used to 
model how a rational agent w'ould use observations on (e,, /?,, W ,) to 

revise his prior beliefs about the S’s.'" However, with the S’s uncertain, 
it IS no longer po.ssible to give closed-form formulas for the optimal 
decision rule in terms of what are now the posterior probability 
distributions over the S’s. The reason that no one has yet obtained or 
is likely ever to obtain stich closed formulas is as follows. In deriving 
the closed form of the restrictions (21) for the case in which the S’s are 


“.See Fncdman 1979 and Modigliani 1977. 

See Andervin and Moore 1979. The Kalman filter provides a model ol Bayesian 
learning about the S’s where the initial prior and the posteriors are multivanale normal 
However, as Han.scn points out to me, normal posteriors for the 8’s are inadmissible lor 
dynamic modets of the <la.ss described here. This is because the dynamic optimization 
problems we consider may be ill posed lor points in the parameter space of S’s lor which 
the zeroes of det S<z) arc less than in mcxlulus. Only priors and posteriors that 
a.ssign zero probability to this region of the parameter space arc in general admissible 
for our problems, [ his rules out muluvariate normal distributions. Taking account of 
this admissibility constraint severely complicates the task ol building a model of optimal 
learning alxmt the S’s, 
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assumed known with certainty, the Wiener-Kolmogorov prediction 
formula, 





was used extensively. Here aj£^]+ = S7=o ajL‘, so that [ ]+ 

means “ignore negative powers of L.” The Wiener-Kolmogorov for¬ 
mula is equivalent with the “chain rule” of forecasting (see Shiller 
[1972] or Sargent [1979] for expositions). These equivalent forecast¬ 
ing rules are known to be correct for the case in which the 8’s are 
known with certainty. However, as Chow (1973) has pointed out, 
where there is a nontrivial posterior density over the 8’s, there is in 
general no known closed-form formula such as the above one for the 
i-step-ahead forecast. For example, it is not true that where 8 is 
uncertain, the correct expression for£,W/+( is given by replacing the 
8 u,j’s with their posterior means in the above formula. The fact that 
there is no closed-form prediction formula for sufficiently general 
cases implies that it is impossible to derive closed-form versions of 
decision rules (and hence equilibria) that correspond to (21). As we 
shall see, for the kind of empirical work we are advocating, it is 
important to have a closed form for the mapping from the param¬ 
eters of the objective functions (17) and the dynamic constraints to the 
decision rule (20). From this viewpoint, the suggestion that one ought 
to build a learning mechanism into rational expectations models is not 
useful in suggesting practical econometric alternatives to the proce¬ 
dures recommended here.^^ 

Another drawback with incorporating learning is that, even if one 
could derive the decision rules in the face of uncertain 8’s, the issue 
would arise of how to determine the prior used to initiate the learning 
model for the S’s. Would it be imposed a {priori or estimated? If the 
initial prior were to be estimated, this would substantially complicate 
the estimation problem and add to the number of parameters. 

Finally, in many settings the Bayesian learning model implies that 
the posterior distributions collapse about the true S’s as time passes 
without limit. In such settings, even if the researcher erroneously 
assumes that the S’s are known with certainty when in reality agents 
are learning about them in an optimal way, the researcher continues 
to obtain consistent estimators of the underlying parameters {A„, A,, 


” Further, notice that if tlie decision lulcs could be calculated in closed (orni undei 
uncertainty about the S’s, the resulting tiine-scries models would have time-varying 
cocfficienLs and so be nonstauonary. Even d calculaung the decision rules were a 
tracuble task under uncertainly alxiut the 8's, the loss of' stationarity that it would imply 
might well be a price that the applied economist would not be preiMred to pay even in 
exchange for the "greater realism” ol ihc learning a.ssiiinption. 
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A-i, /, d, H,,., dii, 8„} using the methods described here and in 
Hansen (1979) and Hansen and Sargent (1980w). It does seem likely 
that by erroneously ignoring the phenomenon of learning about the 
6 ’s, the researcher is incorrectly calculating the asymptotic covariance 
matrix of his estimators. However, at present nothing is known about 
the nature of this error. Further, since we simply do not know how to 
compute optimum decision rules under the assumption that agents 
know the 5’s with uncertainty, no consistent estimators of the under¬ 
lying parameters have been proposed that incorporate agents’ learn¬ 
ing about the S’s in the optimal way, to say nothing of expressions for 
the associated asymptotic covariance matrices. 

From the jireceding considerations, I draw the conclusion that 
incorporating optimal Bayesian learning about the S’s on the part of 
agents is not a research avenue that soon promises appreciable divi¬ 
dends for the econoini.st interested in applying dynamic competitive 
models of the sort described here. 


A Model of the “Error Term” 

We now derive a "dynamic supply curve" (or the industry by using the 
industry-wide production function Y, = /A’, + e, to eliminate A' from 
(20) in favor of F. Multiplying both sides of (20) by/'and then adding 
Cm , to Ixith .sides gives 

=/V +/Ti„(L)W,+IH,Al.)D, 

+ IB,(LU, +///„(;.)«, 

+ //,K, + €,+ , 

f.liminaiing 11 , by using u, — P, — A„ gives 

= (H„/-/H„(1).4«J +/W„(Z.)1V, 

+ fllAL)A,d>,AD, 

+ [H, +IH„(I.)AAY, 

+ n AfUADL - /y.A]6„,. 

This can be written as 


where 


Y,^,^S„ + SAL)P, + SMAW, 

+ S„(L)0, + Sy(.L)Yi + ,S',(/.)eHi, 


(24) 
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So = W,/ -y7/„(i)/i„. 
s,iL) =jhal), 

Su,(L) =JJ1UL), ^25) 

S„(L) = l/H„(L) - /W„(L)A,«#,„]. 

Sy{L) = [//, +yH.(L)v4,], 

S,(L) = [1 +fHAL)L - H,L]. 

Recall that the demand curve is 

P, = Ao ~ A,y, + A^D,, + u,. (3) 

Using S,(/.)e, = Uf and 6 „(A)m, = Pf, we can write the supply and 
demand curves as 


+ S„(L)P, + + S„(L)D, 

+ Sy{L)Y, +S.(L)S,(L)-‘VU,. 


(26) 


P,=Ao-A,Y, + A.J)„ + 6„(/.) (27) 

To discuss identification and estimation of the model, we need a 
theory about what is unknown to the econometrician. In constructing 
the model, we have taken the view that all of the variables on the 
right-hand side of the supply and demand curves (24) and (3) (or 
equivalently [26] and [27]) are known to the representative firm. 
Thus, from the viewpoint of private agents, (26) and (27) describe 
exact linear f unctions of the right side variables in which there are no 
"random errors.”’'' The only tractable way that has .so far been dis¬ 
covered of introducing random errors into (26) and (27) has been to 
assume that the econometrician has less information than do the 
private agents. The smaller information set of the econometrician 
leads to what from his point of view are random terms in relationships 
to be derived from (24) and (3) or (26) and (27). The idea is to restrict 
the econometrician’s information set relative to that of private agents 
in a way both that is plausible and that leads to a tractable statistical 
model of the error term. 1 shall de.scribe two models of the error term 
that can be constructed in this way. 

One model results from assuming that the econometrician has time 
series on {P,, W,, D„ Y,} but never observes the random processes e, 
and w,. On this interpretation, and u, become random terms in (24) 


“ Thi.s is a consequence ol the tact that the representative linn views itself as playing 
a dynamic “game agam.st nature." and .so hnds it optimal to use a nonrandom strategy, 
that is. a strategy that ran lie expressed as an exact fiinction ol its information variables 
and other state variables. 
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and (3) from the econometrician’s viewpoint.^^ In constructing the 
model, we have already imposed that VJ+j is orthogonal to all of the 
variables on the right side of (26) and will assume that I'j' is orthog¬ 
onal to y,. We can also impose that F? is orthogonal to D„, if we 
wish,-'* although we might get by with a weaker assumption. 

The second model of the error term results from assuming that the 
econometrician sees less of D, and W, than do private agents. It is 
convenient to postpone a detailed discussion of this second model of 
the error and, instead, first to discuss identification and estimation 
under the first model of the error term. 


Identification and Estimation 

With this model of the error terms, we can proceed to discuss 
identification and estimation. First, notice that every variable that 
appears on the right side of the demand schedule (27) also appears on 
the right side of the supply schedule (26). The dynamic economic 
theory leading to (26) makes the reason for this clear, since any 
variables that help predict future prices P will appear in the supply 
schedule of the representative firm. This immediately implies that any 
variables that help predict the demand shifters D„ will appear in the 
supply schedule.^'’ The fact that no variables on the right side of the 
demand curve (27) are excluded from the supply schedule (26) means 
that, if the supply schedule is to be identified, the source of identifica¬ 
tion must be found in restrictions of a kind different from the usual 
exclusion restrictions treated extensively in econometrics textbooks.^^ 
According to the standard "order condition” for identification, equa- 


" t his IS a version of ilie model of the error term analyzed by Hansen and Sargent 
(19^0)) and Sargent (197H), 

VVe have assumed that f'J and fjf are the "innovations” or one-step-ahead errors in 
predicting t, and u, on the tasis of observations on all variables in agents’ information 
set ai time / — 1 (see n. 6). This implies that is orthogonal to all variables on the 
right side of (26). If we assume that Ff is orthogonal to F;, it also implies that Fy is 
orthogonal to K,. Imposing chat FJ* is orthogonal to/7,, amounts to assuming that//,, is 
sinrtly exogenous in (27), which is stronger than the Cirangcr causality a.ssumptions 
already imposed on /)„, namely, that except for lagged P’s, no other variables in the 
mode! tiraoger cause P„. 

■’* There is a singular class of exceptions to this statement. In the special case that 

EO„„ I {/J.-JA.. = EO,„, I (28) 

for all; ^ 1, D„'s will ap[)car in the demaod schedule hut not in the supply .schedule. 
The condiuon (28) is usually thought to be exceedingly unlikely for any economic time 
senes {//„}. 

These remarks about identification should be compared with Milton Friedman’s 
discussion (1953) ol the conditions needed for “supply” and "demand" to provide a 
useful categorization of the factors impinging on price and output. Friedman argued 
that the categorization was useful to the extent that it effectively sorted forces acting on 
price and output into mutually exclusive categories. 
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tion (26) is hopelessly underidentified.^" Thus if the parameters of the 
model are to be identified, sources of prior information not of the 
exclusion variety must be available. The main source of these restric¬ 
tions in the present model is the extensive body of cross-equation 
restrictions embodied in equations (21) and (25). Equations (21) and 
(25) give the parameters of the supply schedule (26) as nonlinear 
functions of the parameters {/(,, d, n, Sp(L), 8„(L), 

and S,(L)}. In general, provided that the parameters rp and r^, which 
determine the order of the autoregressive processes forZ) and W, and 
the parameters/) and q, the number of elements in the vectors D and 
W, respectively, are large enough, these cross-equation restrictions 
identify or overidentify the parameters of the model. The strength of 
overidentification generally increases with increases in the orders rp 
and and the dimensions p and 

At this point it is useful to collect together the equations comprising 
the model as 


where 


y,., = So + S,(L)P, + .S„.(L)IT, + SpiL)D, 
+ Sy{L)Y, + S,{LmL) 'yuu 

P, = Ao-d,Y,+ A^D„ + SJLr'Y?, 

d^(L)lV, = Pr. 

8 p{L)D, = r{>. 

So = +p\n^Aod~' -yH,(l)Ao, 


Sp(E) = +/^nX/3d-‘ 


L-'[/ -6,(XiB)-’6.(L)] 


S^{L) = 






I - X/3L-‘ 
L-y - 


Sp(L) = +f'hiX^d 'A 241 D 
-Pd'nkfi 


I - X/3L-' 

L V - S),(X^)-'6g(L)] 


1 - X/3/--' 
L-y - 6.(X^)-‘6,(Z.)] 


1 - \pL-' 


(26) 

(27) 

( 8 ) 

(5) 


(29) 




“ The fact that the demand curve excludes some variables that appear in the supply 
schedule is due to the static specification for the demand curve. This feature of the 
model would not survive a variety of alteratitins that might plausibly be used to 
introduce dynamics into the demand curve. E.g., if the demand schedule involved 
expected future prices as arguments, all variables that help to predict future prices 
would appear in the econometrkally operational expression for current P, that would 
correspond to (27). 

*• This characteristic of identification in rational expectations models has been noted 
in various contexts by several authors, including Lucas (1975) and Sims (1980). 
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Sy(L) = 

A.pnkfi I / - g.(A/g):_ 'AC4L| - \L. 

sad = 1 - J 

(1 - (\J3) ‘/.J(l - A/.)= [I ~ (• +/3 ‘ AAj^d-')L + li 'DJ. 

Equations (26). (27). ( 8 ), and (5) form a stati-Stical model for the 
joint process (P„ E,. If'„ D,). The model is linear in the variables but is 
characterized by the extensive set ol cross-equation restrictions de¬ 
scribed by (29). With the model of the error terms currently under 
di.scii.vsion. the statistical model of the (P /, E,. IT/, O/) process has been 
spelled out sufhciently completely that we could write down the likeli¬ 
hood function for a sample E,. IT„ /),)./= 1 . T, assuming a 

normal probabilit) density for (F"’, F", F|\ F').*” Maximum likeli¬ 
hood estimates of the free parameters of the model {A„, A,, A.^, /, d, 
fiiiiD, ^„.(/-). 6 „(Z.), SAL)} could then be obtained. Computational 
details of such procedures are described by Sargent (1977. 1978) and 
Hansen and Sargent (I980 h). From the point of view of computing 
the estimates, it is a great practical advantage that (29) gives a set of 
closed-form formulas for the cross-equation restrictions imposed by 
the dynamic economic theory. 

Application of Bayesian Methods 

The fact that for the present model of the error terms it is possible to 
write down a normal likelihood function means that in principle 
Bayesian methods are applicable. Letting 8 be the list of parameters of 
the model and Z be the data, we have 

or 

I ^} = /(z I o)j„y„A9)/fi(z ) e)f,rue)de, m 

where/p„ 5 i{^lZ} denotes the posterior probability density,/(Z) the 
probability density of Z,j\,j\,„{6} the prior density on B, and 1{7.\B} 
the likelihcHid function. Measures of the location and dispersion of 
the posterior distribution of B can be calculated, for example, by 
integrating | Z } over B for appropriate values of k. In the 

Bayesian view, the role of data analysis is to trace out in as revealing a 
way as possible the mapping defined by (30) from the prior to the 


IMUL-j + x, 


See Hansen and Sargent (1980B) for a discussion of the details. 
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posterior distribution. For snrh u., . i • 

.stantia/Iy eases mafieis if the mapping S)‘ca^ 

J <yp<,».(e I Z )dff can ^ calculated without the need to resort to numeri¬ 
cal mtegration Zellner (1971) and Learner (1978) describe forms of 
prior densities/^pr,„r(«) that have the property that the mapping (30) is 
one that can be written as an analytic closed form when l(Z | 0) is the 
normal likelihood function. 


In the context of dynamic economic models of the class represented 
by (26), (27), (8), and (5), the question of whether the mapping (30) 
can be characterized analytically hinges on which parameters one 
regards as being in the list 0 about which the researcher has formu¬ 
lated prior information. One possibility is that 0 consists of the i’s of 
(26), the /I’s of (27), and 8„., 6„, and 8„ of (27). (8), and (5). With this 
interpretation of 0 , then since (26), (27), (8), and (5) are linear in the 
.V’s, .4’s, 8„,, and S;j, it is possible to get analytic characterizations of the 
mapping from/„ri„r(^) to/p,nt{^ |^)- For example, Learner (1972) and 
Shiller (1972) have shown how priors of various forms on the S's in 
(26) can tractably be mapped into posteriors, in contexts where (26) is 
appropriately viewed as a regression equation. In effect. Learner 
(1972) and Shiller (1973) provided formal Bayesian methods for 
imposing restrictions on lag distributions of a general kind, examples 
of which had long been imposed by applied econometricians. These 
restrictions usually corresponded to restrictions directly on our S/s. 
Predating the work of Shiller and Learner were the restrictions on 
distributed lags proposed by Koyck (1954), Cagan (1956), Milton 
Friedman (1957), Alinon (1965), and Jorgenson (1966). There was 
also the frequently used identifying restriction that various distrib¬ 
uted lag weights sum to unity.'** All of these approaches view the S’s 
themselves as among the free parameters of the model about which 
the researcher can reasonably be imagined to have formed views 
summarized by a prior distribution. 

Unfortunately, the tractability of the Leamer-Shiller approach is 
purchased at the cost of ignoring the essential aspects of the dynamic 
economic theory leading to (26). According to that theory, the S’s arc 
not free parameters but are complicated functions of the parameters 
{Au, A,, A 2 , f, d, jS, n, S„.(L), 8„(L), 8,(/.)}. It is this list of 

parameters about which it seems most appropriate to expect an 
economist to have prior beliefs. The parameters {,4o. A,, A.^, f, d} are 
the parameters describing preferences and the technology, about 
which the economic theorist may have some prior beliefs. The 


*' This rcstnction was cniiiizod by Lucas (1972ft) and Sargent (1971) for essentially 
the same reasons given here. 
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economists’ prior beliefs about the parameters {S„, S^, S„, S*} are 
presumably on a different theoretical footing than his beliefs about 
{do. d,, dj, /, d}, since the former list simply characterizes the serial 
ccjirelation properties of the “shift variables" about which economic 
theory itself suggests little, although casual general observations may 
suggest a presumption in favor of high serial correlation, at least in 
some types of variables. In any event, it is the deep parameters (Aq, 
Ai,A 2 ,f d, /8, n, 5„,, 8^, 5„, 5,} that must be estimated, if one is to build 
a model that potentially overcomes Lucas’s critique of econometric 
policy evaluation prcKedures.'*^ 

When this list ol deep parameters contains the objects of interest, 
Bayesian analysis using (30) becomes much less tractable. This is 
because the likelihood function 1{Z \6) becomes a very complicated 
function of the free parameters in 0 , by virtue of the complicated 
nature of the cro.ss-equation restrictions illustrated in (29). Although 
Bayesian analysis is still possible, the researcher will be forced to use 
numerical methods to characterize the mapping from the prior to the 
posterior given in (30). For example, for a given prior, numerical 
integration will have to be used to calculate the moments of the 
posterior distribution. My own judgment is that given current com¬ 
puter technology, formal Bayesian estimation procedures seem pro¬ 
hibitively expensive for most members of the class of dynamic models 
considered here. This is obviously not an objection to Bayesian 
methods in principle. However, I believe that the high cost attached to 
applying Bayesian methods correctly helps to explain why they have 
not yet been applied extensively to estimating rational expectations 
models. 


A Second Model of the “Error Term” 

More .serious limitations on the domain of Bayesian techniques 
emerge if the researcher embraces a second model of the error term, 
which we now discuss. In the second model of the error term, it is 
assumed that the econometrician possesses only observations on sub¬ 
sets Wf C Wr and D, C D, of the information variables that private 
agents use to forecast future w,’s and It is assumed that these 

“ A p<)ini related to that raised in n. 29 is relevant Kere. Priors and posteriors that 
assign [Jositive probability to points in regions for which zeroes of del 6(z) are less than 
in modulus are inadmissible. This is because under such distributions, for some 
regions in the parameter space with positive probability, the dynamic optimum prob¬ 
lems are not well posed. Taking this into account would substantially complicate the 
analysis since it would involve using mathematically less tractable distributions. 

" This model ol t^^.^aTor term was originally proposed by Shiller (1972) in a related 
but somewhat different context. The model was applied in the present context by 
Hansen and .Sargent (1980a). Nerlove, Orether, and Carvalho (1979) also recommend 
-Shiller’s model of the error term. 
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subsets of information variables follow autoregressive processes 
and 8„(L)W, = where Sd(Q and 6„(f.) are polyno¬ 
mials in the lag operator of order fp and f„, respectively. Then it turns 
out that the equilibrium law of motion for capital, (20), can be written 
in a form identical to (20), except that W, D, Sp, 6„, Hp, and //„ are to 
be replaced by the corresponding objects with tildes above them, and 
that there appears an additional random disturbance i), on the right 
side of (20). The cross-equation restrictions (21) continue to charac¬ 
terize the objects with tildes over them.'*'* The random variable tj, can 
be shown to be orthogonal to all of the current and lagged values of W 
and /5.** However, it turns out that i), is in general serially correlated, 
with serial correlation properties that depend on the joint covariance 
properties of those variables in D, and W, that the econometrician 
does not have observations on. In the context of this setup, it is not 
even possible to write down the likelihood function without specifying 
details of the moments of information variables in D and W that are 
unobservable to the econometrician. It would seem attractive to adopt 
an estimation procedure that avoids the implicit theorizing about the 
stochastic properties of the unobserved D's and W’s that an estimator 
using the likelihood function requires. 

One such estimation strategy that exploits the orthogonality of 17 to 
D and W, without requiring all of the added details required to write 
down a likelihood function, has been developed by Hansen (1979). 
The “generalized method of moments” estimators of Hansen have the 
advantage of delivering estimators of the free parameters whose 
desirable statistical properties do not depend on any arbitrary as¬ 
sumptions about the serial correlation properties of the tj/s.** These 
generalized method of moments estimators were invented precisely to 
handle situations in which the researcher is substantially more 
confident of the orthogonality conditions delivered by his theorizing 
than he is about the serial correlation properties of the error. These 
methods construct statistically consistent estimators, while avoiding 
the need to form the likelihood function. However, in acknowledging 
that he does not have enough information about the disturbances to 
construct the likelihood function, the researcher loses the ability to 
employ Bayesian methods, since knowledge of the likelihood function 
is essential for using Bayes’s law as in (30). 


“See Hansen and .Sarjfeni 1980a. 

“ Ibid. 

“ Under regularity conditions provided by Hansen (1979). the estimators of the 
underlying parameters are shown to be consistent and most efhcienl within a restricted 
class of estimators. Hansen's discussion of the conditions for consistency, which also has 
implications for the conditions for consistency of maximum likelihood estimators, is at 
this date the key reference on issues of suustical consistency in linear rational expecu- 
tions models. 
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In the Lucas-Prescott model of a single industry, state variables which 
help the firm predict the future prices of inputs appear in the repre¬ 
sentative firm's decision rule. The laws of motion of these input prices 
have been taken as given from outside the model. In actuality, the 
prices of these inputs are usually thought to be determined by trades 
in another market, one source of demand for which stems from the 
industry being modeled by Lucas and Prescott. If this other market is 
modeled explicitly, nontrivial modifications also occur in the analysis 
of the original industry. Thus, consider the example of a corn-hog 
model in which jjart of the output of one industry, corn, is an input 
into the production of the other industry, hogs. If the technology is 
such that hog producers have an incentive to forecast future corn 
prices, it follows that state variables that appear in the laws of motion 
for the total output and price of corn will also appear in the decision 
rule of the representative hog producer. Also, because the corn pro¬ 
ducer has an incentive to forecast the price of corn, which depends 
partly on the demand for corn from hog producers, the state variables 
that appear in the laws of motion for total hog output and the price of 
hogs will appear in the optimal decision rule of the representative 
corn pioducer. Hence, each industry inherits the state variables of the 
other. Furthermore, the equilibria in the two industries must be 
defined jointly, since the laws of motion for endogenous market-wide 
state variables are simultaneously determined for the two markets. 
There continues to be a fictitious social planning problem that a 
rational expectations equilibrium .solves, one that is a version of an 
interrelated factor demand problem in which the planner is jointly 
optimi/.ing the performance of both industries.'*^ 

These remarks indicate that the analyst will often face a hard 
practical decision about which dynamics he takes as given from out¬ 
side the model. The internal logic of this class of models tends to 
propel the analyst toward a general equilibrium formulation in which 
the laws of motion characterizing each distinct industry are deter¬ 
mined simultaneously with the laws of motion for all industries with 
which it buys and sells. In any given application, the researcher will 
have to choose what laws of motion he takes as given from outside the 
model, for the jjurposes of the analysis at hand. 

Conclusions 

Remaking dynamic econometric practice so that it is consistent with 
the principle that agents' constraints influence their behavior is a task 

These claims are proved for a particular version of a “corn-hog model" in some 
unpublished notes (Sargent 198(k). 
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that is far from finished. Further, properly allowing for the implica¬ 
tions of the principle will surely require abandoning many presently 
received ways of interpreting data. A variety of setups can be imag¬ 
ined that are consistent with the principle. For example, a variety of 
variations of the setup of this paper can be imagined in which agents 
optimize but have smaller information sets than have been attributed 
to them here. Also, information discrepancies across classes of agents 
can be assumed. In many such cases, endogenous variables such as 
prices will play an important role in conveying information to agents. 
In models with dynamics as complicated as those of our examples, 
these variations introduce substantial analytical difficulties. To date 
there is very little work which investigates the econometric implica¬ 
tions of such complications to setups like ours. With or without these 
complications, building a dynamic econometrics that is consistent with 
our simple principle from economic theory is a challenging task. It 
is sure to require substantial changes in the ways that applied 
economists interpret economic time series. 
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Activist Monetary Policy under Rational 
Expectations 


Peter Howitt 

Vnmersily of Western Ontario 


Tile purpose of this paper is to argue that the pursuit of an activist 
monetary policy may make economic sense even when people’s ex¬ 
pectations are formed rationally. The paper presents a simple model 
in w'hith (I) prices are costly to adjust, (2) there is uncertainly 
concerning the parameters affecting aggregate demand, and (3) 
there are fxisitive costs of gathering and processing information. By 
reference to this model it can be shown that an activist monetary 
polity may or may tiot be useful in offsetting aggregate disturbances, 
depending upon the extent of information costs and of parameter 
uncertainty. 


I. Introduction 

By an ac tivist monetary policy I mean a policy whereby the monetary 
authority permit.s the money supply to react systematically to infor¬ 
mation concerning aggregate disturbances. The proposition com¬ 
monly known as the Sargent-Wallace (hereafter SW) proposition* 
asserts that an activist monetary policy cannot succeed in offsetting 
aggregate disturbances when people’s expectations are formed 
rationally unless the monetary authority possesses information un- 


An earlier version of this paper was presented at the meetings of the American 
Economic AssociaUon, New York. December 1977. I am indebted to Russell Boyer. 
Stanley Fischer, Joel Fried, Herschel Grossman, Robbie Jones, David Laidler, Robert E. 
Lucas, Jr,. Michael Parkin, Arthur Robson, and an anonymous referee for helpful 
comments and criticisms, and to the Social Sciences and Humaniues Research Council 
of Canada for financial support. 

‘ See Sargent and Wallace 1975. The proposition is derived in a more general context 
by Barro (1976). 
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available to private agents. It asserts that in the absence of such supe¬ 
rior information systematic variations in the money supply will be an¬ 
ticipated and thus neutralized even in the short run. 

The SW proposition has been criticized by previous writers, Taylor 
(1975) pointed out that it will take private agents time before they 
learn the exact nature of the monetary policy.* During this transition 
period the SW proposition is invalid. Poole (1976), Fischer (1977), and 
Phelps and Taylor (1977) have pointed out that even if private agents 
pos.sess the same information as the monetary authority they may not 
be able to act upon it immediately because of contractual precommit¬ 
ments, in which case the SW proposition is invalid. It has also been 
widely acknowledged that if the arguments underlying the SW prop¬ 
osition were valid it would be difficult to explain the observed persis¬ 
tence of movements in output and employment.'’ 

The present paper focuses on another objection to the SW proposi¬ 
tion; namely, that it fails to take into account (1) the costs of gathering 
and processing information and (2) uncertainty concerning the 
structure of the economic system. People may choose not to collect 
and process the monetary authority’s information concerning aggre¬ 
gate disturbances when it is costly to do .so.'* Indeed, everyday obser¬ 
vation suggests that most people pay no attention to such information. 
In such circumstances people may still be forming their expectations 
rationally, in the sense that they are making efficient use of (costly) 
information, yet they are not in a position to anticipate and neutralize 
the effects of an activist monetary policy. In order for the SW propo¬ 
sition to be true the monetary authority’s information must not only 
be available to private agents it must also be economically usable by 
private agents. 

This qualification to the SW proposition has been noted by Barro 
(1976), wtio argued nevertheless that the monetary authority could do 
just as well by disseminating its information as it could by varying the 
money supply. However, this paper attempts to show that such a 
disseminating policy may be inferior to an activist policy when there is 
uncertainty concerning the values of parameters in the economic 
system because (1) people may not have enough incentive to process 
information, and (2) even if people do process the information there 
is no guarantee that this will produce the same outcome as would a 
properly designed monetary policy. A demonstration of this argu- 


’ A simitar point was made by Benjamin Friedman (1979), who showed the similarity 
of rational expectations to error learning during the learning process. 

'' See, however, Lucas 1975 and Sargent 1976. 

* This [roint has been strc5.sed by Rtidedge (1974), Frenkel (1975), Feige and Pearce 
(1976), Laidler (1978), Shiller (1978), and Friedman (1979). 
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ment can be made precise only in terms of a formal model, but it can 
be sketched in the following terms. 

First, even if the information concerning aggregate disturbances is 
provided to people they must still incur the cost of processing it if they 
are to use it. But they may not find it economical to incur that cost if 
they think that not many other people will be using the information, 
even when its social value exceeds the processing cost. Consider, for 
example, an agent who is deciding what price to charge for the single 
good that he sells. Suppose he is told that the demand for money has 
doubled. If he thinks that all other agents are using this information 
he can calculate that his best course of action will be to cut his price in 
half, along with everyone else. The information may be quite valuable 
to him and to society because failure to conform in reducing his price 
would leave him charging a suboptimal relative price. But if he thinks 
that no other agent is using the information his best course of action 
will depend on the effects of that change in the demand for money 
on the demand for his product. If this effect is difficult to estimate 
accurately then the information may have little value to him. A con¬ 
vention like universal indexing that leads people to believe that 
everyone else will be using the information as well may be necessary 
for the private value of such information to be as great as its social 
value. Everyday observation again suggests that such a convention 
rarely exists. 

Second, in a world without complete Arrow-Debreu contingent- 
claim markets there is no guarantee that even an otherwise Irictionless 
competitive equilibrium will be efficient with respect to the informa¬ 
tion possessed by private agents. It is possible that the monetary 
authority is in a position to make better use of aggregate information 
than are private agents, even if the above-mentioned indexing con¬ 
vention could be established. In the example of the previous para¬ 
graph, Brainard’s (1967) analysis suggests that if the values of the 
parameters of the economic system are not known with certainty then 
the optimal reaction of the monetary authority is to allow the money 
supply to increase, but by less than 100 percent. But if the informa¬ 
tion is disseminated prices may indeed fall in half, which would be 
equivalent to a 100 percent increase in the money supply. As in the 
theory of the second best, full information results in the best equilib¬ 
rium, but when information is incomplete then more information 
does not necessarily result in a better equilibrium.* 


“This result is similar to those of Hirshleiler (1971) and Grossman and Sughu 
(1976), who argued that the equilibrium allocations with more information may be 
Pareto inferior to those with less. 
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The remainder of the paper attempts to make these arguments 
more precise by developing a simple model of a monetary economy in 
which prices are costly to adjust. Section II describes the basic features 
of the model; Section III analyzes the private decision whether or not 
to use information; Section IV characterizes the optimal policy of the 
monetary authority showing how an activist policy may be justified by 
the first of the above arguments; Section V considers a variant of the 
model in which the second of the above arguments may be used to 
justify an activist policy; and Section VI contains some concluding 
remarks. 


II. The Basic Model 


Preliminarie\ 

This section describes a simple hypothetical economy possessing the 
features described above. In this economy there are two tradable 
objects: money and a good. There are also two classes of economic 
actors; a monetary authority, whose Job it is to determine how much 
money will exist and to distribute the money, and a set of private 
agents (referred to simply as “agents”), who produce and consume the 
good. In the interest of simplicity we suppose that each agent pos¬ 
sesses identical tastes, technology, and initial endowments. (Thus we 
are making no distinction between firms and households.) In order to 
establish a basis for exchange among these identical agents we assume 
that each of them has an absolute aversion to consuming units of the 
good that he himself has produced. Thus he is induced to sell all of 
these units and to purchase for consumption units of the good that 
are produced by others. 

To keep the demand and supply functions simple we suppose that 
each agent evaluates his demands according to a Stone-Geary utility 
function and produces subject to a quadratic cost function. That is, 
each agent’s utility during a period depends upon q, the amount of the 
good consumed during the period, m, the real value of money bal¬ 
ances held at the end of the period, and q, the amount of the good 
produced during the period, according to the particular function 

U = Kq"{m + x)'-« - {kl2)q^ (1) 

where /(; is a positive, nonstochastic parameter equal to the slope of the 
agent’s marginal cost schedule, a and x are independent stochastic 
parameters with 0 < a < 1, and K is a stochastic parameter defined as 
K = a "(1 - fl) “ For reasons that will become apparent later we 
shall refer to the parameter x as “the level of aggregate demand.” 
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In order for purely monetary changes potentially to have real 
effects even in the short run we must not assume that there exists a 
central Walrasian auctioneer who is able each period to compute a 
single market-clearing price at which all trades are to occur. The 
actual transaction prices must be determined before the agents have 
collected enough information for them to be able to compute this 
Walrasian equilibrium price. We accomplish this by assuming that 
each agent is responsible for setting his own selling price. He must set 
the price before learning exactly how much he will be able to sell, and 
all trades must occur at these posted prices whether or not markets 
are clearing in the usual sense.* Assume also that the quantities trans¬ 
acted at these prices always equal the quantities demanded. In other 
words, when markets are not clearing the sellers bear all the resulting 
quantity rationing.^ 

At the time of the price-setting decision each agent’s expected 
utility will depend negatively on the variance of his error in forecast¬ 
ing demand, because he must produce subject to increasing marginal 
cost. Thus he will have an incentive before selling his price to gather 
and process information concerning the demand f unction facing him 
so as to reduce the variance of this forecast error. The decision on 
what information to use at this stage will be the main focus of the 
following analysis. 

When determining the value of M, the nominal per capita stock of 
money, the monetary authority is a.ssumed to be unaware of the exact 
state of aggregate demand. Suppose, however, that he is able to 
acquire partial information by observing a variable, s, that is corre¬ 
lated with aggregate demand. In particular, assume that 

X = cs (2) 


' This setup mav be contrasted with l.u< as's (1972), in which the agents arc randomly 
divided into subgroups, each with a miniature Walrasian auctioneer, and witli no 
communication between groups during the market period. Both setups require agents 
to make trading commitments before being completely informed of other agents' 
dispositions to trade. Lucas's approach has the advantage that market-clearing models 
have been more thoroughly analyzed in the literature than price-setting models. Ours 
has the advantage of replacing the Walrasian auctioneer with a scheme that bears some 
resemblance to the way prices actually get determined in many real-world markets. 
Whether our results could be transposed to a suitably constructed “markeKlearing" 
model is an open question. 

’ This rule also aiqicars to be compatible with the way transaction quantities are 
determined in many real-world markets, esjiecially those organized by specialist 
traders—wholesalers, retailers, etc.—who post prices and hold inventories which they 
allow to act as buffer slocks against fluriuali<Mis in demand. By absorbing the nonprice 
rationing they provide a service to customers, for which they arc rewarded by a spread 
between their buying and selling prices. In effect we are assuming that each agent is a 
combination consumer-producer-trader. (However, we are ignoring the potentially 
important role of inventories by assuming that the good is not storable.) 
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where r is a stochastic parameter. We shall refer to.? as the "indicator” 
of monetary |K)licy. I'he monetary authority is assumed to obey the 
following rule:’* 

M=M(l-gs) (3) 

where M (> 0) is the average per capita slock of money and g is the 
"policy reaction coefficient.” Thus the monetary authority follows a 
"feedback.” rule of reacting to observable information concerning the 
level of aggregate demand, and its two ciecision variables are M andg. 
The .S\V proposition asserts that the stochastic behavior of aggregate 
output in the economy will be independent of the monetary author¬ 
ity’s cliOK e of g. 

r/ic of Aiin'ilie'< 

Dining each petiod activities proceed according to the fcillowing 
sequence. Fiist, there is a stage in which the monetary authority 
decides on his (lolicy. Tliat is, he chooses values for g and M. When 
making this decision he is assumed to po.s.ses,s an accurate model of the 
economy, including exact knowledge of all deterministic variables, 
parameters, and functions, and the probability distributions of all 
stochastic variables. He does not, however, know the exact value that 
will be assumed during the period by any stochastic variable. As soon 
as he makes his decision ail of his information is conveyed to each 
private agent, including the chosen value.s ol g and M. Thus there is 
no scofie for the monetary authority to fool private agents, even in the 
shoit run, afioiit the nature of his |)olicy, or to make this polity 
decision using inloi inaiion that is not available to private agents. 

Second, there is a stage in which private agents make their 
monitoring decision. That is, each one must choose whether to pay 
tlie cost ol leaiiiing the exact value of v hetore setting his price, in 
whitli case he will be able to reduce the variance of his forecast error, 
or to wait until after his price has been set, when the value of .1 will be 
revealed to him at no cost. In the interest ol simplicity we assume that 
the indicatoi is the only stochastic variable whose value is potentially 
observable before the jirice-setting decision is made. 

Third, there is a stage in which the exact value ol the indicator v is 
determined. Any agent who has decided to monitor observes v and 
pays the cost at this stage, as docs the monetary authority (unless he 
has set g = 0). Note that once again we are not giving the monetary 
authoiity any informational advantage. At this stage, before he 


" A [nirply random, iinpiediciablv toin|><)ncnt could be added lo (.S) but, a.s Barro 
(1976) lias al.so shown, the optimal policy, if'feasible, would lie to sec its variance to rero. 
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chooses his price anyone can choose to be as well informed as the 
monetary authority. 

The fourth stage is the one in which the values of all prices and of 
the money supply are decided. The money supply is determined 
automatically by equation (3), and the private agents choose their 
prices in such a way as to maximize expected utility. If they have 
decided to monitor the indicator (to learn the value of .t), then private 
agents can forecast the exact value of M by using equation (3) and can 
use this knowledge in making their price decision. Otherwise they 
must make their decisions on the basis of expectations concerning M, 
which they form using (3) and their knowledge of the probability 
distribution of s. When setting his own price no agent is able to 
observe the others’ prices. However, he is able to tell whether or not 
the others are monitoring the indicator, and this will be enough in 
most cases to allow him to predict exactly what the others’ prices will 
be. 

The fifth stage is the one in which all uncertainty is resolved. In this 
stage the monetary authority distributes money to the private agents 
in equal amounts, and the exact values of a,x,K,c, and .i are revealed 
to everyone. Whatever values of the utility parameters o, x, and K are 
determined in this stage, they are shared by all agents in common. 
Thus, with this information each agent is able (i) to calculate exactly 
how much his sales will be for the period and (ii) to compute his own 
demands for money and the good. 

In the sixth and final stage the agents visit one another to execute 
their planned purchases, and each agent must produce to meet the 
demands placed upon him. Up until this stage each of the identical 
agents will have been led to make identical decisions. Thus every 
agent will be charging the same price. Assume that in such a situation 
the quantity demanded from each agent will be identical.® 

To understand what happens in this sequence we must start at the 
end and work back. Consider an agent who, during the sixth stage, is 
going to be called upon to produce the quantity q. Suppose that all 
other agents have set the same price, P, and that he himself has set the 
price rP. Then during the fifth stage he will be aware of the value of q 
and of the exact values of all the parameters in his utility function (eq. 
1). He will formulate his own demands,f and m, so as to maximize this 
Utility function subject to the budget constraint 

m + q = m + rq (4) 

* Alternatively, we could suppose that they all face a stochastic demand tuncUon with 
independent, identical probability distributions, the randomness represenung the in¬ 
determinacy in the distribution of sales among identical sellers charging identical 
prices. However, the addition of one mure random factor would add nothing but 
notational complexity to the analysis. 



JOURNAL OF POLITICAL ECONOMY 

where m = M/P denotes his initial holding of real balances.*“ Thus his 
demand for the good will be 

q = «(m + rq + x), (5) 

and by substituting from (4) and (5) into (1) we can see that the level of 
utility that he will obtain is determined by the indirect utility function 

r(m,q,x,r) = m + rq + x — (k/2)q'‘. (6) 

During the fifth stage each agent will also be able to predict his 
sales, q, in the following way. First, suppose that every agent is charg¬ 
ing the same price, P. Then every agent will sell the same quantity, q, 
and will formulate demands according to (5) with r = 1. But every 
agent will also realize that his sales, q, will be equal to the per capita 
demand, q. Thus the values of ly and q can be computed by substitut¬ 
ing q = q into (5), with r = 1, to produce 

q = b(m + x) (7) 

with 


h = 


I — a 


> 0 . 


( 8 ) 


Equation (7) indicates the amount that each agent would demand in 
stage 5, as well as the amount that he would expect to sell, under the 
assumption that r = 1. And indeed the equilibrium will be one in 
which r = 1. But to see how prices are chosen we must indicate how 
much each agent would expect to sell if r 1. Let us suppose that this 
quantity is given by the expression 

q/m, X, h, r) = max [b{jn -I- x) + r){l — r), 0) (9) 

where n > 0 is a measure of the “conjectured competitiveness” in the 
economy. In other words, it measures the extent to which each agent 
supposes that he would lose sales by raising his price above the market 
price. The limiting case in which n = -l-» is the one usually associated 
with the assumption of perfect competition. 


'* It has been argued that even in markets where demanders choose quantities at 
predetermined prices the demand decisions must take into account the future relauon- 
ships between trading partners, as well as the usual objectives, in which case the 
qtianttues chosen may be on neither the demand nor the supply curve as usually 
conceived (Barro 1977). This consideration is probably important in markets where 
exchange is conducted on a highly personal basis because of the costs of switching 
trading partners, as in labor markets, markets for personal credit, and so forth. 
However, there are also many markets where exchange is quite impersonal and buyers 
may. because they can remain anonymous to the sellers, choose their quantities without 
taking this consideration into account. When we assume that demanders choose their 
quantities according to the usual sort of utility maximization we are implicitly assuming 
this sort of impersonal arrangement. 
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We now have introduced a total of seven random variables: b, c,s,x, 
a, K, and Af. Let us suppose that the probability distributions of the 
first thiee of these are mutually independent and that 

> 0;E(c) = y > 0;£:(s) = 0 
var (b) = o-J > 0 ; var (c) = tr? > 0 ; var (i) = > 0 . 


At the outset of the fourth stage each agent realizes that if he sets 
the price rP his utility will equal 


Af .1 

^ (1 - + rq 


^ (1 - gs),cs,b,r 


(y)? yd - gs),cs,b,r 


( 11 ) 


(This is obtained by substituting from [2], (3], and [9] into [ 6 ].) He will 
know that the stochastic variables 6 , c, and.t obey ( 10 ), and he may or 
may not know the exact value of .s, depending on whether or not he 
has chosen to monitor s in the second stage. Suppose that in stage 2 he 
has made the same monitoring decision as every other agent. Then he 
will be able to calculate P, the price set by all other agents. He will also 
know the values of Af, and k. Thus he will choose a relative price so 
as to maximize the expected value of ( 11 ) conditional on the informa¬ 
tion set /, where / = {.?} if he has monitored and 1 = 0 otherwise. The 
first-order condition of this maximization (as.sume ihdil q[m, x, h,r] > 0 
lu.pr.l) can be expressed as 

{A(Af (1 - gO/f’ + <■■»] ■•■ ” (1 " '■) I f(12) 

where 

q^ = n/(l + nk). (13) 

Let the superset ipU denote the information set, / = 1 denoting / = {.?} 
and i = 2 denoting / = 0, Then the solution to (12) can be written as 
the relative-price function 

r = P„(P,M,g,s),i = i,2. (14) 

In order to tell what price to set each agent must also calculate P, the 
price that the other agents each will set. To do this he calculates the 
value ofP that makes the value ofr in (14) equal to unity. That is, he 
expects the price to be one such that every agent (including himselO >s 
induced to conform. This will also be the actual price set by all agents 
in stage 4. It can be expressed as 

P =P‘(M,g,.v),i = 1,2, (15) 

and, from ( 12 ), it must satisfy the condition 

£(^ |/) = £{/»[Af(l - gs)/P -I- f,s] I / } = q^. 


( 16 ) 
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In the limiting case of perfect competition, (16) states that the market 
price will be set so that the expected market demand (per capita) 
equals the competitive market supply (per capita), ^ ^ lim g„ = 1/A. In 

T ' • 1 1 • I f W— 

Other words, there is expected market clearing.' 

Let m„ = g„//3. From (10), (15), and (16) the prices can be expressed 
as 

= 'W(l - gsVirhn - ys), (17) 

PUM,g.s)^MIrh„. (18) 

Thus the quantity of output that will be produced per person in stage 
6 can he expressed as a function of the information set, the reaction 
coefficient g, and the random variables A, c, and.« by substituting from 
(2), (d), (17), and (18) in (7) to obtain 

q'n = + if - y)s], (19) 

ql = - gs) + «]. (20) 

I'he essence of the SW proposition is contained in (19). If agents 
monitor i (hen the behavior of the quantity q is independent of the 
reaction coefficient g. On the other hand, (20) shows that 9 is affected 
by g if no monitoring occurs. 

From (10), (19), and (20) the mean and variance of output per 
person, as seen at the beginning of stage 3 (i.e., before the exact value 
of s has been ascertained), can be expressed as 

E( 9 ‘) = £(,> (.V) = E(gl) = ^ £( 95 1 0. (21) 

var q], = (jlml + + cr^)cr?, ( 22 ) 

\di ql = (Tlml + + crj)((r?. + (y - gmj^]. (23) 

From ( 6 ) and (17)-(21) the utility that each agent expects at the end of 
stage 2 may be expressed as 

= EV{MIP^„,q'„,x, 1) = - (kl2)ql - (A/2) var^J,; i = 1,2. 

(24) 

It follows from (22)-(24) that the gain to society from everyone 
monitoring the indicator (i.e., - Vl[g'\) arises solely from the 

consequent reduction in the variance of output. This reduction in 
variance yields a welfare gain because the marginal utility of more 
production is (by assumption) a constant but the marginal cost is 
increasing, so that total utility is a concave function of the level of 
output. 

" An almost identical condition was assumed to hold by Fischer (1977, pp. 195-97) 
and by Phelps and Taylor (1977, pp. 166-70). 
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III. The Monitoring Decision 

Equations (17)-(24) show how each agent’s price, output, and ex¬ 
pected utility will depend on the monitoring decision made in stage 2 
and the reaction coefficient chosen in stage 1. This section analyzes 
the monitoring decision. 

Zm (> 0) denote the cost, in utils, of monitoring the indicator. 
This cost will generally equal + Zp where z^ is the cost of collecting 
the information (i.e., of determining the exact value of s) and Zp is the 
cost of prtKessing the information (i.e., of computing and posting the 
optimal price given the value of s). Each agent will choose to monitor 
if the gain to him in expected utility is at least as great as z„. 

The size of this gain will depend on whether or not the other agents 
are monitoring. Consider first the case in which none of the other 
agents is monitoring. In this case the agent will derive the expected 
utility Vl{g), as determined by (24), if he too chooses not to monitor. 
But if he chooses to monitor, the market price will continue to be PI, 
as in (18), and he can now profit from knowing the exact value of the 
indicator before setting his own price. Because he can infer, as before, 
the exact value of P before determining his relative price, that relative 
price will be the one that satisfies the first-order condition (12) with / 
= {,?} and P = Pl(M, g, s). That is, 

r'n{Pl,M,g,s) = 1 -I- s^(y - gm„)/(q„ + n ). (25) 

Let P^nig) denote the expected utility of the agent who monitors when 
no one else is monitoring. This can be obtained by substituting from 
(25) into (6) and usings = ^ ql + n(l - r) with ql given by (20). As a 

result of this calculation it can be seen that the gain in expected utility 
from deciding to monitor when others are not monitoring is 

LJ(g) - Plig) = 4>(n)<AP^{y - grhnf (26) 

where d>(n) = (1 + nkf/2n{2 + nk).'^ Note that 

<t>(n) > (k/2) and <t>(n) -» (ft/2) as n -*• ». (27) 

The nature of this gain is easy to see. The expected output of the 
agent is not affected by the decision to monitor but its variance is, and, 
as we have argued, a reduction in variance increases expected utility. 
Specifically, whenever the level of aggregate demand is high (i > 0) 
the monitoring agent will, according to (25), raise his price, thereby 
damping the increa.se in his sales (assuming thatg < ymp'). Likewise, 
when s < 0 the monitoring agent will bolster his sales by reducing r. 
Not only does this reduce variability it also adds to the agent’s ex- 


“ This result is also derived in the Apjrendix. 
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pected revenue. In the limiting case of perfect competition. (25) 
implies that these changes in r will be imperceptible. In this case the 
agent is able to adjust the mean position of his random sales .schedule 
whenever a nonzero value of .t is observed so that the expected value 
of his sales conditional upon s is equal to q,'^ and the entire gain to the 
agent from deciding to monitor arises from the resulting reduction in 
the variance of sales. In particular, (26) and (27) imply that the gain is 
equal to (A/2) times this reduction in variance.''' 

Next, consider the case in which all of the other firms are monitor¬ 
ing. In this case the agent will derive the expected utility f'i(g) as 
determined by (24) if he too chooses to monitor. But if he chooses not 
to monitor then the price will continue to be P'n as in (17), and the 
agent will lose by not being able to foresee, before setting his own 
price, the variations in ,f and in P), that his competitors are taking into 
account. The nonmonitoring agent will choose a price P so as to 
maximize I + n(l - PIP\),x, P/P'„]. Let VJ(/f) denote the 

expected utility of the agent who does not monitor when others are 
monitoring. Then, as the Appendix demonstrates, the gain in ex¬ 
pected utility (rom deciding to monitor when others are monitoring is 

L;,(g) - 17.0?) = 0(«) (28) 

I T* V dl r;i 

where c„ = P'„(M\ g, s)''[EP'„(M\ g, s)“']"' is the reciprocal of the 
market price, exjiressed as a latio to its mean value, and where ip(n) = 
n + n^k/2. Note that 

./)(«)—> X as n—» 30. (29) 

If e\eryone is to monitor then the gain from deciding to monitor 
when everyone else is monitoring must be at least as great as the 
monitoring cost; that is, 

V^i(g) - g z™. (30) 

Likewise, if no one is to monitor, it is necessary that 

( 31 ) 

From (26) and (28) it follows that for large enough n at least one of 
these conditions must be satisfied. But it also follows that both of them 

The limiting case of jjerfecl competition is denoted by the absence ol the subscript 
n on the lanables t'l,. g,. etc. 

Notice that we arc not allowing a monitoring agent to gam by selling Ins inlorma- 
lion. Allowing a market for information would complicate the analysis without affecting 
our main result. For in the case that we focus on below where an activist policy 
dominates all otheis the private value of information is less than the cost ol processing 
It. In this tase no one would willingly pay the monitoring agent anything for his 
information. 
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may be satisfied. In other words, convention appears to play an 
important role in the monitoring decision. It may be that everyone 
will monitor if they all expect everyone else to, but no one will monitor 
if they all expect no one else to.'® 1 shall assume that monitoring will 
occur if and only if (31) is violated—in other words, that the conven¬ 
tion of monitoring will develop only when the convention of non¬ 
monitoring is unsustainable.'" Notice, however, that when the reac¬ 
tion coefficient assumes the value 

in = yl^n (32) 

then the convention of monitoring could never be sustained as long as 
z„ > 0, because from (17), (26), and (28), 

l^'niin) - yidn) = l^l(gn) “ n(gn) = 0. (33) 

In order to simplify the analysis further, let us assume the limiting 
case of perfect competition. Then, from (2), (26), (27), (31), and (32) 
the necessary and sufficient condition for monitoring to occur is 

(k/2)K.aifiHl - g/gV > z„ (34) 

where = y^Ky'^ + (r'i) is a measure of the accuracy of the indicator, 
being the square of the coefficient of correlation between x and the 
conditional forecast ys ofx based on the indicator, = var x is the 
variance in aggregate demand, andg = yfik, the limiting value of (32). 

According to (34), the likelihood of monitoring occurring is larger 
(i) the larger is the slope, k, of the typical producer’s marginal cost 
schedule, because the larger this slope the more the firm loses from 
variability in its demand that could be avoided by monitoring; (li) the 


In Mith (ases (he private value ol the inlormation ronlamcd in ,s i-i larger when 
everyone else has the inlormation than when no one else has it. This result may appear 
somewhat counterintuitive, but it arises from the following considerations. In stage 4 
the agent who knows s may gam because this allows him to vary his price in anticipation 
of variations in (a) the variable x which is correlated with and (h) his competitors’ 
pnee, which may be predicted once i is known. When n is large, then b will be the 
dominant factor if others are observing.!, because variations in his competitors' absolute 
selling price imply variations m his own relative .selling price, and thus large variations 
in Ills sales, il he is unable to anticipate them. But if no one else is observing s then h no 
longer applies because he does not need to observe.! in order to anticipate his competi¬ 
tors’ price. More generally, the result arises Ijecause the main use of the information is 
to allow the agent to coordinate his activities (specifically the activity ol setting his 
nominal selling price) with the activities of other agents. As an analogy, the value of 
knowing that you should drive on the left in England would be much less if none of the 
other drivers knew it. This analogy also illustrates why multiple equilibria that must be 
selected by convention can arise under such circumstances. _ 

“ Thus we may say that an equilibnum with respe« to any (g, Af. n) consists of a 
monitoring decision i E (1,2i_and a price function Ti,(Af.g,.0 such^hat (a) for anvi the 
optimal relative price r',{P, M. g, .() equals unity when P = P'.iM, g. (), and (A) the 
monitoring decision is i = 1 only if E;(g) - KJ(g) a and i = 2 if and only if EJlg) - 
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more accurate is the indicator, because an inaccurate indicator con¬ 
veys little useful information; (iii) the more variable is the level of 
aggregate demand, because there is little gain in monitoring a variable 
that is relatively constant; (iv) the larger is jS, because there is little gain 
to monitoring a variable if its expected effect on demand is small; and 
(v) the further is the reaction coefficient from g, because wheng = ^ 
then, as can be inferred from comparing (22) with (23), the monetary 
authority is already accomplishing everything that private agents 
could accomplish by monitoring. 

IV. Optimal Monetary Policy and the Incentive to Monitor 

Three implications of this analysis are worth noting here. First, the 
SW jiroposition is valid for all values of g satisfying (34). For in this 
case monitoring will occur, and, as noted before, the behavior of 
outjiut IS unaffected by the value ofg. According to (22) and (24) each 
agent’s expected utility will also be unaffected by the value of g. 

Second, whatever can be accomplished by monitoring can be ac¬ 
complished just as well by having the monetary authority set g = 
.Xccording to (34) no monitoring would occur, but according to (19), 
(20), and (24) the behavior of output and expected utility would be 
the same as if monitoring was occurring. 

Third, as was argued in the introduction above, private agents may 
not have much incentive to monitor.t even when the social gains from 
everyone deciding to monitor are large. That is, the private value of 
monitoring, F'’(g) — F'^fg), may be small when the social value, F’(g) - 
1 ‘(g), is large. In particular, it follows from (22)-(24) and (26) that 

[F'’(g) - F^(g)] = XjF>(g) - F^(g)], (35) 

where = /3^/(y3^ + crj) is a measure of the predictability of the effect 
upon effective demand of changes in *, being the square of the 
coefficient of correlation between the actual effect bx and the pre¬ 
dicted ef fect /3x. If the value of the coefficient h is known with cer¬ 
tainty, then the private and social values coincide. In the limiting case 
of pure uncertainty (when (jS/ooP —» 0 ) the private value is infinitesi¬ 
mal in comparison with the social value. 

The reason for this divergence of private from social value is seen 
most easily in the case of a perfectly reliable indicator (er? = 0 ) and a 
neutral monetary policy (g = 0). If everyone monitors then the price 
level will adjust to fluctuations in * so that the sum m -t- x is constant 
and equal to m, with output equal to 9 ' = frm. Thus everyone 
monitoring together will completely eliminate the effects of * upon 
output. An isolated agent monitoring when no one else is cannot stop 
m + X = m + X from fluctuating. He can, and will, through impercep- 
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tible changes in r alter the mean position of his demand curve so as to 
offset the predicted effect fix, but there will still remain an unpre¬ 
dicted effect (b — P)x. Specifically, by substituting (25) into (9) we see 
that his output will be = bm + (b — p)x. Thus the social gain to 
monitoring comes from eliminating the total effect bx of changes in 
aggregate demand, whereas the private gain comes from eliminating 
only the predicted effect fix. The ratio X|, of private to social gain is 
just the ratio of variances of the predicted and total effects. 

Consider three different monetary policies, (i) Policy N is the neu¬ 
tral policy, g = 0 , which is assumed to result in no administrative cost, 
(ii) Policy D is the disseminating policy suggested by Barro of settings 
= 0 but also publicizing the value of i, which is assumed to result in 
the administrative costz^ per person, (iii) Policy A is the activist policy 
of setting g = g, resulting in the administrative cost 2 „. Assume that 
the cost la is incurred whenever any nonzero value of g is chosen. 
Then policy A will dominate any other p>olicy with g ^ 0, and we 
may restrict our attention to policies A, D, and N. Which policy is 
optimal will depend upon the relationships between the costs and 
benefits of monitoring and the administrative costs of the different 
policies. Let 

F* ^ y‘( 0 ) - F*(0) = (k/2)Kr var (bx) (36) 

denote the social gain to monitoring and F” = F®(0) - F*(0) = 
denote the private gain to monitoring, wheng = 0. Then according to 
(34) monitoring will never occur under policy A and will not occur 
under policy N if X^V’ g t,. + Zp. Policy D is assumed to save the private 
agents any collecting costs, but if they are to use their knowledge of 
the value of v they must still incur the processing cost. Thus monitor¬ 
ing will not occur under policy D if X^V* S Zp. Table 1 depicts the cost 
per person of the different policies in the three situations indicated, 
where the cost is measured as F‘( 0 ) minus the expected utility result¬ 
ing from the policy plus the monitoring and administrative costs. In 
each situation the optimal policy is the one with the least cost. 


TABLE 1 

Cost per Person of Monetary Pol.icy 



Policy 


Situation 

N 

D 

A 

I. z, + 1 , < XjV' 

+ z. 

Zj + I, 

2. 

11. Z, < A»V' S ir 

+ Zp v« 



111. X,V Sz, 

V ’ 

z< F V' 



Noth,— N - rwuirjl policy. I) - dUwmiiuung polKy. A =■ acvviu policy 
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Note that if the indicator is unreliable enough (i.e., kr small enough) 
or if effective demand is stable enough (var [bx] small enough), ceteris 
parthus, then a neutral policy will be optimal because both of these 
factors make V‘ small. Both of these factors also figure prominently in 
Milton Friedman’s (195.S, 1968) case against activist policy. For the 
same reason, a small slope to marginal cost (small k) favors a neutral 
policy. Note also that in situation III the disseminating policy is 
dominated by the neutral policy. 

For our purposes the most important implication to note about 
table 1 is that an activist policy will be uniquely optimal whenever 2« < 
F* g that is, if (i) the indicator is reliable enough, effective 

demand is unstable enough, or marginal cost rises fast enough that V' 
exceeds the cost of administering an activist policy; and (ii) the cost of 
processing the information contained in the indicator is large enough 
or the predictability of the effect of changes in the level of aggregate 
demand is small enough that V* S Zp/Xj,. These conditions do not 
require the monetary authority to possess any cost advantage. If X* is 
very small, they may hold even if the cost of administering an activist 
policy is much larger than the private cost of monitoring. If the 
conditions hold then an activist policy is optimal because despite the 
large social gain to monitoring no one will do it even if the cost of 
collecting information is saved. For even under a disseminating policy 
the user of the information contained in the indicator must incur a 
processing cost, and, as we have seen, he will be led to underestimate 
the gains from using the information. 


V. A Variant of the Basic Model; Monetary Policy and the 
Efficiency of Monitoring'^ 

One special feature of the ba.sic model employed above is that the 
term.s m and x enter into the aggregate demand function (5) with the 
same coefficient. This is why the optimal reaction coefficient g always 
makes the expected level of output (conditional on s) equal to its 
optimal value, q (see [20]). When m and x have different random 
coefficients then Brainard’s (1967) analysis suggests that the optimal 
value of the reaction coefficient will generally be smaller than this. 
Such a case is easily constructed by replacing the utility function (1) 
with the variant 

U = K(q - x)"w'-« - (kl2)q'‘ + 2x, (37) 

in which case the indirect utility function is still given by (6), but the 


The rationalization for activist policy presented in this section is similar to the one 
presented by Fane (1977). 
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effective demand function changes from (7) to 

q = hm + X. (38) 

As in (19) and (20), the value of output with or without monitoring 
can be expressed as 

9'n =*(»«» - y«//3) + cs (39) 

q'i = nil - gi) + CS. (40) 

Once again the expected value of output is 

E(qi,)=Eiq'n\s)=E(ql) = qn (41) 

and the variance of output is 

var q'n = alml + (rliicrf. + y^crDI^^] (42) 

var^J = a-lml + (r'ila-^ + iy - + igmnfcrl]. (43) 

Let us again restrict our attention to the limiting case of perfect 
competition. As before, the optimal valueg' of the reaction coefficient 
will be the one that minimizes var iq^). That is, 

g' = Ky/q- (44) 

If the activist policy g = g’ is pursued and no monitoring occurs, 
then 

Eiq'^ 1') = 9 + (1 - h)yi- (45) 

As in Brainard’s analysis, only the fraction A* of the predicted effect of 
a change in the indicator is offset by optimal monetary policy. In 
contrast to this, (41) implies that all of the predicted effect will be 
offset by the outcome of monitoring. Thus optimal monetary policy 
without monitoring can in this case accomplish more than monitor¬ 
ing. With private monitoring the real money supply will overreact to 
information about aggregate disturbances. Specifically, it follows that 
the social gain to monitoring when g = g' is 

V'ig') - V^ig') = (A/2)X,o-W[1/(/3* + C7l) - l//3^] < 0. (46) 

Optimal monetary policy can be characterized in this variant model 
as we did in the basic model with results that are similar but with a 
complication, because even when g = g' and the social gain to 
monitoring is negative, the private gain in this case is actually positive. 
Specifically, 

V^ig) - VHg) = (*/2)X,(7*(1 - X^/g'l^'" (47) 


See Appendix. 
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from which it follows that V^(g') - V^{g’) > 0. Thus optimal monetary 
policy may require^ to be somewhat larger thang' so that people will 
not be motivated to monitor. 

For our purposes the important feature of optimal monetary policy 
is that the activist policy {g = g') will be optimal whenever 

(A/2)X^(r2.{l - + (kl2)K(T% (48) 

and 

S (A/2)X,o-ia?[l/|3^- + ot5)]. (49) 

If (48) is satisfied then monitoring will occur under a neutral or 
disseminating policy but not under an activist policy. If (49) also holds 
then the activist pttlicy is optimal because its cost (relative to F^[g']) is 
i„. whereas, from (46) and the fact that l^'(g) is independent of g, the 
cost of each of the other policies will exceed the RHS of (49) by the 
sum of monitoring and administrative costs. As in the previous sec¬ 
tion, the monetary authority needs no cost advantage for these condi¬ 
tions to be satisfied if X(, < I. For example, they may be satisfied even 
if z„ > z,. + Zp, because the first term in (48) is just X(, times the RHS of 
(49). 

I'hiis, an activist monetary policy can be justified because the mon¬ 
etary authority, by avoiding the overreaction to the indicator that 
would occur with monitoring, is able to accomplish more than could be 
accomfilished by private monitoring. The overreaction of private 
agents monitoring s occurs because each individual sees himself as 
being able to offset the effect of.« by changing his relative price, the 
coefficient of which, n, is known with certainty. Thus, he will try to 
offset completely the predicted effects of a change in s. As Brainard 
has shown, this is the optimal policy when the effect of the price 
setter’s action can indeed be predicted with certainty. But the effect of 
every agent trying to alter his relative price is a change in the 
absolute price level, the effect of which cannot be predicted with 
certainty. As Brainard has also shown, the optimal reaction when the 
effect of policy is uncertain is to attempt to offset less than 100 
percent of the predicted effect of the disturbance. The crucial as¬ 
sumption in this argument is that the degree of uncertainty attached 
to the effects of relative-price changes is different from that attached 
to absolute-price changes. Private agents take the former into account 
but not the latter. In the particular model used above the former 
uncertainty is less than the latter, but this is not essential to the 
argument. In the reverse case private agents would itnderreact to 
monitored aggregate disturbances, once again giving a rationale to an 
activist policy. 
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In summary, this paper has argued that an activist monetary policy 
may be justified even when people’s expectations are formed 
rationally if .there are costs of gathering and processing information 
and if there is uncertainty concerning the exact values of parameters 
in the economic system. The justification that we have presented is 
twofold: first, that private agents may not be motivated to use infor¬ 
mation that can be used profitably by the monetary authority and, 
second, that even if they are motivated to use it they may not make as 
good use of it as could the monetary authority. 

The formal argument has been presented with a simple model 
employing specific functional forms chosen for their tractability. 
Therefore, much work remains before the degree of generality of the 
results can be ascertained. 

The first rationale that we provided for an activist policy should be 
distinguished from that of Fischer (1977) and Phelps-Taylor (1977). 
Their models are similar to ours in that prices arc set in advance of 
trading rather than being determined by an independent auctioneer 
according to market-clearing conditions. However, their models allow 
activist monetary policies to improve welfare because they postulate a 
temporary rigidity in prices that prevents private agents from realiz¬ 
ing all mutually advantageous and perceived gains from trade. This 
raises the question of whether the contractual arrangements implicit 
in such models would be competitively sustainable (see, e.g., Barro 
1977). Our argument has gone one step further and shown conditions 
under which these contracts are in fact sustainable. We have not 
postulated any rigidity or imperfection that allows the monetary au¬ 
thority to react more quickly or with more flexibility to variation in the 
indicator than private agents can. Instead we have shown that pri\ate 
agents may choose not to act on as much information as the monetary 
authority possesses, or even as much as the monetary authority makes 
freely available to them, even when it would be Pareto improving for 
them to do so. In other words, this argument also provides an expla¬ 
nation for the fact that private contracts are typically w ritten in nomi¬ 
nal terms without being indexed to the money supply or other aggre¬ 
gate variables. 

The paper has not addressed the question of whether or not the 
particular conditions derived above for the optimality of an activist 
policy actually exist in any real-world situation. The purpose has 
been mainly to argue that as a matter of principle the assumption of 
rational expectations does not imply the uselessness of an activist 
policy. It depends at least on the factors discussed .above. .Any bal- 
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anced evaluation of alternative monetary policies should also take into 
account at least two other important considerations. First, as Hayek 
(1945) has argued, the most important problem to be resolved is 
probably not what policy to pursue but whom to entrust with the task. 
An activist policy will work only if the authority can be entrusted or 
persuaded to pursue the public interest instead of his own and to 
pursue it competently. Second, as Hayek has also argued, the cen¬ 
tralization of decision making is most easily rationalized when the 
decisions require the u.se of general knowledge rather than local, 
specihe knowledge. In the model of the present paper every indi¬ 
vidual agent is essentially identical, so there is no clear distinction 
between these two kinds of knowledge. But more generally one would 
expect the aggregate disturbances to which monetary policy reacts to 
come under the heading of general knowledge, about which most 
induidual agents can be expected to have little expertise. Thus a 
further argument in lavor of an activist policy is that it allows society 
to centralize, and thus to bring to bear an eflicient degree of expertise 
on, decisions concerning such general knowledge. 

Appendix 

This Api-iendix sketches a formal derivation of the private \ahie of monitor¬ 
ing. In general terms, suppose that a decision maker has an inffinnation set / 
and his problem is to choose the value of decision variable v so as to 

max £ (Off + o'l.x + | /) (A J) 

Ixl 

where the a.’s arc random parameters. Then the expected v.ihie of iccciving 
the mfortnation tontained in the new infbimation set /' is the expet ted 
difteience m the optimized value of (A I) resulting horn lepl.icing/ by/'; that 

IS, 

/•;{[£(«, |/)]-'/4/-:(«,|/) - |A•(o,|/')l-/4/• ((o|/')}. (,\2) 

1. To derive (26) from (A2) note that in this case a, = + ri)(I + nk), ix^ ~ 

~(ii/2)(2 + rik), / = 0, and /' = (x). _ 

2, fo derive (28), let x be the average relative price, g, s) 'J, and 

note that in this case Oi = e„(q], + ri )(l + nk ), a-, = -e^(n/2)(2 -(■ nk)J = 0, and 

/' = {,x}. 

‘i. To derive (48) proceed exactly as in deriving (26) and lake the limit as n 

—► X'. 
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Equalizing Discrimination and Cartal Pricing 
in Transport Rate Regulation 


Kenneth D. Boyer 

St/ih' I ’nivt‘r\tly 


I hctt' (HO poi.sihle oiilcoincs of transport regulation: (1) main- 
taniiiig a < at i ter < at tel and (2) imposing equalizing discrimination 
against adianraged and in favor of disadvantaged shippers. Both 
fuiu lions have re<|iiited a complex rate structure to enforce the 
resfx'ciise forms of prue discrimination. Using a sample of freight 
hills from motor carriers and railroads, this paper demonstrates that 
tfie principal result of motor carrier regulation has been to maintain 
a cartel of truckers, while railroad regulation has thwarted the wishes 
of the railioad cartel by imposing equalizing discrimination on weak 
and strong shippers. Motor carrier rales respond in an economically 
rational matiner to costs and shipper bargaining power; rail rales are 
eithet untesponsive or perversely responsive to the same factors. 
Deregulation should have divergent effects in the two industries. 


Deregulation is a policy that has been recommended in recent years 
for .some but not all regulated industries (see, e.g,. Council of Eco¬ 
nomic Advisers 1979). Candidates for deregulation are most often in 
the transport .sector and never in the traditional public utilities such as 
electricity and natural gas distribution. Why is transport regulation con¬ 
sistently singled out for attention? Also curious are the extraordinary 
attempts of motor carriers to retain the regulatory fetters, while the 
railroads make somewhat half-hearted attempts to remove theirs (see, 
e.g., Williams 1978), What is it about the type of regulation that makes 
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oiH' transport industry wish to retain—at aW costs—what another 
transport industry is willing to shed? 

As this paper explains, the answer to both questions is found in the 
institution of transport rate making. There are two key differences 
between irad.tmnal lorms of price regulation and transport rate con¬ 
trols. First, in most instances, regulation involves category pricing 
white .ran,port r..« are teili.red individual buyer,. sL.iL ,r,„t 
piiri rate, are lyp,rally ,he re.ul, of cnlleciive rate Lking. while moM 
o er regu ate prices are not. Together, these aspects allowf the 
regulatory authority to bestow advantages and disadvantages on indi¬ 
vidual communities and industries; they also allow the industry to 
stabilize a sellers' cartel. 


This paper demonstrates that there would be two distinct results 
from deregulating the motor carrier and railroad industries since 
there arc divergent outcomes of regulation in the two modes. Current 
motor carrier rales show classic signs of cartel rate making; railroad 
rates reveal Interstate Commerce Commission (ICC) attempts to 
equalize advantages and disadvantages of shippers and Ux;alilics. De¬ 
regulating the trucking industry would inhibit cartel rale making, 
deregulating the railroad industry would lead to a more economically 
rational pattern of prices, with some rates rising and some falling. 

To emphasize the unusual nature of tran.sport pricing, Section 1 
shows what category pricing would be like in transport. Section 11 
shows how individualized rales and collective rate making lead to 
unpredictable outcomes in the regulatory process. Unlike category 
pricing, transport rate making is ideally suited for either cartel pricing 
or enforcing equalizing discrimination. Section HI investigates the 
current motor carrier and railroad rate structures and finds that 
different principles are apparent in the pricing of the two modes. The 
paper concludes with implications of the findings for deregulation 
arguments. 


I. Transport Prices Are Individual Rather than Category Rates 

Compared with motor carriers and railroads, the rate structure for 
most regulated industries is the model of simplicity; The customer's 
class of service is determined, and the block pricing for the class is 
applied. Such category pricing has two important implications. First, a 
customer's charges can be raised or lowered only by altering the prices 
assessed everyone in that category. Customers cannot demand special 
price concessions and are protected From unusually high rates by the 
presence of other members in the category. Category pricing prevents 
bargaining over individual charges between suppliers and customers. 

The second implication is for regulators. Because the pricing is 
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relatively simple, the regulatory authorities can make informed deci¬ 
sions about each price. The costs of providing service to a category of 
customers can, at least in principle, be calculated with some accuracy; 
again in principle, the regulators can check the supplier’s cost calcula¬ 
tions with a limited staff. 

The costs of providing transport service are quite variable (Harris 
1977; Spady 1979). Thus category prices for railroads and motor 
carriers would be more complicated than the price structures of most 
regulated industries. Such a scheme would, nonetheless, be feasible, 
f or example, railroad car rentals (which could change daily according 
to availability) could be separated from haulage charges. Haulage 
charges could be divided into categories depending on the number of 
cars in the train, the regularity of movement, and so forth. Track 
rentals could also be charged and graduated according to expected 
maintenance charges on a link traversed. Congestion tolls could be 
assessed if a shipment used a link with excessive traffic. Separate 
charges could be assessed for using switching locomotives and renting 
yard space. 

Similarly, motor carriers could be compelled to charge separate 
prices for separate services: one price for pickup, another for termi¬ 
nal sotting, another for use of line-haul service on a certain link, and 
so on. I'otal transport charges would then be determined by summing 
the costs of the service categories used. 

As long as a transporter could charge for only a limited number of 
service categories, this scheme would mean that a customer’s charges 
could be altered only by changing the rates of all those in a category, 
and that, by litniting the number of prices reviewed by the IC(>, a 
competent staff could determine the relation between the price and 
the cost of a service. As demonstrated below, if transport regulation 
had the nature of category pricing, there would be .scant reason to 
expect intense |xjlitical pressure either to preserve or to remove the 
regulatory apparatus. 


II. The Implications of Pricing Services Individually 

Most transport sales are not based on category prices but use a scheme 
of exception or commodity rates. Each price is tailored to the indi¬ 
vidual buyer (Locklin 1972). As demonstrated below, this has inevita¬ 
ble consequences for the relations among carrier, customer, and reg¬ 
ulator. 

A system of class rates for motor carriers and for railroads does 
exist. Under this scheme, each commodity is given a class rating and 
each origin-destination pair is given a rate base number. The trans¬ 
port price is the product of these two numbers. The class-rate tariff is 
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thus a two-category scheme. It makes pricing simple and thus capable 
of intelligent oversight by the ICC; it also means rates cannot be 
changed for one shipper without simultaneously changing them for 
all others in the same class. 

Because the class-rate scheme is only a two-category system, it can¬ 
not adequately reflect carrier costs which vary in more than two 
dimensions. In particular, neither economies of traffic density nor the 
likelihood of receiving a backhaul is accounted for in the class-rate 
system. 

These unusual cost circum.stances can be accommodated through 
the commodity-rate and exception-rate schemes which allow the 
quotation of a special price for shipping a good defined at greater 
than seven-digit precision between any two stations. That rates can be 
quoted so precisely reveals that they apply in practice to individual 
shippers. Commodity and exception rates are the opposite of category 
rates—they are individual rates. 

Commodity and exception schemes are now almost the universal 
pricing method for rail shipments, and they are the predominant 
form of truckload rates. The class-rate system is reserved for occa¬ 
sional shippers and for much le.ss-than-truckload traffic. 

To understand the nature of transport rate regulation is to under¬ 
stand the interaction between the ICC and those parties that propose 
a change in a commodity or exception rate. More than 85 percent of 
motor carrier rate changes are reductions proposed by carriers or, 
more Itequently, the rate bureaus to which they belong,' The uniter- 
sal rationale is to retain or gain traffic that would otherwise be moved 
by another mode. 

Under individual pricing, shippers may gain advantages over com¬ 
petitors if their own rates are lowered but other shippers' are not. 
To bring this about, a shipper must convince the carrier that either 
(1) existing tratfic will be diverted without the lower rate or (2) new, 
formerly uneconomical markets will be opened up if the rate is low¬ 
er ed. A shipper has two hurdles: First, the shippers must convince the 
carriers who would offer the rate that the threat of traffic gain or loss 
is genuine;'' second, should the rate be protested, the ICC must rule 


' this ligure was quoted by M. A. Crtjdeekei <>1 rhe Central Stales Motor Freight 
Bureau My understanding ol regulatory prtKedurc in this paper henehted bv discus¬ 
sions with (jode< kei and J. R. Ahlsirom ot the Western Truck Line Caimmiitec. 
Substantial alterations in the procedures for regulating railroads and motor earners 
weie passed by Congress in the time subsequent to the writing of the paper. The forms 
»l regulation discussed in this paper were those practiced in through the s|>rmg of 


* Motor carriers claim that collective rate making is necessary to protect them from 
ship[)ers opportunistic misrepresentation of the likelihtxid of traffic diversion. 
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on it. To go into (ff'f'ect the rate must be declared just, reasonable, and 
nondiscriininatory. 

Precisely what criteria the ICC uses to judge rates have never been 
dehned. The commission does demand cost evidence in the form of 
standard cost statements. It recognizes, however, that the costs sum¬ 
marized in the statements are average costs, and thus additional 
evidence of haulage costs is used; the standard piece of evidence is the 
rate of analogous hauls. 

The basic ICC regulatory procedure assures a balance of rates on 
analogous shipments. A just, reasonable, and nondiscriminatory rate 
is one which is not out of line with rates on analogous hauls. What is in 
practice regulated, then, is the rate relationship among analogous or 
competing hauls. 

The result of this regulatory procedure has been a transport rate 
sttiicture of fantastic comjjlexity; Hundreds of billions of individual 
prices have been established and approved. In contrast, under the 
typical category scheme used in most regulated industries, there are 
rarely more than 100 separate rate categories. The complexity of the 
transport rate structure precludes staff investigations of price-cost 
relationships for individual rates. Indeed, the very complexity of the 
rate structure discourages cost finding. With traffic equal to the sum 
of large numbers of unique movements, virtually all costs will be 
found to be joint or common and thus logically or practically untrace- 
able. 

This inability to cost individual movements has distinct advantages 
both to carrieis and to the ICC since it increases the discretion of each 
party to pursue separate goals in the regulatory process. The lack of 
reliable cc«t figures frees the rate bureaus to engage in demand 
elasticity-based pricing which is implicit in the policy of granting rate 
reductions only to those shippers who can credibly threaten traffic 
diversion. Without cost data, discrimination is extraordinarily difficult 
to prove. The complaint of a firm that lower rates are charged to a 
rival can be answered by the apology that the rate concession was 
necessary to retain traffic. It is, of course, crucial to the proper 
working of the scheme that price-induced traffic diversions to firms 
with identical cost structures be inhibited, otherwise the process of 
collective rate making will evolve toward cost-based prices. 

The difficulty of relating prices to costs gives wider latitude to the 
regulatory authorities to pursue what Hilton (1972) has called “the 
basic behavior of regulatory commissions.” Hilton identified the goal 
of regulation as dissipation of monopoly profits through cross¬ 
subsidization; a preferable description in the case of transport would 
be requiring discrimination which equalizes the advantages and dis¬ 
advantages of different purchasers of transport service. The phrase 
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“equalizing discrimination” is preferred to the more familiar title of 
“cross-subsidization” since the latter implies that the carrier uses 
profits from one service to finance another. But if carrier operations 
generally run a deficit, no service is paying for any other. The term 
“equalizing discrimination” emphasizes that the ICC is interested in 
equalizing the conditions of advantaged and disadvantaged shippers 
rather than in having one party pay the bills of another. 

Equalizing discrimination is the natural result of attempting to 
prevent “unjust” or “extractive” discrimination (i.e., pricing to extract 
rents from shippers and localities) without using a cost comparison. 
For example, the volumes generated by a large shipper typically will 
produce both lower carriage costs and many eager carriers, while a 
small shipper will have fewer alternatives and higher carriage costs. 
To prevent exploitation of the small shipper’s lower demand elastic¬ 
ity, in the absence of a reliable cost comparison, the regulatory au¬ 
thorities have no option other than to require equal prices for both 
shippers. The burden of this equalizing discrimination naturally falls 
on the advantaged shipper. 

Explicit examples of equalizing discrimination can be found in 
policies that require identical prices for shipment of goods f rom the 
Midwest to any East Coast port regardless of distance, require low 
rates on California citrus fruit and high rates on Florida produce so 
that both can compete in Northeast markets, and discourage the 
quotation of unit train rates even when there are obvious cost advan¬ 
tages (MacAvoy and Sloss 1967). 

The pressures on the ICC to practice such equalizing discrimina¬ 
tion are most intense in the case of relatively low value, homogeneous 
commodities such as agricultural products, coal, steel, and paper. 
Demand for these goods is most sensitive to delivered prices, and 
their profitability is most strongly affected by transport prices. These 
are also the goods carried predominantly by rail. In contrast, the ICC 
gets less intense pressure to use transport prices to equalize the 
conditions of manufacturers of finished goods—the most important 
customers of motor carriers. This is because demand is less sensitive 
to transport prices and because transport rates are a smaller part of 
delivered cost. 

Not only is there less pressure on the ICC to produce equalizing 
discrimination for those commodities hauled primarily by motor car¬ 
riers, but the commission has less leeway, since carriers can use their 
own trucks if common-carrier rates get loo high. One would exf>ect, 
therefore, that the ICC would give more care to regulating rail than 
motor carrier rates: Their range of actions and the participants’ 
interests are greater than in truck rates. 

In the next section the rate structures of the two modes are investi- 
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gated to test the hypothesis that the regulatory influence of the ICC is 
more evident in railroad than in motor carrier pricing. 


III. Empirical Evidence of Different Regulatory Regimes for 
Railroads and Motor Carriers 

To demonstrate the dual outcome of ICC regulation, individual rates 
were regressed on factors which proxy demand elasticities and costs 
of providing service. A cartel unthwarted by the ICC will have rates 
that vary directly with the co.sts of service and inversely with demand 
elasticities. 

It is a good deal more difficult to specify what would determine 
rates under equalizing discrimination. However, two aspects are likely: 
First, tost savings of a large shipping volume will not be passed along 
to customers in the form of lower charges; and second, having fewer 
shipping options is likely to lower rather than to raise rates since a low 
demand elasticity indicates a vulnerable shipper. 

The complexity of rail and motor carrier rate structures derives 
f rom the discriminatory aims of both the rate bureaus and the reg¬ 
ulators. But this same complexity precludes investigating rate struc- 
tuies in any manner other than explaining the patterns of actual 
charges for pm chased services.’ A sample of rail freight bills has long 
been available from the ICC, but the similar data file for motor 
cat 1 iers is the pi oprietary information of motor carrier rate bureaus.^ 
This study was made possible by obtaining the latter file for the year 
1972 and comparing it with the similar rail file. 

rite SIX analyzed commodities (the same for both lail and motor 
carriers) are perhaps not typical of either motor carrier or rail traf fic 
as a whole; the two modes do not have identical traf fic mixes.’ Rather, 
the commodities were chosen to be representative of the goods con¬ 
tested by both railroads and motor carriers. 

“ flic use of a samplf ol acuial frcighi bills lo tiiixicl ihc rale strut lure ralber tliaii the 
alternative proteditte ol tisitig rates printed in formal larifl statements is preferred for 
several leasons' (1) There are always numbers of rates available to any ship[rer, and it is 
not clear which would be the governing rate; (2) charges are sometimes dilfcrent Irom 
those printed in tariffs; and (3) a laigc part of the rate structure is nominal in the sense 
that no freight is ever moved under these rates. 

* Tlie tail data arc the JCT “One Percent Waybill Sample of Carload Terminations’’ 
(L S. Interstate Commerce Commcssion 1972). The motor carrier data are from the 
’All-Bureau laintinuirig Iraftic Study lor 1972" (Rocky Mountain Motor Freight 
Bureau 1972). 

’ The SIX commtKlmcs arc prepared feed NEC, assigned STCC numbers 2042129 
and 2042190 and NMFC number 729f0; wrapping paper, Sl'CC 2084120/21, NMFf: 
111,310, 2021490, 1.31800; pulplxrard or liberboard, STCC 263111.3/16120, NMKf; 
131310. corrugated liberboard or puipboard l>oxes, STCC 2651141, NMFfi 29275; 
tires, STCC 3011110, N.MFC 1572.30. For a complete description of commodity sclec- 
uon procedures and data fillers, see Boyer (1978), 
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The charges for shipping a hundredweight of each of the six 
commodities between various end points were studied. One of the 
strongest influences was found to be the size of the shipment. In the 
Appendix, I describe the procedure used to adjust the reported 
charge per hundredweight in each freight bill for variations caused by 
differences in loading weight. The dependent variable whose varia¬ 
tion is explained by the regressions in this section is a least-squares 
forecast of what the charge per hundredweight on a freight bill would 
have been had the shipment been made at a standard size of 340 cwt 
for trucks and 680 cwt for trains.* Only boxcar movements of rail 
commodities were considered. 

The independent variables which could be generated from the 
freight bill samples were the length of the haul (Ml)-, the number of 
shipments of this commodity, on this haul, in the sample (SUP)-, the 
number of alternative destinations open to a shipper of this commod¬ 
ity from this origin (DST); the number of origins for this commodity 
from which a- receiver at this destination can choose (ORN)-, and 
existence of active competition from the other regulated mode of 
transport (INT). It can be assumed that costs per hundredweight 
increa.se with distance and decrease with shipping volume. Similarly, 
large values for DST and ORN as well as a positive value for /NT 
should indicate a shipper with a higher demand elasticity. In addition, 
the existence of private trucking suggests that shipping volume will be 
positively related to demand elasticities. 

The functional form for these regressions was: 

In (RATF,) = ^W/,) + + b^(ln Ml,)■' 

-f- Ml, - In lOOf -t- ^sA(ln Ml, - In 500)’ 

+ C, In (SHP,) + C 2 In (DST,) + C, In (ORN,) -i- C^(INT) 

n 

+ '^dPj,+ e,, 

j=i 

where 

RATF 

Ml 
M 
N 

I 
& 
k 

a' ® These correspond lo (ruckloads o( 17 ions and carloads ol ions which were laken 

to be representative in Meyer cl al. (1959). 

■yt 


i = cents per hundredweight shipping charges (see 
Appendix); 

= length of haul; 

= 1 if MI, > 100, 0 otherwise; 

= 1 if A4, > 1,000, 0 otherwise; 

= number of freight bills in sample corresponding to 
this haul and commodity; 


S 
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DST = number of different destinations for freight bills in 
sample with this origin and commodity; 

ORN = number of different origins for freight bills in 
sample having this destination and commodity; 
INT = 1 if there is a freight bill in both samples with this 
commodity, destination, and origin and 0 if there is 
no interinodal competition; and 
D,, = 1 if observation is for commodity^ and 0 if obser¬ 

vation is not for commodity j. 

The functional form contains a cubic spline of distances with knots 
placed at 200 and 1,000 miles.^ I used this piecewise cubic approxi¬ 
mation of the distance taper to avoid prior constraints on the shape of 
the distance relationship. 

[ he regressions in table 1 show the results for the motor carrier 
rate strucTure. Table I also contains two rail regressions, one for the 
complete set of six commodities and one for a restricted set of the 
four goods for which haulage weight/rate relations could be estimated 
(see Appendix). 

As reported in table 1, the results strongly support the ct:)njecture 
that motor carriers use a cartel rale structure unhampered by gov¬ 
ernment interference. Large shippers do get lower rates (the elasticity 
of rates with respect to the number of shipments is -.039) as pre¬ 
dicted bv the cartel model; the cost of serving large shippers is lower 
and the likelihood of switching to private trucking is higher. The 
coefficient on the number of destinations used by the seller (DST) and 
the number of origins used by the receiver of a shipment (ORN) 
similarly confirm that the greater the number of competing hauls, the 
lower the rales. Finally, the coefficient on INT suggests that motor 
carrier rates are approximately 6 percent lower where there is actual 
competition from railroads than where there is not. All of these 
findings are cotisisteni with cartel rate making in the trucking indus¬ 
try. 

I'he ccrntrasi to the rail results is striking. The response of rail rales 
to larger shipping volume (SHF) is insignificant. A large number of 
destinations (DST) lowers a shipper’s rates, but a larger number of 
origins (ORN) does not similarly lower rates. And, finally, the pres¬ 
ence of motor carrier competition (INT) is associated with higher, 
not lower, rail rates. With the exception of the negative sign on DST, 
the remaining results are those predicted by a policy of equalizing 
discrimination where rates are either unresponsive or perversely re¬ 
sponsive to cost conditions and demand elasticities. The negative 

^ On rsiiniacing spline functions wilh ordinary least squares, see Suits, Mason, and 
Chan (197H) 
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TABLE 1 

Determinants of Regulated Freight Rates, 

Cubic Spline Regressions 


Motor Rail 

Carrier,- 

F.q (1) F,q. (2) Eq. (3) 


In Ml 
(In MI)* 

(In Ml)* 

(In Ml - In 100)’ 

(In Ml - In 500)’ 

SHP 

DST 

ORN 

INT 

Dummy variable' 
3011110 
2651141 
2631115 
2621490 
2084120 
2042190 

R* 

N 

F 


3.85 (8.71) 
-.77 (.73) 

.05 (.04) 

-.04 (.06) 

-1.92 (.10) 

-.037 (.014) 

- .0062 (.0042) 
-.012 (.1153) 
-.062 (.021) 

.12 (.0079) 

.26 (.016) 
-.29 (.013) 

- 12 (.013) 

-.12 (037) 

-.24 (.016) 


-17.35 (4 58) 
3.16 (89) 

-.22 (.058) 

.22 (.074) 

.23 (.10) 

-.011 (. 012 ) 
-.014 (.006) 
.0018 (.0070) 
.060 (.020) 

.46 (015) 

.51 (.028) 

.067 ( 0090) 
.13 (017) 

-.11 (.066) 
-.61 (014) 


-20.64 

(4.87) 

4.11 

(.96) 

-.26 

(06) 

.27 

(079) 

.27 

( 11) 

-.008 

(012) 

-014 

(.006) 

-.00031 (.0071) 

.064 

(021) 

.48 

(.015) 

.09 

(.008) 

.15 

(.017) 


.54 

6,472 

658.78 


82 
4,062 
904 06 


,58 (.014) 

.83 

3,836 

855.56 


S<U'8( Fs — S Imenuje (:<*mmert« Commts»K>n 1972. R«Hky Mntiniain Mim»r Frfighi Bureau 1972 
Notf —Dependent variable “ (itied cenui/hundredweight for cf\d*poin( pairt Nunihen m parrnihc»e» are SF.i 


coefficient on DST indicates that there is some rational response to 
demand elasticities as well—an indication of some cartel rate making 
within a structure whose primary characterization is enforcing 
equalizing discrimination. 

The regressions in table 1 give other evidence consistent with the 
hypothesis that railroads are thwarted by regulatory authorities while 
motor carriers set rates just as a successful trucker's cartel would wish. 
The first evidence comes from the spline variables on mileage. The 
mileage tapers, expressed on the vertical axis as a percentage of 
charges for a 500-mile haul, are graphed in figure 1,* The two modes 
appear to be very similar; the rail taper has unusually low short-haul 
rates and relatively high charges for the longest hauls. This is pre¬ 
cisely the opposite of what would be generated by cost-based rates and 
contradicts the basis for many studies of optimal intermodal traffic 
allocation (Friedlaender 1969; Levin 1978). The slopes and positions 
ol t.ie two curves indicate that the observed propensity for long hauls 

* On the safe assumption that rail rates will be lower than road rates at the standard 
500-inile haul, the results suggest that rail and truck rates either diverge as mileage 
increases or that the rail curve cuts the motor carrier rate curve from below. 
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Fic. I —Mileage tapers tor rail and motor earners Souue' spline (oeUii icnts esli- 
maifd in table 1. 


to be shipped by rail and short hauls by truck is not the result of 
shippers responding to cost-based rates but, rather, must be due to a 
convergence of service qualities. 

rhe results are consistent with an ICC policy of “fair sharing 
Irafbt” among modes. Rail rates are held high tor long hauls to allow 
motor carrier participation and are allowed to fall for short hauls 
where motor carriers do not have to fear serious traffic diversions.® 
A final piece of evidence for the differing regulation of motor 
carrier and railroad rates comes from the commodity-specific dummy 
variables in table 1. Converting logarithmic coeflicients to percentages 
yields the data in table 2. The numbers are average rates for each of 
the six commodities. The variation in rail rates explainable by com- 

* On fair sharing ol traltic, sec ihc “Ingot Molds Oasc" (American (Commercial Lines, 
Inc., el al. v. Louisville anti Nashville Railroad 0>. ei at. 1968). Note that the evidence is 
also consistent with one or the other, but not both, of the modes of pricing under cartel 
rate malting 
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TABLE 2 

Value of Service Pricinc 


Commodity 

Percentagk oe 

Charges on 

Average Commodity 

Motor 

Carrier 
from Eq. (1) 

From Eq. 

Rail 

(2) From Eq. (3) 

3011110 

1 13.22 

159.44 

163 02 

26,51141 

130.31 

167.35 


2031115 

74.41 

106.97 

109.46 

2621490 

87.96 

114.37 

116.72 

2084120 

88.58 

88.95 


2042190 

78.01 

53 80 

55.49 

sn 

21.82 

42.90 

44.05 

St)i Kt.r — T.iblp 

i. dntilog ol commodity dummy v<inal>lr4 • 



modily value is more than twice as high as that for motor carrier rates; 
The column standard deviations are 42.9 percent for rail as opposed 
to 21.8 percent for truck. This suggests that rail rales are more closely 
related to commodity type than are truck charges.’® This finding is 
consistent with a railroad industry which maintains an obsolete system 
of value-of-service pricing and a trucking industry which has adapted 
to an environment of intermodal competition. 

IV. Conclusion 

I'he evidence in this paper shows that both motor carrier and railroad 
rates are different from those that would be observed without regula¬ 
tion. The fiattern of rates suggests, however, that regulation of the 
two modes has had different results. Perhaps the strongest support 
for the hypothesis that railroads and motor carriers are subject to 
different types of regulation comes from the public posture of the two 
modes toward deregulation. Within the last years, the Association of 
American Railroads has endorsed a "deregulation" proposal. How¬ 
ever, the proposal does not suggest that collective rale making be 
abandoned. In contrast, the American Trucking Associations is 
vehement in its opposition to any changes in transport regulations. 
This is consistent with the findings of this paper; (1) Motor carrier 
regulation ratihes a carrier cartel, while ( 2 ) rail regulation has 
classic equalizing discrimination as its primary characteristic. 

The removal of government oversight over rates and entry and the 
simultaneous outlawing of collective rate making should have diver- 


" The simple correlation between cols, a and b is .71. 
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gent outcomes in the railroad and motor carrier industries. A de¬ 
regulated trucking industry would—at least in truckload service— 
develop a rate structure more closely related to costs of service. A 
general reduction in trucking prices would be expected. 

Deregulating rail rates would have far more complex results. Given 
the small number of rail carriers in any market, a strictly cost-based 
rate structure could not be predicted; nor, if rail marginal costs are 
below average costs, would this result be desirable. Instead, my 
analysis suggests that a more rational rail rate structure would result, 
one based both on costs and on the ability to divert traffic (the latter 
related, presumably, to trucking costs). Deregulating rail rates should 
not lead to a general increase or decrease in rail rates but to a 
reshuffling of prices. 

This expectation is confirmed by the recent removal of ICC over¬ 
sight ovei' fruit and vegetable tariffs. The diversion of traffic to 
unregulated truckers over the last 2 decades effectively defeated 
the attempt at equalizing discrimination, and in 1979 rail rates on this 
traffic were deregulated. The result was a simultaneous increase of 
some rates and a sizable diversion of traffic to railroads in response to 
lower rates for other hauls (Association of American Railroads 1979; 
American rrucking Associations 1979). There is every reason to 
expect that deregulation of rail rates for other commodities would 
have a similar eflect. 

The disparate effects of regulation in the two modes are the direct 
lesiilt of the institution of pricing services individually rather than by 
categories. Individual pricing, which has allowed the ICC to pursue a 
policy of equalizing discrimination, has also required the use of rate 
bureaus and collective rate making. Had the transport industries used 
category rate making, regulation to enforce equalizing discrimination 
would have been hampered and collective rate making would have 
been unnecessary, and, therefore, it is doubtful that the fervent calls 
for and against deregulation would be heard today. 


Appendix 

rile regic>i.sions in the text explain variations m fitted oi standard rales lor 
each commodity-origin-destination combination. These rate.s are adjusted foi 
differences caii.sed by variable loading weight among different commodities. 

While most discussions of transport pricing proceed on the assumption tliat 
there is a single charge per unit of service, in lad this is not the ca.se. There 
are separate charges per hundredweight for shipments of a commodity be¬ 
tween two end points for (a) minimum charge, (h) less-than-truckload (L TD, 
(r) truckload (TL) or carload, and (d) very heavy shipments. Due to the 
different charging legimes there is an inverse relation.ship, “a weight taper," 
between charges |)er hundredweight and loading weight. To develop a 
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Structure of single rates for commodities between end [joints, the first step 
was to estimate a weight taper for each commodity. 

The weight tapers were estimated on the basic assumption that for every 
origin-destination pair, the shipping charges per hundredweight for a ship¬ 
ment of one size would be proportionate across different levels to the charges 
for a shipment of a standard vehicle load of the commodity. The standard 
procedure of spline estimation was used to estimate the shape of the lapier in 
the absence of prior knowledge of its functional form. Knots were arbitrarily 
placed at 100 and 300 cwt for motor carrier e.stimation and at 500 and 1,000 
cwt for railroads. 

The functional form used to measure the rate ta|)er with respect to loading 
weight for each commodity was: 

In centsi = A, (In wti) + A 2 (ln wlif -t 6,(ln + A^fCln wt, - \nj f 

+ br,0(\nwl, - lug)* + ^ Cjfjji + fi, 

J-t 

where 

cents, = cents per hundredweight for shipment; 

wt( = weight of shipment in hundredweight; 

Dji = haul dummies: Z>j, = I if the shipment is on end-point pair^, 
Dj, = 0 if the shipment is not on end-point pair; ; 

/•' = 1 if «)<( > 100 for motor carrier or wli > 500 for railroads, 0 

otherwise, 

/ =100 for motor carriers, 500 for rail; 

C = 1 if let, > 300 for motor carrier or ict, > 1,000 for rail¬ 
roads, 0 otherwise; 

g = 300 tor motor carriers, 1,000 for rail; and 

f( = error term. 

The values of the splitie variables arc listed in tables A1 and A2 for rail and 
motor carriers, respectively. The number ol dummy variables is listed in each 
table as the number of hauls. For two commodities, there was insufficient 
variation in the independent variable to estimate the relationships for lail- 
roads. 

In general, the tapers have at least one steeply downw.ird-sloping seg¬ 
ment which for motor carriers can be interpreted as the transition from l.TI. 
rales to TL rates. As should be expected from commodities that differ in load¬ 
ing characteristics, this steep downward-sloping section occurs at different 
weights for different commodities. 

Some commodities have Hat motor carrier tapers beyond the first down- 
waid-sloping segment, while others continue to fall; still others have a slightly 
upward-sloping segment at the very end Since spline functions are piece- 
wise cubic, it IS not clear how much lailh should be plated in the aberrant 
Ijehavior of the tapers of some commodities for veiy large and ven small 
shipment sizes. 

1 he downward-sloping segment of the weight la[)er for rail-shipped com¬ 
modities is found m most cases at much larger shipment sizes than lor motor 
carrwrs. For most rail commodities, the downward slope is not as steep as for 
trucks. As must be expected for commodities with different loading charac¬ 
teristics, there are exceptions to this rule. Without exception, however, the 
downward slopes for tail commodities are at a much higher range than lor 
truck commodities. 
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Commodity 




2621490 

2631115 


Indefendlnt 

2042190 

Wrapping 

Pulpboai d or 

3011110 

Variabi.f. 

Feed 

Paper 

Fiberboaid 

Tires 

h, 

172.48 

- 103.78 

-128.61 

-7 47 


- 30 02 

17.61 

21.13 

1.10 

b-i 

1 72 

-.99 

-1.16 

-.05 


-3.81 

1.85 

1.62 

-3.71 


5.11 

1.69 

- 82 

251 37 


.963 

998 

.980 

.979 

r" 

.08 

.90 

378 

296 

Obsci vations 

1,187 

562 

3,159 

999 


890 

432 

1,941 

577 

F 

.5 38 

231 

147 63 

35 12 


S«n k< » —Iritersiate Ounmeiic ('.tininiivkiofi 1972 

Votii — rk'iMrndeiil variable -- In {(cni& pri lMin<ire<iwnghO taiiiaiioiK foi wme atul lN>xei cDuld iiol l)e nliiiiated 
due lo inMilhcicni vanaiion in die independrrii vanabJes 


Two (.oetficients ot multiple determinalion arc icpoi led lor each equation. 
As usual, R- is interpreted as the peicentage of the total variation in the 
dependent v;u table which can he explained by the i ight-hand-side variable, 
riiis is misleading, however, .since the extensive use of dummy variables 
attoimts for inuc h of the apparent explanatory powei of the i egressions. The 
second coelficient of multiple determination. is defined as- 







/ . 

where I'j, is the Mil observation on the dependent variable in group j.Y) b >ite 
mean ol the de|XMideiil variable lor olxservations in group/, and fj.’is^^,^ 
fitted value of the ith ohseivalion on the dependent variable in group J. 
Numbers in the row marked r- aie thus a more accniale indication ol the 
explanatory power of the five spline variables. The reported F-sialisiic is 
ccimputcd using 

The duiiimy variables iii each equation weiecompuied using the analysis ol 
covaiiance. They weie evaluated, howcvei, not at 0 ewt but at d40 cwi fot 
irticks atid 680 cwt lor ttains. The dummy variables thus ate interpreted as a 
standaid late lor each haul and commodity: a lea.st-scjuaies forecast ol the 
I ate ' ad all shipments bt;en made at the standaid si/c. The standard rate thus 
puiges all inlluence ofvaii.iblc loading weights within a commoditv. Foi the 
two rail commodities whose weight lajreis could not lx: estitnated, the stan¬ 
dard rate was computed as the average charge foi the haul, icg.iidless ol 
loading weight. 
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and Existence of the Bid>Ask Spread 
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By considering investor order placement strategy, this papter dem¬ 
onstrates that transaction costs cause bid-ask spreads to be an equi¬ 
librium property of asset markets. With transaction costs, the prob¬ 
ability of a limit order executing does not go to unity as the order is 
placed infinitesimally close to a counterpart market quote; thus, with 
certainty of execution at the counterpart market quote, a "gravita¬ 
tional pull” is generated that keeps counterpart quotes from being 
placed inhnitesimally close to each other. An equilibrium spread is 
defined and its size linked to market thinness; implications are noted 
for the design of a trading system. 


I. Introduction 

This paper establishes that transaction costs in secondary asset mar¬ 
kets cause individual investors to use order placement strategies that 

We thank Amir Barnea, Avraham Bcja, Rirharcf Burton, Mark F.akcr, Dan Galai, 
Kenneth Garbade, Thomas Ho, Wesley Magat, David Peterson, and Marshall Sarnat 
loi their helpful comments. Karher drafts of this paper have been prcsenied al semi¬ 
nars ir. economics and finance at Duke Universiiy, the tfebrew t'niversity al Jerusalem, 
New York Universiiy, Tel-Aviv University, and the Joini National Meetings of Tl MS/ 
ORSA (New York Giiy, May 1978) 
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result in a nontrivial market bid-ask spread.' We define an equilibrium 
market spread and demonstrate that it will be greater for thinner 
securities. 

The analysis fits into a growing body of literature which increas¬ 
ingly is being referred to as the microstructure of security markets. 
Stigler (1964), Demsetz (1968), West and Tinic (1971), Tinic (1972), 
and Tinic and West (1972, 1974), along with Farrar, Smidt, Stoll, and 
others involved in the institutional investor study (see U.S. Securi¬ 
ties and Fixchange Commission 1971), were among the first to focus 
rigorous analytical attention on the operations of security markets. 
More recently, microstructure theory has been explicitly considered 
by, among others, Carman, who coined the term (1976), Beja and 
Hakansson (1977), and Cohen, Maier, Schwartz, and Whitcomb 
(hereinafter CMSW) (1978). The analytical issues addressed involve 
the interplay among market participants, trading mechanisms, and 
the dynamic behavior of security prices. In the present paper, we 
study the formulation of optimal investor trading strategies and how 
these interact with one aspect of the dynamic behavior of security 
prices, the bid-ask spread. Spreads are of concern to investors because 
they are a variable cost of trading by market order and because they 
cause an inflation of transaction-to-transaction returns variance. 

The pioneering analysis ot bid-ask spreads was provided by Dem- 
setz (1968). Furtlier studies include: rime (1972), Tinic and West 
(1972, 1974), Renston and Hagerman (1974), Carman (1976), Ham¬ 
ilton (1976, 1978), Branch and Freed (1977), Stoll (1978(7, 1978A), 
Ho and Stoll (1979, 1980), Newton and Quandt (1979), Schleei 
and Mildenstein (1979), Smidt (1979), and Amihud and Mendel- 
son (1980). In CMSW (1979) we discuss this literature and contrast 
it with the formulation presented here. Except for Ho and Stoll 
(1980), there has been no explicit consideration of the transition from 
individual spreads to the market spread. It will be clear from our 
analysis that this transition is not a simple aggregation process and 
that the market spread is the product of a dynamic interaction in¬ 
volving many market participants. Al.so, previous theoretical models 
have generally assumed that investors can be dichotomized into two 
grijups—immediacy demanders and immediacy suppliers. Our 
model of investor order placement strategy suggests that such a 
dichotomy will not be observed in the marketplace. 

For the market, the spread is the difference between the lowest ask 
and the highest bid of all participants. In markets composed of many 


* Wc have elsewhere (Cohen et al. 1980) considered how ihe impact ol transaction 
(o.sts on stork price movcmenis introduces serial correlation in returns data and causes 
estimates of the market model beta coefheient to be biased. 
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traders with heterogeneous belief's and trading propensities, one 
might expect to have orders at virtually every permissible price in the 
neighborhood of equilibrium and hence to find no significant market 
spread.^ However, we show that even when expectations and trading 
propensities are heterogeneous, the spread is a property of asset 
markets that have temporarily cleared.'* The analysis yields an exis¬ 
tence proof of a noninfinitesimal spread with continuous pricing; a 
fortiori this proves for discrete prices the existence of a positive 
spread for nontrivial reasons."* 

The es.sence of our argument is as follows. At any point in time, any 
investor might alternatively seek to trade via a limit order (be an 
immediacy suppliei), trade with certainty via a market order (be an 
immediacy demander), or not seek to trade at all. Limit orders create 
the book, and market orders clear out limit orders. Becau.se execution 
via market order is certain, while execution via limit order is not, it 
never pays for any investor to place a limit order (e.g., a bid) at a price 
too close to that of a counterpart limit order (e.g., an ask). Intuitively 
stated, as a trader contemplates placing a bid closer and closer to an 
ask already established on the market, he is increasingly attracted by 
this counterpart offer; at some point, the “gravitational pull” exerted 
by the established ask will dominate. The trader will “jump” his price 
and execute with certainty via a market order. 

Section If establishes the scenario for our analysis. Section III 
focuses on the probability of a limit order executing and shows that, 
with transaction costs, tfiis probability does not go to unity as the price 
at which a limit order is placed becomes infinitesimally clo.se to a 
counterpart market quote. I his demonstration underscores an inves¬ 
tor’s need for an order placement strategy and provides the founda¬ 
tion fot our gravitational pull model. Section IV' models the investor’s 
order placement decision and develops conditions under which he 
will transmit limit or, alternatively, market orders to the market, or do 
nothing. The analysis is developed in a dynamic programming 
f ramework, although we are interested in the descriptive modeling of 

* Will'll an as.si'l market has ilcared, there is neither exiess (feniantl nnr excess supph 
111 the sense that at that iiinmeiu no market participant 15 willing to bus the asset at a 
price eijtial to or greater than the ask, and no one is willing to sell at a priie eipial to or 
less than the bid. Hence we iiiii.st have a market spread that is at least equal to the 
niminuiin allowable iiricc change lor the asset m question (on the major I'.S slock 
exihanges, this is 1/8 ol a dollar loi most loinmon slrnks) Howeser, it is not ohsioiis 
why spreads greater than miiiimiiin allowable price changes are coniiiionlv observcKf 

’ While the analysis presented here is applicable to any asset market, our toriiial 
model treats the secondary market for iinaiuial securities Seturilv markets have two 
(onvenient properties: All units o( an asset are identical, and such markets arc imper¬ 
sonal (which means that bargaining, as distinct from trading, strategies need not be 
(onsidered) 

* .See n. 2 above. 
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an investor’s decision process rather than in actually generating a 
normative solution. Section V demonstrates that implementation of' 
the strategic order placement decision (which implies the gravitational 
pull effect) causes a noninfinitesimal bid-ask spread to exist. This 
section also defines an equilibrium bid-ask spread, discusses condi¬ 
tions under which it will exist, and shows that it is positively related to 
market thinness. Section VI considers implications for the design of a 
security market trading system and summarizes our analysis. 

II. The Scenario 

Consider an investor who maximizes the expected utility of terminal 
wealth and, for simplicity, let him allocate funds between only two 
assets: a risk asset and cash (which we take to be the numeraire asset). 
In the absence of transaction costs, the market would be monitored 
continuously and appropriate transactions would be made with each 
change in the market price and the investor’s demand propensities. 
I'hen, if price were continuous, there would be no spread and the 
market price would be determined by a straightforward aggregation 
of individual demand propensities. 

Howetcr, a variety ot transaction costs impact on the investor’s 
trading decisions. The fixed (with respect to number of shares traded) 
costs of assessing information, monitoring the market, and conveying 
orders to the market imply that the investor will make trading deci¬ 
sions only periodically. Further, when decisions are made, he will not 
convey his full set of demand propensities to the market. For one 
thing, trades that involve sufficiently small portfolio adjustments 
would not Justify the transaction costs incurred.* Also, attempts to 
transmit several limit orders simultaneously would be likely to over¬ 
load our current system. Furthermore, a continuous auction which 
does not generate a Walrasian .solution cannot readily handle multiple 
buy-sell orders that, ex ante, are intended to be alternatives.® 

In light of transaction costs (and also taking account of the timing 
and magnitude of exogenous cash flows), the investor will establish a 
discrete set of decision points. In the analysis presented in Section IV, 
we take the frequency of the.se points as predetermined. Upon 
reaching a decisiczn point, an investor can do nothing, or hit an 

‘ li can be readily demonstrated that variable as tvell as fixed transaction costs make 
sufficiently small portfolio adjustments prohibitively expensive. 

* That IS, if any array of buy and sell orders from one investor is executed sequen¬ 
tially, the desired quantities at each price should be dependent on the exact sequence of 
purchases and sales followed. However, the investor does not know ex ante whicti 
sjietific sequence will occur This problem could, of course, be handled (at a cost) by 
conditioning orders on the sequence of prior transactions (i.e,, if the limit order to buy 
200 at $.50 executes, then sell 100 at $.56). 
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existing limit order with his own market order, or place his own limit 
order at a “better” price and run the risk of its not executing. A 
concurrent strategy issue also exists when the investor finds it pro¬ 
hibitively expensive to convey his entire set of demand propensities to 
the market; for all intents and purposes, he must select the single best 
alternative to transmit. 


III. The Probability of a Limit Order Executing 

In this section, we establish the conditions under which the probability 
of execution does not approach a limit of unity as the price at which 
the limit order is placed is taken to be infinitesimally close to the 
counterpart market quote. Under these conditions we obtain a prob¬ 
ability jump (at the counterpart market quote) that underlies the 
gravitational pull effect developed in Section V. By showing that the 
probability jump can be attributed to the existence of transaction 
costs, we establish the basic linkage between spreads and transaction 
costs. 

Consider the case where an investor contemplates submitting a limit 
bid at the price at time t, and let price be a continuous variable. A 
similar argument can be constructed for the case of a limit ask. As¬ 
sume that if the limit order is unfilled by the next decision point at 
time t + 1 the order will be canceled. Let L be the length of time 
between decision points t and I + 1. 

Let be the market ask price at time t. Consistent with the 
random walk version of the efficient markets hypothesis, we make the 
Markov assumption that each subsequent market ask depiends only on 
the last previous market ask. If we also assume that each change in the 
natural log of the market ask,Z(, is a random variable that is indepen¬ 
dently and identically distributed over time with mean zero and vari¬ 
ance var (Zj), then we can model the market ask price generation 
process as a compound Poisson stochastic process:^ 


' For cxpositional simplicity we assume that the stochastic processes considereti in this 
section have no drift (that is, their expected value is zero) It should be noted that even 
though In Pf'' IS assumed to have no drift, the pnee series itself, Pf*. mav have drift 
For example, when In {PI’AIP}''') is normal, with mean zero and standard deviation cr. 
then PH*,IP}'* is log noimal with mean exp (!^ <z’). 

There will be a realization of the random variable at each point ol time when an\ 
one of the following events occurs to affect the specific limn order which sets the market 
ask. (a) it is withdrawn; (6) it executes agamsi a crossing buy order; or (r) it remains on 
the book but is no longer the market ask since a lower limn sell has been submitted 
Note that events of types a and b necessarily result in a change in the market ask 
when price is continuous and utility functions are heterogeneous {which implies a zero 
probability ol having two or more orders at a specific price). Also note that only events 
of type h are associated with transactions; hence the numbei of Z, which matenalize 
during any interval will usually exceed the number of transactions in that interval. 
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In/^'‘(A) = In + 2 ] (1) 

(=1 

where A is the time from the last decision point. When A equals L, we 
haveP"''(A) = \ when A equals 2L, we have/'f''^(A) = Pz+a, etc. The 

number of changes in the market ask that take place in the time 
interval A is .V(A). We assume N(\) follows a Poisson process with 
arrival rate u. 

The next step is to determine the probability of execution (during a 
time interval of length L) of a limit bid which is submitted at a price 
pLH gieaiei than the current market bid (Pf"*) but less than the 
current market ask (P}'*). Clearly, as the limit bid price approaches 
(from below) the tnarket ask, the probability of execution increases. 
One might suppose that this probability approaches unity as the limit 
bid approaches the market ask; however, this need not be the case. In 
the Appendix we restate formally and prove: 

Pkoposi I ion 1:11 /’/'■' is geneiated by the compound Poisson pro- 
(ess ol equation (1), then no matter how close the limit bid ajtjjroaches 
(Irom below) the market ask, the i)robability of the limit order 
e.xecuting is less than unity in any time interval of finite length. 

.Since a market order will always execute with probability one, propo¬ 
sition 1 gives a jirobability jumjj at the market ask. 

rransaction costs are crucial to the existence of the probability 
juinj). In the absence of transaction costs, one might expect that, 
follow ing the work of Merton (1973), the logarithm ol the market ask 
would best be described by a Wiener jjrocess with zero drift: 

c/1 iiT;''(A) = (rdZ(A). (2) 

where rr is ihe instantaneous variance of the process and dZ{A) is a 
stanclarcli/ed Wiener process. In this case, the price P}'^{A) would 
experience an infinite number of adjustments in the interval 0 ^ A ^ 
L. In the Appendix we restate formally and prove: 

Proposition 2: If T'”''(A) is generaied by the Wiener process of 
et|uation (2), then as the limit bid a|j|>roaches (f ioin below) the market 
ask, the probability of the limit order exe< uting a|3proaches unity for 
all time intervals of the finite length. 

Proposition 2 implies that for the Wiener process there will be no 
probability jump at the market ask. 

W'e next consider whether there is a relationship between the com¬ 
pound Poi.s.son process of eijualion (1) and the Wiener process of 
equation (2). In the Appendix we restate formally and prove: 
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Proposi tion 3: If the random variable Zj in the compound Poisson 
process of equation (1) has two equally likely possible values, +a and 
—a, and if the arrival rate v of this compound Poisson process in¬ 
creases without bound while a simultaneously decreases in such a way 
that is constant, then the compound Poisson process approaches 
the Wiener process described by equation (2) with the variance cif the 
Wiener process, <r^, equaling va^. 

Proposition 3 states that the compound Poisson process can be ex¬ 
pected to approach the Wiener process under appropriate assump¬ 
tions." Thus the probability jump of proposition 1 would disappear as 
1 / increased and var (Z() decreased. In the Appendix we restate for¬ 
mally and prove: 

Profosii ION 4: As the arrival rate v in the compiound Poisson prfx;ess 
of equation (1) increases, the probability of a limit order executing 
increases for all P'," < P}‘‘'. 

Hence the probability of a limit order executing increases at all Pf" 
< P}'' as the activity proxy 1 / increases. Stated conversely, the "thin¬ 
ner" a security, that is, the less active are investors in submitting 
orders to trade in the security, the lower will be the probability of 
execution at each P',” < P}’'*, and therefore the greater will be the 
probability jump at P, 

The four propositions stated in this section (and proved in the 
Appendix) have important implications for the analysis of the impact 
of transaction costs on bid-ask spreads. Without transaction costs, the 
market price could be expected to lx*have as a Wiener prcKcss; that is, 
there would be an infinite number of infinitesimally small price ad¬ 
justments. The investoi who was considering placing a limit order 
could reduce the probability oi his order not executing to as close to 
zero as desired simply by placing his order close enough to the 
counterpart market quotation. 

On ihc other hand, with tran.saction costs and a finite number of 
investors, the market price would generally not behase as a Wiener 
process. Investors would find continuous adjustments in their 
poitfolios too expensive, and market prices would behave as a 
stochastic jump process ([iioposition 3 shows that this process could 
generally be expected to approach a Wiener process as the older 
arrival rate is increased). One such jump process is the compound 
Poisson, and it also is consistent with a (inariingale) efficient market. 

" riie particular clisInbiitKin cliosen lor 7, in proi^iosilion .1 is not crilnal. Other 
appropriately chosen cli.si rcte or continuous diMribniions can also be shown in the limit 
to go to the Wiener puKess. 
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Proposition J demonstrates that for such a stochastic jump process a 
probability Jump exists. Proposition 4 has shown that the probability 
Jump will be greater for thinner securities. In Section V we will show 
that the larger probability Jump for thinner issues leads to equilibrium 
market spreads which are larger for thinner issues. 

IV. A Model of Investor Order Placement Strategy 

We now consider the question of when an investor will choose to trade 
via a market order or limit order, or not seek to trade at all. The 
problem is structured as follows. Because of the costs of monitoring 
the market, let the investor consider rebalancing his portfolio only at 

[ueselected points in time, / = 1,2. T - 1, where T is the 

investment horizon.“ In order to simplify the analysis, we now con¬ 
sider the placement of only purchase orders for the risk asset (omit¬ 
ting the symmetric case of sell orders without loss of generality). 

At any of the T - 1 decision points, the investor is faced with three 
possible courses ol action: 

a) Submit a market order to buy shares at the cui rent market ask 
price of P}''. 

h) Submit a limit order to buy shares at a limit bid price of P/" < 

pm 

r) Do nothing. 

In modeling the investor’s choice among these three alternatives, 
we find it convenient to make the following assumptions that simplify 
the analysis without materially affecting the nature of the conclusions. 
We assume that all orders are for a fixed number of shares AA( and 
that when any market or limit order is executed, it is satisfied fully at 
the stated price; this avoids the tedium of writing (average) transaction 
price and transaction costs as functions of the number of shares 
exchanged and of defining probabilities of partial execution We as¬ 
sume that unfilled limit orders are canceled prior to the next deci¬ 
sion point; this avoids both the need to include additional state vari¬ 
ables (the price and quantity of any limit order outstanding) and the 
need to analyze additional courses of action (submit a market order 
and leave the old limit order outstanding, or submit a market order 
and remove the limit order, etc.). Finally, we assume no lags in the 
transmission of information and orders; this avoids the complexity of 
dealing with changes in the current market quotes during the time the 
investor formulates and implements his decision. 


“ Nole that the rebalancing points need not be the same for all investors, so that 
trades can w tur at any time when the market is open even if specific traders are not 
fontinually in the market. 
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If option a is chosen, the market order will be executed, and the 
investor’s holdings at timet + 1 become = N, + ^N, for which the 
investor pays a total cost of + C" where C" is the total cost of 

transmitting and executing the market order. 

If option b is chosen, then one of two events can take place; (61) 
The limit order is executed. The investor’s share holdings then be¬ 
come = Nt + ^N, for which the investor pays a total cost of ■ 
P/-® -I- C'-‘ C'-* where C^' is the cost of transmitting a limit order to 

the market and C'-* is the cost of executing a limit order. (h2) The limit 
order does not execute and is canceled prior to the next decision 
point. The investor’s share holdings then remain unchanged , = 
Nt) and his cash is decreased by the cost of transmitting the limit 
order, C'-'. Option h will be chosen over a if the gain associated with 
the possibility of trading at a more favorable price outweighs the loss 
associated with the probability of not trading at all. 

The investor must consider four subjective probability distributions 
in order to make an optimal decision. These are: (1) the joint prob¬ 
ability distribution of market bid and ask prices at time < -t- 1, condi¬ 
tional upon the quotes at time l; (2) the probability of a limit bid order 
submitted at time t executing before time I + 1; (3) and (4) the joint 
probability distribution of market bid and ask prices at time / + 1, 
conditional upon the quotes at time t and further conditional on 
whether a limit bid submitted at time t either did or else did not 
execute prior to timet 1. In the Appendix we discuss in more detail 
these four subjective probability distributions (only three of which are 
independent). 

A dynamic programming model of the investor’s choice among 
options a, b, and c is formulated in the Appendix. The investor is 
assumed to maximize his expected utility of terminal wealth, max (i\, 
where is the expected terminal utility of choosing option 
a, trading via a market order; U 2 is the expected terminal utility of 
choosing option b, seeking to trade via a limit order; and U 3 is the 
expected terminal utility of choosing option c, doing nothing. It is 
convenient to focus on the utility gain, or AI/j. which results from 
choosing option a or b rather than option c: AI/, = - f/,; 

= U-APt”) “ V 3 (note that t/j and Af /2 are functions of the 
limit bid price). Clearly At/ 2 (P{-® = P^*) = At/,, since, at this price, the 
market order and limit order strategies are effectively the same (the 
probability is unity of the limit order executing at a price of P^'). 

Let us now consider the conditions under which I/, > max {U 2 , (-' 3 ). 
in which case the investor will submit a market order, or C, ^ rnax 
(U,, U 3 ), in which case the investor will submit a limit order. Supjxise 
that at the current market quotes the do-nothing strategy is domi¬ 
nated (i.e., max [(/,, f/j] > [/,). Given our utility gains AC, and 
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At/aCP,'") and the probability function for limit order execution de¬ 
veloped in Section III above, we know the utility gain from placing a 
market order and can readily obtain an expected utility gain function 
for the limit order strategy. These are depicted in figure 1; the shape 
of the function is explained as follows: 

1. While the utility of a consummated trade decreases monotoni- 
cally with P'-®, the probability of execution increases with P'®, with two 
probability jumps (at the market bid and the market ask). Thejump at 
Pi“" simply reflects the institutional reality that orders placed at prices 
less than or equal to P"" would not have priority over the current 
market bid, whereas an order placed at a price above would be 
at the top of the limit order book. The jump at follows from 
proposition 1. 

2. Since the probability of execution is constant to the right of P”\ 
the expected utility gain has a peak at Pi”'', corresponding to the utility 
gain of transacting by a market order. 

3. The probability of execution increases between P”® and P;”'', 
with the greatest relative increase just to the right of P;”®. We expect 
this large probability increase in this neighborhood because the strat¬ 
egy considerations of other investors might lead them to place limit 
orders just to the right of Pi”® in order to capture the largest price 
advantage. Hence the second peak in the expected utility gain func¬ 
tion will occur at some point P", between P,”® and P”'. 

4. The probability of execution decreases rapidly in a neighbor¬ 
hood just to the left of P”® because of existing limit orders on the 
book. However, since there would be a clustering of limit orders near 
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we would expect the probability of execution to decrease more 
slowly as the limit price, P''*, moves further from Pf'*. Hence, we 
might find a third peak in the expected utility gain function to occur 
at some point P' to the left of P}"*. 

Which of the three peaks in will dominate depends on 

the particular form of the probability function for limit order execu¬ 
tion as well as the expected utility function. In the particular diagram 
presented in figure \A, has positive values (hence a do- 

nothing strategy is dominated), has values greater than At/, (hence 
the market order strategy is dominated), and gives a global maximum 
atP". Assuming only one limit order per investor, the optimal strategy 
is to place a limit bid at P", and hence P" will become the new market 
bid. The setting of such a bid quotation can be considered a Stigler 
(1961)-type search activity."* 

Clearly the limit order strategy need not always be superior; in 
selecting this strategy, the investor afso accepts the chance that the 
limit order may fail to execute, in which case the investor would be 
worse oil than had he done nothing since he would have also lost the 
cost of placing the limit order, C'. If the probability of failing to 
execute is high enough and AT, > 0, the investor will prefer the 
market order strategy as illustrated in figure \H." 


V. The Market Spread 

It is clear from the preceding analysis (Sec. IV) that, with a 
continuous auction market, each investor’s order placement decision 
is made with reference to prices already established on the mar¬ 
ket—the bid and ask quotations which define the market spread. In 
turn, each investor’s order may affect the market spread to which 
subsequent tradets react. Hence, the market spread is the product of 
a dynamic interactive proce.ss. In this .section we show that a non/ero 
market spread must exist, define the equilibrium market spread, and 
relate a security’s equilibrium spread to its thinness. 

The limit order book comprises the limit orders transmitted to the 
market by a subset of the many traders in a security, and the spread is 
es.sentially a gap in the limit order b<M>k. Therefore, having estab- 


By the placement of such a limit ordei. a seller or buyer annouiues his propensity 
to It ade with the lui()C of getting llie auention ot a counterpart market participant who 
would also be wilhrifi; to trade 

'' There are two other versions of fig. 1. One would be analogous to panel .■!, but with 
the global maximum at P' rather than at P"\ in this case, the investor would submit a 
limit bid at P' (Ix'low the market bid). The second version would be where the rero 
fxiint on the utility gain axis is higher tlian both max ^l\(P^) and A/',, in this rase, the 
investor would do nothing. 
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lished that individual investors will sometimes seek to trade via limit 
orders, we must show why a noninfinitesimal gap in the array of such 
orders exists between the market quotes. We do so by considering the 
impact that the jump in the probability of a limit order executing (Sec. 
Ill) has on the investor’s optimal order placement strategy (Sec. IV). 

We have established the conditions under which a limit order will be 
placed: we can now consider whether or not limit orders will neces¬ 
sarily be placed so as always to preserve a nonzero bid-ask spread. By 
continuity of the utility function, and because of the discontinuous 
drop in the probability of execution that must occur at the market ask 
as we move to lower prices, we must have a discontinuous drop in 
as we move past the market ask to lower prices. Then, 
because of continuity of the probability function until we get to the 
market bid (at which point another discontinuous drop occurs), it 
necessarily follows that Af/ 2 (f*('*) cannot exceed Ab', within a nonzero 
neighborhood to the left oi' Hence, no limit bid will be placed 
infinitesimally close to the market ask, and therefore we must have a 
nonzero bid-ask spread. 

Intuitively viewed, consider an investor who is thinking about 
whether or not to place a limit bid at some price below the market ask. 
.As the potential limit price rises toward the market ask, it becomes 
relatively more attractive to transact at the market ask with certainty, 
■At some point, the investor will prefer buying at a discretely higher, 
unitary probability of execution. In other words, certainty of execu¬ 
tion at the market ask creates a kind of gravitational pull which causes 
investors to jump their price once their potential limit bid gets close 
enough to the market ask. This clears out limit bids when they get too 
close to the market ask, thereby leaving a nonzero spread between the 
market bid and ask, 

VVe next consider the issue of an equilibrium market spread. 

Df.hnh'ki.n; In a dynamic trading proce.s.s we define the equilib¬ 
rium market spread as the bid-ask spread at which, lor the next instant 
of time, the probability of the spread increa.sing is equal to the prob¬ 
ability of the spread decreasing.'* 

This definition does not imply that if away from equilibrium the 

Note that the probability is not conditioned on the lurrent status ol individual 
investors, recent prices, 01 other market condiuons. 

Note that we u.se the term "equilibrium market spread” to refer to one particular 
point in the probability di.stribuUon of the market spread, rather than to refer to a 
statistic foi the central tendency of the entire distribution We do not adopt a central 
tendency dehnition (sucli as expected value, mode, or median) of the et)uilibrium 
spread bctau,sc it does not provide a ready link to the market forces that generate the 
spread and to thinness, for empincal research, the average market spread is likely to be 
a rea.sonable proxy for our definibon of the equilibrium market spread. 



EXISTENCE OF THE BID-ASK SPREAD 209 

spread will necessarily move toward it on the next order, but only that 
it is more likely to do so than not. 

We assume that a unique equilibrium exists. To examine the rea¬ 
sonableness of this assumption, first consider an arbitrarily small 
spread. Because the probability of a limit order being placed has a 
discrete jump at we have already argued that there will be no 
limit orders placed in some nonzero neighborhood below P}'^ by all 
investors seeking to buy the security. (The generalization from a 
single investor to all investors takes place simply by choosing the 
smallest neighborhood found among all investors.) Therefore, for a 
sufficiently small bid-ask spread, the unconditional probability of the 
spread increasing must exceed the unconditional probability of the 
spread decreasing as analyzed from the standpoint of the buyer. A 
similar argument holds for the seller. 

Now consider what occurs when the spread widens. The larger the 
spread is, the greater the potential utility gain from an optimally 
priced limit order, while the utility gain from a market order will 
remain unchanged or fall. The gain in utility occurs because the 
shares can be bought (sold) at a lower (higher) price via limit order 
than by market order.'* With the rise in utility UilP}-”), investors will 
begin to shift their preferences from market orders to limit orders. As 
more and more investors shift to limit orders, the probability of the 
spread increasing falls, while the probability of it decreasing rises. 

One of two situations must occur. Either the spread would tend to 
reach a point at which the two probabilities would be equal, in which 
case it would be the equilibrium market spread, or the limit order 
book would be empty on one or both sides." Since when the order 
book is empty market orders cannot transact, we would be at a point 
where the probability of a limit order is greater than that of a market 
order. Although such a point might be achieved without the equilib¬ 
rium market spread ever being attained, we believe it would be 
atypical.'* 


We as.siime that the spread does not impart any intormaiion to the mtcstor on the 
direction of future price movemcnCs. 

“ No olhei p»)ssibility exists. So long as there is a greater probability that the spread 
will increase than that it will decrease, the spread will on expectation grow—limited 
only by the collapse of the limit order book on one side or the other. 

With regard to the a.ssumption of uniqueness, investors might have utility liincuons 
which could lead to more than one equilibnum market spread. Consider the case where 
an investor views a very wide spread as signaling the advent of new information which 
may cause a price adjustment ol unknown direction. The investor sensing greater nsk 
might now choose a do-nothing strategy ovei placing either a limit or market order. 
This could support a very wide spread as a new market equilibrium spread. Of course, 
investors would eventually conclude that the new information was not fonhcoming. 
and trading would resume al its former frequency with the old equilibrium spread 
reestablish^. 
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Our definition of the equilibrium market spread provides a direct 
link between the market spread and thinness. Recall that proposition 
4 showed that the probability of a limit order executing decreases, for 
ail values of as a security’s order arrival rate decreases. Thus, 
ceteris panbus, thin issues have a lower probability of limit orders 
executing than do thick issues. This lower probability in turn de¬ 
creases the proportion of investors choosing limit orders over market 
orders at any given si/e of the spread: this implies that, for a thinner 
issue, a wider spread would be required for the forces that increase 
and decrease the spread to be in equilibrium. Hence we have: 

Proposition 5: I'hinner .securities will, ceteris paribus, have larger 
equilibrium market spreads. 


VI. Conclusion 

We have presented a proof ol the existence of market spreads in 
markets with many traders. The formulation treats continuous prices 
and allows (or heterogeneous trading propensities. The proof is not 
dependent on the demand ot traders for immediacy or on the cost to 
market makers of providing immediacy. 

The literati!I e on bid-ask spreads does not appear to have recog- 
ni/ed that aggregation from individual to market spreads is a con¬ 
siderably more complex process than the standard aggregation from 
indiMdiial to market demand and supply functions. Neither has it 
been established that ordinary investors may sometimes seek to trade 
via limit orders (and at other times via market orders), hence that 
these investors will sometimes supply (and at other times demand) 
immediacy, and that in choosing between market and limit orders 
investors implement an oi der placement strategy. In addressing these 
issues, we have sought to establish the links between transaction costs, 
individual investor order placement strategy, market thinness, and 
market sjneads. 

We have first established that, with transaction costs, the probability 
of a limit order executing does not rise to unity as the price at which the 
order is placed gets infinitesimally close to a counterpart market 
quote. We have next shown that the resulting investor trading strate¬ 
gies generate what we have referred to as a gravitational pull effect. 
Essentially, in the neighbfirhood of the current market bid and ask 
quotations, what might otherwise have Ijeen limit orders are instead 
submitted as market orders (at slightly less desirable prices) so as to 
achieve certainty of execution. These market orders trigger trades 
which clear limit orders off the book, widening the market spread. 
The gravitational pull effect explains why market spreads may be 
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substantial even in markets composed of many traders. Finally, we 
have defined an equilibrium market spread (where the forces that 
tend to widen and to narrow the spread are in balance) and have 
shown it to be positively related to a security’s thinness (measured 
inversely by the order arrival rate). 

Our formulation has several implications for the design of a market 
system. A primary objective of system design should be to expand 
the extent and frequency with which investors interact with the mar¬ 
ket by minimizing various transaction costs. Decreasing variable 
transaction costs will decrease individual spreads and generate a 
greater order How; decreasing the costs of monitoring and com¬ 
municating with the market will also increase the frequency with 
which investors rebalance their portfolios; and consolidating the cur¬ 
rently fragmented system (by. e.g., instituting a consolidated limit 
order book) will reduce search costs and further shrink spreads by 
increasing the effective thickness of the market. These costs are all a 
function of market structure and hence should be amenable to 
reduction by appropriate system design. However, a major cost of 
interacting with the market is the cost of decision making, and this 
might not be subject to significant reduction by exchange organiza¬ 
tion. For this reason, it is possible that, especially for thinner issues, 
spreads will remain sizable in a restructured national market system. 


Appendix 

In ihi.s A[)|)erKllx, wc restate forttially and prove propositiotis 1-4 (jl'Seciioti 
111 abuse and dcselop the more teihnital aspectsol the arguments presented 
in Secttoti IV. 

Si'ilKin III 

Fajiiation (1) ol Seetton 111 ]>resents a compouttd I’oisson process as the 
st<K hastte process whicli generates a sc<|uetue «)l market ask prtces oter time, 
'flits cati be used to detertnine the probabilitv that a litint bid order will 
execute during a paituular lime period. 

.Suppose that the price ol the limit order to be submitted is greater than the 
current market bid but less than the current market ask. that is, that < 
jii.H ^ p.i/c I',, Ji-icrtinne the probability that the potential limit bid will 
execute in a tune interval of length we muse find the probability that P^'{\) 
will dec 1 ease to a price c'ciual to or less than P}".'" There lore, let >'(/-. /'“V Pf'") 
be the piobability that the minimum value that P,*''*(A) achieves in the interval 
0 « A =£ y, ts ecpiai to or less than As/’'" approaches vve would ex pec t 
the probability of the potential limit bid executing to increase, since the 
amount that /’“•'(A) would have to decline would be reduced. However, m the 

'* for P)'' cliscretcly grc.uet than the possibility of anolhet mvcsioi submiiling a 
limit Older with a price greater than PJ" will only lend to decrease the probabilitv of 
execution ol die oiiginal hniil ordei. TIuis our proof of existence ol die probabilitv 
jump at IS con.seivalive 
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limit as PI* approaches P/''* from below, without further analysis it is unclear 
how far the probability will rise. Therefore, let 

(t>(L,Pf*)=l- lim y{L,P{‘\Pf‘'), 

pL» ^ pHA 

whete the limit is understood to be from below. We now prove: 

Propositkjn 1: If Pf'''(A) is generated by the compound Poisson process of 
equation (1), then <l)(L, P’f*) ^ 0 for all intervals of length L < oo.*^ 

PRoot; Since Z, is stcx-hastic with mean zero, P(Z| ^ 0) > 0. Furthermore, 
the value of the fViisson random variable iV(i) will be finite in any interval of 
length L less than mfinity. Therefore, the probability of N(L) consecutive Z/ 
observations that are greater than or equal to zero is given by [P(Z( > O)]*''*, 
which, since .Vf/.j is hnite, must be strictly greater than zero. Notice that if all 
the Z, observations are greater than or equal to zero, the value of Pf'^fA) must 
he greater than or ec^ual to Pf * throughout the interval 0 « A ^ L; thus the 
limit order P|'* would not have executed. This is sufficient to demonstrate the 
probability jump, (ilearly, there are other sample paths that P,**'*(A) could 
have followed which also would have failed to execute the limit order. 

Proi-o.sition 2: It P"'^(A) is generated by the Wiener process of equation 
(2), then Pf^) = 0 for all intervals of length I, < 

PRoot By the reflection principle for a continuous Wiener prexess (see 
Karlin 1968, pp. 276-77) and since In P,"'’(A) is drifticss, 

>■(/., Pr. Pi-") = Prjmin P,"''(A) « Pf-*} = Pr{min In P,*''’(A) « In PJ*} 


= 2 Pr{ln P«/, < InPf-*} = 2 Pt{P}</, < Pj-"} 



where the latter probability distribution follows from the dehnition of the 
Wiener [irocess. Now 


Imi Y{L,Py\P‘,'‘) 

... 





.Substiuiling the vaiiable y = (x — P^^)l(<Ty/L), the preceding limit equals 
(2I\~2tt) /"j exp {-Viy‘}dy = 2(‘/2) = 1. 

Pkoi’osi I ION ,8: If the random variable Z( is expressed as a Bernoulli 
random variable, with Pr(Zj = a) = Pt(Z, = —a) = 1/2, and if the arrival rate v 
of the Poisson process A'(A) goes to infinity, while simultaneously reducing the 
si/e of a in such a way that vcP remains constant, the compound Poi.sson 
proce.ss approaches the Wiener process described by equation (2) with cr* = 
i/a . 

Proof: This follows from the theorem that the characteristic function for 
the compound Poisson approaches that ol the Wiener process (see Parzen 
1962, p. 99), 

Proposition 4: [dY(L, P*'"^, P{'"))/d»' > 0 for all P''" < Pf*'’ and P < ». 

Proof: By the Markov property of the compound Poisson prexess, in¬ 
creasing (decreasing) the order arrival rate by some factor \ is identical to 


This pro[x>sition would also hold for other stcx'hastic jump processes where the 
number of price changes is finite in any hnite interval. 
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increasing (decreasing) the length of time L between decision points by A. 
When A is increased, those sample paths of the process which initially satisfied 

min P{'‘*(A) « Pj-^ 

will still continue to do so. On the other hand, some sample paths where 

min P^*(^) > /’{•* 

will now satisfy the inequality 

min P}*‘*(\) « P',-". 

Therefore, K(i, f’f'®) would increase for A > 1 for all values ot P',” < P^*. 
Similarly, Y(L, Pf*, P'i^) would decrease if the order arrival rate A is de¬ 
creased. 


Section IV 

We now present the more technical aspects of the model of an investor's order 
placement strategy discussed in Section IV The subjectively determined 
probability distributions are specified as follows. First, the probability distri¬ 
bution of future market bid and ask prices is given by h | Pf^^.P^”), 

where A is a Joint density function tor market asks and bids (PflJ, P“") at time 
< -(- 1, given the prices at t. Consistent with random walk models of security 
price behavior, we condition future prices only on current prices. (Consistent 
with Section 111, we let the investor’s subjective probability of a limit order 
executing before t + 1 be given byPi"*), where P}-* is the limit bid 
price. (Again, this is consistent with random walk theory, since we need know 
only current market quotes to predict whether or not the limit order will be 
executed.) Last, we assume the investor determines conditional probability 
distributions of future prices m the event of either a successful or unsuccess¬ 
ful limit order. That is, for a limit bid at price P}-" submitted at lime I, let 
k(Pti'],P}i''i I Pf'^, P;’"', Pf-®) be the joint density function of market bid and ask 
prices at time / -t- 1 if the limit bid executes prior to / + 1, and let /(P^J- 
Pm I Pf*- Pf". Pi'") be the joint density function at time t + 1 il the limit bid 
fails to execute prior to t + 1, Note that only three of the tour subjective 
probability distributions A,/», A, and / can be independently determined, since 
by Bayes’s theorem we must have 


A(x,y I P?‘'.Pi"‘) = /tfPf'.Pr.PDAfx.y |p,''\ P?"’. P!*) 

+ [ 1 - P(p;", P*>\ P;''«)]/(x, v I p,”". p,“". Pi”) 


for all possible values of P"'', P?'*, and PJ"®. 

We can now give a formal statement of the model. Given observed market 
quotes of P,*''* and P,"* and holdings by the investor of .V, shares of the security 
and S, dollars in cash, at time t the investor’s expected utility of terminal 
wealth can be written as i/i, = fAP^*, P"”. N,, S,).'" 


Note that the investor’s underlying utility function measures the uulitv ot the 
wealth he will possess at some honzon T, where T > I. The investor will choose those 
decisions at times t. / + I, , .., T - 1 which maximize the expected ulilitv of his terminal 
wealtli. As of time ( (the investor’s current decision point), viewing the expectauons 
operator as ranging over relevant random variables pertaining to times between / and 
T, we define 1 ( 1 , to be the expected utility of the investor’s terminal wealth ('.learly this 
depends ufxin the assets the investor has at time / (N, and S,) and the current market 
prices at which the investor ran buy or sell shares {Pf* and PJ'"). 
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Wc now derive an expression for ifi, in terms of the various probability 
assessments and costs. Write ip, = max (U,, U 3 ), where (/, = expected 

lerininal utility if a market order is placed at time 1 . = expected terminal 

utilit) if a limit order is placed at time/, and L ’3 = expected terminal utility of 
doing nothing at lime I. More precisely, letting x and y be values for the ask 
anti bill piites at lime / + 1, respet lively, we have 


= I f (••'•v.A't + - (■;“ - N ■ P;o) ■ h(x.y\P’^*,PY'')dxdy. 

As staled m Section I\’, we asMitiie that the number of shares to be purihased, 
A.\', is fixed. Delme the runciion 


L .,(P‘«) = fjiP'x, />,»") j I J,„(x, V. .V, + A.V, 

a 1 ) a 0 

X\ ■ P',") ■ k{x,y\Pl'',P;"‘.Pl'’)ilxdy 

+ [ 1 - p(P‘,'‘, py\pm \ f ,(.v, V, s, - /;'•') 

Jo Jo 


■ /(X, VI p;'\ pr. pyuxdy 

(ot all possible jnices of the limit order. I'hen we have 

r, = max L'..{P',"). 

PI" 

I'ltKills. for the tio-nothing option we have 

I', I I /-t ,(x. V. A'„ S,) ■ k(x, V I Pr.Pmixdy. 

J 0 J O 


1 he dvnainii programming retur.sion permits ns to obtain a solution foi-y, 
111 the order - l.'P — 2,... ,2. This is possible since we assume the utility 
\ahie Ifir/; is known for all values of ihe parameter T” A'y, and Sj. We 

base defined tins retursioii in terms of the four stale variables f’i*''', T”", iV,, 
and S, We also have two det ision variables. Pi" and the decision as to which of 
the three courses of action to lake. 
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Land Value Capitalization in Local Public 
Finance 


David A. Starrett 

Stanfnril Vnix'i’rsity 


\\c explore the ronditions under which the welfare benehts of local 
piibiic fToods projects will he capitalized into land values. We find two 
i\pes of sufficient conditions, one involving similar communities and 
the other involving differentiated communities. The form of 
(apitali/.dtion differs between these tases, and we explore the nature 
of these different es. V\'e also examine intermediate ca.ses and iden- 
tif\ models in which there will Ije no capitalization. 


I. Introduction 

The idea that the value of local public goods projects will be 
capitalized into land rents has a long tradition in the public finance 
literature, dating back at least as far as Tiebout (1956). F.mpirical 
evidence has been mixed, but .some positive results have been re¬ 
ported by Oates (1969), among others. This simple idea is potentially 
very powerf ul. It provides a simple measure of project benefits which 
reveals (if perhaps after the fact) individual preferences for public 
goods. And if rent changes can be predicted in advance, it may 
provide an objective function for community decision making. 

Furthermore, the presence of capitalization forces has important 
equity implications. To the extent that capitalization occurs, benefits 
tend to accrue to landlords at the expen.se of renters. We will show 
that capitalization reflects a real welfare increase only if these two 
groups are considered equally deserving. If, on the other hand, rent¬ 
ers are considered more deserving, then the presence of capitaliza¬ 
tion clearly will worsen the distribution of benefits within society. 
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The “capitalization hypothesis” is interesting from a purely 
theoretical point of view as well. It is clear that increases in land rent 
values are not themselves a net social benefit; owners of the land will 
benefit, but renters will suffer. Indeed, in the standard surplus theory 
of welfare measurement, changes in land prices (or prices in any 
other competitive market) never appear as net benefits (in aggregate, 
the gains to winners just offset losses to losers). Thus, if capitalization 
occurs, it must happen because the true benefits induce a corre¬ 
sponding change in rents. 

We will argue that there are two separate and independent forces 
which can induce capitalization, one involving competitive forces be¬ 
tween communities and one involving such forces within a particular 
community. Furthermore, these two forces have quite different im¬ 
plications for (1) what benefits are capitalized and (2) where those 
benefits are capitalized. Thus, we will derive several different capitali¬ 
zation theorems and explore the conditions under which each holds. 
In the process, we will find some conditions under which capitaliza¬ 
tion does not occur at all or occurs only partially. 

Let us refer to capitalization which derives from forces between 
communities as “external capitalization" and capitalization which de¬ 
rives from forces within communities as “internal capitalization.” The 
intuitive argument for external capitalization is quite simple. Suppose 
that a project is built in one community designed to make people 
better off there. Now, if people in all communities have similar tastes 
and are free to move among the communities, outsiders will necessar¬ 
ily be attracted to the project-building community. And they will 
continue to move until the welfare incentive disappears. The only 
factor which can stop this movement is a differential location cost, that 
is, an increase in land rents in the project-building community. This 
type of argument is the one used by Polinsky and Shavell (1976) to 
justify a form of capitalization. However, the Polinsky/Shavell model 
is not very general, since the benefits accrue to absentee landlords. We 
will show below that increases in land values generally will overstate 
true net social benefit, although they may be a good approximation to 
community net benefit. 

Naturally, when we drop the assumption of free mobility between 
((immunities or the assumption of similar tastes between com¬ 
munities, the intuition for external capitalization is weaker. Indeed, 
several authors have argued that we should not expect land value 
capitalization in specialized (or Tiebout) communities which are im¬ 
mune to migration forces due to differences in tastes (see Hamilton 
1975 and Polinsky and Rubinfeld 1978). 

However, there is a quite different argument for internal capitali¬ 
zation which is based on specific properties of local public goods. 
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Presumably public goods are local in character when proximity mat¬ 
ters. For example, one has to make trips in order to appreciate such 
public goods as parks, museums, civic centers, sports complexes, and 
even roads and highways. Now, if there is a positive correlation 
between the amounts of {uiblic goods and the differential desirability 
of various locations, some degree of capitalisation will occur as rents 
adjust. We will show later exactly what conditions are necessary for 
full capitalisation. This type of argument has been used in the litera¬ 
ture to justify the use of rent gradients for measuring the cost of 
pollution near a .source (such as an airport or factory).' It is also in 
the spirit of arguments for capitalization given by a number of 
authors, among them Strotz (1968), Lind (1973), and Pines and Weiss 
(1976). These authors essentially start with the assumption that a 
pioject will differentially improve land quality and explore whether 
or not those improvements will translate into rent increases. I'he 
resulis are instructive, but the underlying approach seems to beg the 
question somewhat. I'be (irimary question is, Will a local public proj¬ 
ect change the differential quality of land? For the rase of irrigation 
projects discussed by Lind, the answer is clearly yes, but in other cases 
It IS less obvious. Indeed, we will show that, for some types of public 
goods, no capitalization occurs at all. 

We w ill want to pay attention to the system of taxation in discussing 
botli typos of capitalization. One might guess that the form and 
nature of capitalization will depend on whether or not the public 
goods are financed out of a property tax. We will show that this is 
.sometimes, but not always, the case. 

We adopt a somewhat generalized version of the Polinsky/Shavell 
community model in the sequel. This model has a number of special 
features but is chosen because it facilitates the exposition and makes 
for easy comparisons with the pieceding literature. However, many of 
the results reported here remain valid in a much more general eco¬ 
nomic model. The interested reader is referred to .Starrett (1978) for 
some general results on capitalization. 


II. A Prototyjre Model 

Throughout this section we deal with a model in which there is a 
single local public good (q) being produced independently by a num¬ 
ber of communities. Further, we will suppose that projects initiated in 
a community will not affect private good.s prices other than land rents. 


For an example of this aigumeni in the hteratuie, see Freeman 1971. 
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This may seem a serious restriction, but it is not, really, since changes 
in other prices will have the usual canceling effects on the two sides of 
the market. With nonland private goods prices fixed, we can aggre¬ 
gate all these goods into a single commodity and let this commodity be 
the numeraire in what follows. 

Agents must engage in effort (take trips) to enjoy the public good. 
Naturally, such trips are costly; here we will assume that the expense 
can be represented by a simple cost function in terms of the 
numeraire; where g is the number of trips taken and s stands 

for location within the community. Location will be specified by di¬ 
viding the community’s land into discrete zones and assuming that 
location can adequately be specified by designating the appropriate 
zone. 

Each consumer gets to choose a zone and an amount of land within 
the zone (/j. The market for land will be assumed competitive, and r, 
will stand for the rental rate in zone s. 

(Jonsurners earn income in terms of the numeraire. We will want to 
separate this income into three terms; profit shares (H), rental income 
on land (R), and other income (mainly labor) (Y). Thus, an individual 
in income class ? has income /' = IT -E E' + /?'. Such an individual will 
be required to pay taxes T‘. Fhe 7' may be a function of other 
parameters, depending on the nature of taxation. 

A consumer ol preference type « has a preference function of the 
form -T‘ —/ (g.s) — r,7). The arguments of this function 

are the level of public goods provision, the number of trips taken, the 
amount of land consumed, and the amount of other private gix>ds 
consumed. It seems appropriate that bothg dndq should affect utility. 
For example, if the public good is a museum, agents will care about 
the size ol the museum as well as the number of visits made. 

Some of the results l eported below will depend on the properties of 
(' and /. Specific assumptions will be introduced as we go along, but 
the following general discussion may be helpful here. It seems rea¬ 
sonable that the marginal cost of trips (df/dg) will increase as the 
distance from the public good increases. We exptect to find that use 
decreases with distance if and only if this assumption holds. Also, it is 
reasonable to suppose that the marginal benefit from public goods 
(dUldq) increases with use (g). This marginal benefit will decrease 
with distance whenever both assumptions hold. 

Each consumer in a particular community gets to chooseg, (, and s. 
(At a latet stage such a consumer may be able to choose a community 
of residence as well.) We will assume throughout that there is free 
mobility within the community, so that s is a free choice. It is ana¬ 
lytically convenient to think of each resident as making a conditional 
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choice of g and f for each s. This process generates a first-stage 
optimization problem: 

max (/“[q.g/,/' - T -/(g,s) - r/J. (1) 

of 

A solution to problem (1) defines demand functions g?' s g'>(q,I* — 
T,r,„i) and ^ ) and an indirect utility function Vf = 

V"(q,T,J' — At a later “stage," the consumer makes a discrete 

location choice, seeking to max, Vj'. We will use the notation V®' to 
denote the final indirect utility function. 

The remainder of the community economy consists of firms, about 
which we need to .say little except that they behave competitively and 
are not affected by the public goods, and the government, which 
produces the public good from private goods, paying for those goods 
with revenues from the taxes levied on households. Letting r(^) stand 
for the public goixls cost function, the government must satisfy a 
balanced budget condition T ~ ^ 9 ), where T stands for loial tax 
revenue. 

We assume that the private .sector is always in market equilibrium. 
For nonland commodities, this simply means that markets clear at 
fixed (unchanging) prices. For land, it means that rental rates must 
adjust until the market for land clears in each zone. 

We will formulate questions about capitalization as follows. We 
measure welfare using a standard Bergson-Samuelson formulation. 
We ignore distributional considerations by assuming that the distri¬ 
bution of income is initially optimal (such an assumption is clearly 
necessary for pure forms of capitalization). Next, we consider an 
initial situation and a proposed new project. Some form of capitaliza¬ 
tion occurs when there is a correlation between the resulting welfare 
change and the associated rental change. We will see that this correla¬ 
tion can take many forms. 

All precise statements about internal capitalization require some 
boundary condition on community rents. This boundary condition 
will take different forms, depending on the structure of the town and 
the organization of taxation in the town. We will discuss two different 
types of communities in this regard. In the first type of community 
the boundaries will be considered potentially variable, with “outside” 
land always available to the community at an exogenously given op¬ 
portunity cost; the boundary rent is thus exogenously given in such a 
community. The real prototype for such a community would be a 
town or metropolitan area located in the middle of farmland. We will 
refer to such communities as isolated. 

In the second type of community, the boundaries are predeter¬ 
mined (presumably by legal arrangement). The ef fect of a project on 
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boundary rent will be determined by changes (if any) in the aggregate 
demand for land in the community. The real prototype here would be 
a subcommunity within a larger metropolitan area. We will refer to 
such communities as adjacent. 

The local public goods model being proposed here seems most 
applicable to the case of isolated communities, since it is difficult to 
ignore the importance of direct spillovers among adjacent com¬ 
munities. However, it still seems useful to treat the adjacent case. 

To demonstrate that there are separate forces generating external 
and internal capitalization, we will need to show that it is possible to 
have each without the other. We will begin with a discussion in which 
we assume that no external forces are present; that is, we assume that 
community policy will not lead to any net migration to or from the 
community. Within this context, we explore the conditions for inter¬ 
nal capitalization. We derive alternative sets of assumptions under 
which there is no capitalization or full capitalization. 

Then we will return to the issue of external forces. We will give 
these forces their best chance to work by assuming free mobility 
between communities. To isolate the effect of these forces, we will 
adopt a model of the community which is consistent with no internal 
capitalization. Within this context, we again exhibit conditions under 
which there is either no capitalization or full capitalization. 

A. Internal Capitalization 

Assuming away migration effects, there are essentially two ways in 
which project benefits can gel translated into increased land values; 
through a change in the extensive margin or the intensive margin. If a 
project increases the aggregate demand for land, it may lead to an 
increase in the general level of rents (by operating on the extensive 
margin for land). Intuitively, this force could go either way. When 
taxes are raised (to pay for a new project), there is likely to be a 
substitution away from goods (including land). On the other hand, 
when public goods provision is increased, there is likely to be a 
complementary increase in the demand for land. Since the net effect 
on rents is clearly ambiguous, there is no reason to expect systematic 
capitalization from impact on the extensive margin. In fact, we will 
argue that the extensive margin.is likely to be of no importance, 
especially in isolated communities. 

However, the project may affect the differential value of locations 
and thereby influence the rent structure through the internal margin. 
We will show that this force may lead to systematic capitalization in 
relatively large communities, within which there is a wide range of 
choice regarding use of the public goods. Restricting this range of 
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choice can lead to any intermediate case from full capitalization to no 
capitalization. 

B. Sujfiaent Conditions for No Internal Capitalization 

The essential restriction which is needed to rule out internal capitali¬ 
zation is that there be no effective differential choice in the use of 
public goods within the community. A number of sets of assumptions 
will do for this purpose. 

The easiest case to explain is one in which there is no differential 
choice within the community at all. By this, we mean thatg^ = g and f 
= i for all agents in all zones; these conditions would hold if the plot 
size is institutionally given and the public good has the characteristics 
of national defense so that re.sidents get the benefits regardless of 
their actions. 

A resident of preference type a and income class i now chooses 4 
(the only remaining choice variable) to max^ -r -f(g,s) 

- r,C]. Cdearly, this problem reduces lor every agent to one of 
minimizing costs; min, t/(g.t) - r,?]. Since the costs to be minimized 
are the same lor ail agents, the only possible equilibrium rent struc¬ 
ture must be one which makes those costs independent of ,s. (At such a 
cost structure, everyone is obviously indifferent as to where they live.) 
Now, this independence condition must be true both before and after 
the project i.s initiated. It follows that the derivative of costs with 
respect to q must be independent of s. But with g and ( fixed, this 
means that (dldq)T, must be independent ofs; that is, if rents change at 
all they must change in a uniform way. However, rents at the bound¬ 
ary cannot change; There is no change in the aggregate demand for 
community land, so the boundary rent will not change in either 
isolated or adjacent communities. Hence there is no capitalization. 
Thus we can identify one case which formalizes the assertions of 
Hamilton (1975). 

We can introduce further choice into the community and obtain the 
.same results as long as we are willing to restrict the degree of diversity 
and the degree of complementarity of tastes. For example, suppose 
that tastes are separable and similar in that preferences of all agents 
can be represented in the form U = C'{q,g) + 0'^(€) + I' - T' - 
JigrS) — r,(, where e indexes incomes class, as before. Now, if it should 
turn out that the optimal choice ofg is independent ofs (so that there 
IS no effective differential choice within the town), we can again argue 
that there will be no capitalization. Clearly agents still make the same 
choices regardless of income class. Thus, rents must still adjust so that 
everyone is indifferent as to where they live {V, is independent ofs). 
This condition must be true both before and after the project, so 
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(d/dq)V‘, must be independent of s. Differentiating and using the 
envelope theorem, this condition becomes that 




( 2 ) 


is independent ofs. But since we assumed thalg, was independent of 
1 , the first term of (2) is independent of s, and we are back to the 
statement that the rent structure must change uniformly if at all. In 
fact, it cannot change at all, because there is no change in the exten¬ 
sive demand for land (given separable preferences).^ 


C. Sufficient Conditions for “Full” Internal Capitalization 

Internal capitalization can take a number of different forms, de¬ 
pending on specific assumptions made concerning the boundary con¬ 
ditions, the type of taxes imposed, and the nature of property 
ownership. We will delineate these cases as we go along. As will be 
seen, all of the propositions reported here rely exclusively on the 
intensive land margin to “enforce” capitalization. There may be some 
very special functional forms which will generate systematic capitali¬ 
zation from the extensive margin, but these would seem too special to 
be of much interest. 

There are two principal assumptions which taken together 
guarantee that project benefits will be translated to land rents through 
the intensive margin. The first is that all the benefits of the project 
must be “intrainarginal" in that boundary residents are marginally 
unaffected. We would expect this condition to hold if marginal resi¬ 
dents choose g = 0 (so that there is a complete revealed range of 
choice within the community). But the condition they may hold more 
generally: for example, in the case of a museum or park, “boundary" 
residents may make so few trips that, at their current level of activity, 
they could make no better use of a larger facility. We formalize this 
condition as follows. Let cr stand for some boundary region. Then 
“marginal indifference” will mean that 

- r> -f(g,cT) - rj\,] = 0. (3) 

The second necessary assumption is that residents do not tend to 
sort themselves out within the town according to their relative prefer- 


“ This case is closest in spii it to the example of no capitalization giv en m Polinsky and 
Shavell (1976). It also corresponds to Hamilton’s (1975) model of Ticbout commiiniiies 
in which there is no internal spatial striicliire. As weshcjw below. Hamilton’s conclusion 
that there would be no capilaliralion in Ticbout communities dc|)end5 critically on the 
assumption of no internal spatial structure. 
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ence for the public good. Since systematic differences will generally 
lead to sorting, this means that the community must be reasonably 
homogeneous in attitudes toward public goods. We will see later that, 
when sorting occurs, it will tend to mitigate capitalization. 

The required homogeneity will result whenever agents are addi- 
tively separable in their preferences on any component of consump¬ 
tion concerning which they systematically differ. Here we will treat 
the case where individuals differ in incomes and have preferences 
which are additively separable in the associated numeraire consump¬ 
tion good. 

Consequently, we now assume that a resident (i) of the community 
has preferences which can be represented in the form U* = 0{q,g,() -f- 
/' — T' — fig,s) - r,€. Now, it is obvious that residents will not sort 
themselves out by income class. Conditional on living ats, all residents 
would make the same choices of g and € ig*, = g„ all s), so at 

any specified rent structure, they would all agree on the best location; 
hence, as before, the rent structure must adjust until all locations are 
equally desirable for all residents. 

We now evaluate the first-order welfare effect of a new project: 

dlV =2a>,(fF'. (4) 


Since we are ignoring distributional factors, we must assume that 
whatever income differences prevail are “optimal,” implying that the 
welfare weights (<o,) are equal (otherwise, pure transfers could be 
made in such a way as to improve net welfare). We normalize units by 
setting the common weight equal to one. In evaluating (4) we treat 
everyone as if they were living at the boundary region cr and take into 
account all potential effects which q could have on the parameters 
faced by consumers. 

Performing the differentiation, applying the envelope theorem 
where appropriate, and aggregating where possible, we have 


^ = Af 0(q,g,,e„) - 

dq Oq dq 




dY 

dq 




dn 

dq 


dR 

dq 


dT 
dq ' 


where N stands for the total population of the community and the 
income variables without superscripts stand for aggregates over all 
re.sidents. Now, marginal indifference (3) means that the first term is 
zero, and dYIdq must be zero as well, since Y cannot change under the 
assumption that nonland private goods prices are fixed. Furthermore, 
government budget balance implies that dT/dq = dVtdq. Making these 
substitutions, we have, finally, 

^ dr„ 

dq dq dq dq " dq ' 


( 5 ) 
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The formula (5) can be thought of as a generic capitalization result 
from which a variety of specific relationships can be derived, 

1. Local Ownership 

Let us first look at the case where local firms (as well as local land) are 
all owned by local residents. In that case 11 should be thought of as 
total profits generated in the region. Assuming that local firms behave 
competitively, we know that dfl can be written in the form dfl = X - Sp, 
where X is the vector of net outputs and p the private goods price 
vector. But the only prices which change are land prices, so we have 
dU/dij = -{dRfIdq), where Rf \s the value of land used by firms. 
Substituting this relationship into (5), we have 

dW _ dRr _dT _ dr„ 

dq dq dq dq' ^ ’ 

where is the value of residential land. 

Alternatively, we could take the view that firms exhibit constant 
returns to scale and are constantly in long-run equilibrium. In this 
case, the above analysis is invalid since any infinitesimal change in 
prices will lead to a discrete jump in the firm’s decisions; in particular, 
if rents go up, ceteris paribus, all firms go out of business. The correct 
analysis would then set dfl = 0 in (5) and conclude (as in most of the 
previous examples in the literature) that capitalization involves all 
land value. Upon reflection, it seems that the long run is too long to be 
relevant here. Firms are not free to leave when rents rise, and the 
increasing rent payments by firms should be counted against profits. 
Hence we will stand by the formulation (6). 

7 he final form capitalization takes depends on the boundary con¬ 
ditions. In the isolated case, it seems reasonable to suppose that drjdq 
= 0 even though the boundary of the town may shift. Conceptually, 
we can think of there being many boundary regions which are shared 
by residents and farmers. As long as some of these regions are still 
shared after the project is initiated, boundary rent cannot change. 
Thus for the case of isolated communities, we assert dWIdq = (dR^ldq) 
- (dl'ldq). Verbally, the gross benefits of the project are capitalized 
into residential land values. Net benefits (dWIdq) are gross benefits 
minus costs (dr/dq). 

A more general statement can be made which will apply to adjacent 
communities as well. If the aggregate demand for land increases so 
that drjdq > 0, then the value of residential land overcapitalizes gross 
benefits ([dRrIdq] — N€„[dr„/dq] exaedy capitalizes gross benefits) 
while if drjdq < 0, then the value of residential land undercapitalizes 
gross benefits. 
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2. National Ownership 

The maj'ority of firms are not locally owned. Indeed, most firms are 
national in the scope of their operations, and local ownership has no 
meaning for such firms. Suppose we take the position that all 
ownership is in national firms. What happens to the analysis? Ac¬ 
tually, not very much. Indeed, the formula (6) is still correct for 
society as a whole, but now some of the costs of the project are paid by 
“foreigners” who own shares in the firms which are operating locally. 
Indeed, if we take the position that each community is small relative to 
the country, then it is a good approximation to assume that local 
branches of national firms are owned entirely by outsiders. In that 
ca.se, it follows that gross benefits to residents of the community are 
capitalized into total land value, while gross benefits to society as a 
whole are capitalized into residential land values. The difference 
represents an externality. We will discuss other examples of such 
externalities later. 


3. Property Taxes 

Up until now we have dealt with only the case in which taxes were 
treated as lump-sum taxes by consumers. Since the major locally 
imposed tax is a property tax, it is important to see how the analysis 
needs to be modified in that case. If an ad valorem tax at rate t is 
imposed, then an agent using an amount of land ( in zone i will pay 
taxes tr,f, where r, must now be thought of as the rent net-of-tax 
payments. 

Now, the problem for a typical individual can be stated as max„Y., 
[0 + /‘ — f(g,s) - r,(l + <)^1- Clearly, it is still true that all agents 

will make the same choices, and, indeed, the method of analysis is 
exactly as before. Performing the < alciilations (and taking the point of 
view of society in order to avoid any ambiguity concerning the 
ownership of firms), we derive 

where Rr now stands for total residential rent net of taxes. 

The exact form of capitalization again depends on boundary con¬ 
ditions. And now there is some ambiguity concerning these even in 
the case of isolated communities. The issue revolves around whether 
farmers in the boundary regions do or do not pay the taxes. If they do 
pay the taxes, then the free boundary condition is as before. However, 
if they do not pay the taxes (but would have to if they acquired 
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property “within" the community), then the appropriate condition is 
+ t) = f, where f is the opportunity cost of land to a farmer. We 
will refer to this arrangement as agricultural zoning and to the other 
case as blind zoning. 

The case of agricultural zoning is easiest to analyze. Differentiating 
the boundary condition with respect to q, we have (1 + t)(drg/dq) + 
r„ldtldq) = 0. Therefore, referring back to (7), 


dW _ dRr 
dq dq 


( 8 ) 


Here net project benefits are capitalized into net-of-lax residential 
property values, while gross project benefits are capitalized into 
before-tax property values. 

For the case of blind zoning, the term involving drjdq vanishes in 
(7), and we are left to e\'AunieNr„£„(dtldq). Suppose that only residen¬ 
tial land is taxed, so that the government budget balance condition is 
iRr = r. Differentiating this equation with respect to q and substitut¬ 
ing for dl/dq in (7) yields 


dq dq \dq dq / dq dq 


(9) 


where « = NffJRr- The net benefit is now a weighted difference 
between increases in after-tax residential property values and in¬ 
creases in cost. Note that the smaller is «, the closer we approximate 
the results for the previous case. The o stands for the ratio of the 
value of land residents would use if they all lived at the boundary to 
the actual value of residential land. 

More generally, however, (9) tells us that increases in residential 
property value overcapitalize net benefits by an amount related pos¬ 
itively to the gap between extra costs and the extra revenue that would 
be generated without changing tax rates. Clearly, this result is inter¬ 
mediate between pure gross capitalization and pure net capitalization. 


D. Pieference Difference!, within the Community 

Once systematic preference differences are introduced into the com¬ 
munities, the exact capitalization results tend to break, down. Here, we 
demonstrate this and indicate the type of modification which is re¬ 
quired. For this purpose, let us consider the simplest extension possi¬ 
ble, one with two different types of residents in the town (indexed a 
and h). Let there be N„ (N^) «-type {<>-type) residents. It is convenient 
to assume that all a-type residents have the same preferences and 
income (although different incomes could be incorporated as before). 
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Also, we find it necessary to suppose that the plot size within any given 
zone must be uniform, and we may as well assume that this plot size is 
predetermined. 

Now we let S" stand for the set of zones occupied by o-type residents 
and S'* for the corresponding set of zones for ft-typ»e residents. Obvi¬ 
ously if cr is included in both sets, the analysis is exactly as in previous 
sections; indeed, all previous theorems go through with preference 
differences as long as those differences do not lead to systematic 
sorting. Therefore, without loss of generality, we can assume that the 
zoning arrangements are as in the Venn diagram shown in figure 1. 

Since the zoning is discrete, there is little loss of generality in 
assuming that at least one zone is occupied by both types. We let t 
stand for such a zone. 

In what follows, it is convenient to use the shorthand notation Ff(q) 
= - f(gf,s), all .«,* = ajb. Thus, V? can be written in the 

form y" = F^(q) - r,F, + I". Next we evaluate welfare as before, 
except that now we cannot evaluate everyone’s welfare at cr; instead, 
we evaluate a-type welfare at a and 6-type welfare at t: 

W =N„Vi+N^V^, (10) 

= N„F- + + / - N„r„(, - N,r,e,. 


Finally, we make use of the fact that a-type agents are indifferent 
between locations t and cr. This indifference implies that 


rr^r = + F?(q) - F"(q). (11) 


Substituting (11) into (10) yields W = NF°{q) + Ni,[F^{q) - F"{q)] + I - 
and differentiating with respect to q, we obtain: 


dW dU° . „ ^ dr„ dll dR dT 


+ V* 


(9.^/r) - -^(9.,^/r) 


dq 


dq 


( 12 ) 


Examining (12), we see that the terms are just as before except for 
the intramarginal benefit term: 




dU'' 

dq 


(9,g?/r) 


dU” 

dq 


(q,g?A) 


(13) 


We will argue that there is a strong presumption that this term is 
positive. To begin with, the condition g? > g? is a stability condition for 
the town structure we have specified. It guarantees that there is a 
differential benefit to an a-type agent relative to a 6-type agent as one 
moves toward the boundary of the town (6-type agents gain relatively 
more from proximity to the center since they make more trips). 
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Fig. I 


Therefore, as long as the marginal benefit from more public goods 
increases with the number of trips, the intramarginal term (13) must 
be positive. 

Thus, when systematic preference differences are introduced into 
any particular variant of the model studied above, land rents will tend 
to undercapitalize the associated benefit. Operationally, this means 
that if land rents were used as a decision criterion, we might end up 
rejecting projects which ought to be accepted. 

E. Sufficient Conditions for No External Capitalization 

Having explored the conditions for internal capitalization in a com¬ 
munity that was insulated from immigration forces, we turn now to a 
study of those external forces. We will show first that any systematic 
differences between towns can serve to neutralize the external forces 
at least with respect to their effect on small projects. There is an 
important caveat here, however. What we show is that communities 
could arrange things so as to insulate themselves; we cannot show that 
they have an incentive to follow such policies. Indeed, we have shown 
in some related work that communities do not have a myopic incen¬ 
tive to insulate in many instances.’ We will ignore this issue here. 

We employ the simplest model of community here, in which there 
are no choices with regard to public goods use. The reader might 
usefully recall that this is one of the cases in which there is no internal 
capitalization. All agents within the town are assumed alike. To make 
the point about potential insulation in the strongest possible way, we 
will al-o assume that preferences of all individuals in all towns are 
alike and that towns differ only in income levels; that difference alone 


■’ See- Starreu 1977. 
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1 * Artiiallv this case may not be SO unrealis- 

is enough to insulate town^ X- observed Tiebout 

tic. since sorting by income level is 

differentiation among communities. n a w,. ran snerifv the 

Referring back to the first model m Section lU. we can 

characteristics of community A completely by a leve 

(^*X an income level of residents (/*)/ and a cost level (A T 

+r*/,) which must, at equilibrium, be independent of location 
(s). It is convenient to assume that the g and ^ levels are the same 
everywhere so that they can be deleted from the analysis, although 
this is not really necessary. Note that, in the model as now constituted, 
there is no economic distinction between property taxation and 
lump-sum taxation. Now, the welfare (f/**) of an individual with 
income /*■ living in his own community can be expressed as f/*^* = 

- A*). 

We now ask the following question. Suppose that a set of f's, I's, 
and A’s is specified. Will anyone have an incentive to move immedi¬ 
ately, and if not would anyone see such an incentive if one of the ^'s 
were changed marginally? We will show that an appropriate initial 
choice will make the answer to both these questions “no” as long as 
income effects are not negligible. Suppose a k agent considers moving 
to /. We assume that, if he does this, he must pay the taxes (and other 
costs) appropriate to/. (If, instead, he expects to pay the taxes appro¬ 
priate to k, as he surely would if the tax were a property tax, the 
results are still the same, but we would have to incorporate the taxes in 
the I term rather than the A term.) On the other hand, an agent’s 
income is determined by his place of initial ownership. This must be 
true of property income (as long as there are no unanticipated capital 
gains), and labor income is the same everywhere, by assumption. 
Hence, he would expect to get utility (7*^ = t/(y^,/* - h‘). 

The move is not desirable, and would not be desirable for any 
sufficiently small variation in q\ as long as f/** > Similarly, a move 
from j to k is not desirable as long as (/^* = U - h'') < U - h^) 

= f/". Are these two conditions consistent? Let us order; and k in such 
a way that > y'. Then stability certainly requires A* > A^. To see what 
else is required, it is convenient to change variables slightly. If we 
dehne ^ - 9 *, z* = 7* - A*, = P - A*, and ^z = h'’ - h‘, then the 

two stability conditions may be written as U(q'',z'‘) > U{q'‘ + ^q,z'‘ + 

Az) and U (q^.z^) <U(q'‘ + d^q,z‘ + Az), with A^ < 0 and Az > 0. These 
conditions are consistent as long as/* > /^ as can be seen in figure 2 . 

Of course, we have assumed in the construction that the income effect 
is normal and significant, so that increases in income increase the 
marginal rate of substitution of z for q. Clearly, community A could 


* 1 he /* is clearly the average iiiromc in the community. 



land value capitalization 3a 1 



engage in any project of size less than or equal to 69 * without inducing 
entry, while a similar statement holds for community j. 

Naturally, if projects undertaken are too large, entry may be in¬ 
duced. However, we assert that any systematic differences between 
towns (direct preference differences would have done as well) can 
imply no external capitalization of marginal projects. This same 
threshold effect would result if the segregation of communities were 
caused by a fixed tost of moving (rather than by preference differ¬ 
ences).'' 

It is interesting to note that if we now reintroduce the possibility of 
internal capitalization, its presence may be enough to neutralize ex¬ 
ternal foices, even if there are no intrinsic differences at all among 
people. That is, if^* > ^ implies> R’, we will automatically have/* 
> U, assuming that there are no other income differences. 

We chose income differences to illustrate the pirssibility of insula¬ 
tion because they were the most innocuous-sounding differences we 
could think of. Obviously, preference differences are more likely to 
serve this purpose.® Indeed, some might argue that income effects are 
unimportant in the context discussed here. Suppose that we reexam¬ 
ine our problem in the absence of income effects. Interestingly, the 
whole character of the analysis changes. 


’ Of course, if there are no costs of moving and a continuum of preference types, any 
project will induce some entry and. presumably, some capitalization. 

• Since the Ticbout model is cliaracterired by preference difference amon^ com¬ 
munities, there will be no external capitalization of small projects in Tieboiit com¬ 
munities that are “sufficiently” differentiated. 
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F. Sufficient Conditions for Full External Capitalization 

Without income effects or preference differences between com¬ 
munities it is impossible to insulate, and the resulting migration forces 
will generate external capitalization. To show this, let us reintroduce 
preference differences within communities while at the same time 
eliminating any income effects or preference differences between 
communities. Ana-type resident of income class i now has preference 
of the form V“' = (7"(^)4^/' — A. At this point, it is convenient to define 
a fictitious “average” resident of community k. This average resident 
clearly would have preference of the form V* = u'‘(q'‘) + /* - A*, 
where /* is the average income level in A (as before) and t/*' is the 
average felicity function in A. Our assumption of no preference dif¬ 
ferences between communities will be taken to mean that the average 
felicity functions are the same in all communities, that is, = 

0(q). all A. 

Now let us reconsider the stability conditions for equilibrium. 
Clearly, a necessary condition for stability is that our fictitious average 
resident does not want to move in either direction between any pair of 
communities. The corresponding pair of inequalities for communities 
A and; are 

Olq^) + ;* - A* s + /> - AJ (14) 

and 

d(q'‘) +/> - h'^ ^ d(qJ) + F - h^. (15) 

Clearly strict inequality in either direction is impossible, and both 
relations must hold as equalities in any equilibrium situation; the 
average resident must be indifferent at the margin as to where to live. 
It follows that any welfare-improving project anywhere must induce 
entry. 

We can now use equations (14) and (15) to generate a simple 
external capitalization result. These equations must hold both before 
and after a project (in some particular town z), so we can differentiate 
them with respect to In doing so we must allow for the possibility 
that a change in^* could affect T andA-* (J ^ z). We have already seen 
one way in which this could happen through external ownership of 
local firms; now that migration is certain, there are the so-called fiscal 
externalities as well; when one community attracts residents from 
other communities, the lost tax revenue is a real external cost to the 
“losing” communities. For a rigorous demonstration of this fact, the 
reader is referred to Starrett (1980). 

Allowing for potential changes in all variables, we can write 

dV^ 3® dU{q‘) + d/* — dh’ = dl’ — dh^ (16) 
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and 

dV> =dU (q* ) + di^ - dh‘ = dV - dh>, ( 17 ) 

where dV* is shorthand notation for the change in average welfare of 
community i. 

Let us define dW * to be the total external effect; that is, 
dW* = lN>dVi = dl* -X N>dh>.'’ 

And dW^ = N^dV^ will stand for the welfare change in community z. 
Clearly, dlV = dW* + dW^. Simple substitutions yield the following two 
equations for the change in community z welfare and the change in 
social welfare; 

dw^ = ~dW* - ^ (dR» + dn*) + dR‘ + dW 

N* N* 


and 

dW = -^dW* -^{dR* + dW*) + dR^ + dl\^. 

N* N* 

Finally, we must distinguish cases again, according to the nature of 
firm ownership. 

1. Neutral Ownership 

Suppose that each community owns its share of national profits. Then 
A/’Il* - = 0, and we have 

dW^ = ^dW* + dR^ - ^dR* (18) 

and 

dW = ^ dW* + dR‘-dR* . (19) 

N* N* 

Clearly, total rent change in z is a good approximation to net welfare 
benefits in z as long as community z is small relative to the nation (so 
that N‘/N* is small). However, net social benefits will be misrepre¬ 
sented by rent increases to the extent that the external effect is 
significant. 

How important is this misrepresentation? It turns out it may be 


’ The /' stands for total income in community i; the symbol # means the associated 
variable is aggregated over all communities except t. Recall that we are assuming that all 
local public goods projects are paid for locally. Naturally wc could correct for the 
externalities if we could impose the optimal revenue-sharing scheme. 
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quite important even if N^/N* is small. We have studied the measure¬ 
ment of dW* elsewhere (see Starrett 1980) and merely report the main 
results here. 

When community z engages in a project, that project is certain to 
induce some entry. Other communities suffer real welfare losses 
which are equal to lost lax revenue (from the migrators) minus any 
attendant improvement in congestion “costs." Unless communities 
have already seriously overexpanded, the net benefit to outsiders is 
negative (dW* < 0). Hence even changes in residential land values will 
overstate net social benefit, quite generally. 

Of course, if community z is not small as a fraction of the country, 
the theory takes a somewhat more complicated general form. Aside 
from the external terms, net benefits are capitalized into the differ¬ 
ence between the change in land value in z and the “average" change 
in land values elsewhere. This formulation is reminiscent of that 
presented in Strotz (1968) for two regions. 

There is one other major difference from the internal capitalization 
results; net benefits are externally capitalized, while gross benefits 
were internally capitalized. The economic reason for this discrepancy 
is not difficult to find. The marginal potential resident pays local taxes 
in the internal model (he is the boundary resident), while the marginal 
potential resident does not pay local taxes in the external model. 

2. Local Ownership 

If each community owns its own firms, then dll* = -dR*, and we have 

- ^dRf ( 20 ) 

and 

dW = ^dW + dR^r - (21) 

The form of the results is the same except that capitalization is into 
residential rather than total land value. T he reader might think at 
first that there is a contradiction between equations (19) and (21). 
After all, it cannot matter from a social point of view where the profits 
are owned, so why do these equations look dif ferent? The answer is 
that any increased cost of local industrial land will be reflected in the 
term dW* in equation (19) but not in (21). Thus, even in the case of 
national ownership, only increases in residential land values represent 
real social benefit, once adverse land value externalities are canceled 
out. 
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III. Summary^ 

We have shown in the previous sections that there is considerable 
diversity in the form and extent of capitalization, depending on the 
nature of the assumptions made. However, some fairly general prin¬ 
ciples emerge, and we summarize these in this section. 

First, all of the models studied lead to the conclusion that, from the 
point of view of society as a whole, it is residential property values that 
capitalize project benefits (rather than total land value). The theories 
differ on the appropriate land base only from the perspective of the 
project-generating community; if lost profits from increased land 
values are exported, then increases in total land value measure 
benefits to the community, but, if not, then it is residential land values. 

The internal and external capitalization models generally disagree 
on whether gross or net benefits are capitalized. However, it is worth 
pointing out one important case in which the two theories agree on 
this issue. This is the case of property taxation with agricultural 
zoning, for which both theories suggest that increases in net-of-tax 
land values capitalize net welfare benefits (and, naturally, increases in 
gross-of-tax land values capitalize gross benefits). 

The main differences between internal and external capitalization 
results can be traced to the boundary conditions imposed on a repre¬ 
sentative community. Notice that such boundary conditions play no 
role at all in external capitalization; indeed, the external results hold 
regardless of what happens to rents or benefits at the boundary of the 
communities. 

However, internal capitalization results vary considerably, de¬ 
pending on the assumptions governing behavior of land rent and the 
impact of incremental projects, at the boundary. In the extreme case 
ill which the impact is nil and the (net-of-tax) rent goes down by the 
rate of taxation, internal capitalization agrees with external capitali¬ 
zation. But if boundary rent is unchanged (as it will be under many 
plausible conditions) the gross project benefits are internally 
capitalized. And if marginal projects actually make boundary resi¬ 
dents worse off (because of a dominating congestion effect) then land 
values will overcapitalize even gross benefits. 

We should recognize that, in all cases considered, land values will 
tail to capture some of the external benefits (or, more likely, costs) of a 
project. When the only external effect is through nationally owned 
profits, as was the case in the pure internal capitalization model, we 
coult4 correct for the externality by using residential land value rather 
than total land value. However, in the free-trade, free-migration 
model of external capitalization, the external costs of a project arc 
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generally much larger, and we conclude that increases in residential 
land value will overstate true net social benefit by an amount closely 
related to the size of the fiscal externality. 

We close with some remarks on the basic assumptions underlying 
capitalization results. The two types of capitalization rest on different, 
and to some extent complementary, assumptions. External capitaliza¬ 
tion requires that the distribution of agents by economic characteris¬ 
tics be the same in one community as in all others, while internal 
capitalization requires that agents within a particular community must 
have similar economic characteristics (though these characteristics 
may differ from community to community). Thus, the internal 
capitalization results are most applicable in a “pure Tiebout” world 
where each community specializes and attracts a single type of agent. 
The external capitalization model is most applicable in the opposite- 
extieme world of pure scrambling” in which all communities look 
alike in terms of the characteristics of agents. Naturally, there are 
many intermediate cases between these extremes (including the real 
world, presumably!). To the extent that neither extreme is a good 
approximation, capitalization will not hold in either form, or will hold 
only partially. 
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Money and the Dispersion of Relative Prices 


Zvi Hercowitz 

I'nivrtMli of Kmhe.'.lrr mid Sahonat Bmeau of Etonomic Reseauh 


A pri(«- clispfi-sion ecju.itioii is it-sicd with data from the German 
hv(jerinfl-itioii. I'fie equation is dcriveri from a version of Lucas's 
auci Bario's partial inrormation-loeali/ecf market models. In this 
eMension. diffeient excess demand elasticities across connnoclities 
imply a testable dispersion equation, itt which the explatiatoty vari- 
al)le is the inagnituefe of'the nnpeiceived money giowth. In order to 
test ihts h\ix)thesis a price dispersioti serit's is cotistnicted, and a 
tneasiire ol the itnpetceived part ol incmey growth is estitnated. The 
model leccises support from the empirical analysis, although it is 
esidetit that imincluded variables have important efl'ecls oti price 
dispel sicjii 


The existence of a positive correlation between absolute price level 
variability and the dispersion of relative prices was observed by, for 
example, .Mills (1927), Graham (1930), and recently by Vining artd 
Elwertowski (1976). In his study of U.,S. price behavior during the 
period 1920-26, Mills (1927) says: “We have tiol however exhausted 
the possibility of discovering a relationship between price level and 
dispersion. It may be that dispersion depends upon the violence of 
the price change, regardless of direction” (p. 284). Graham (1930) 
finds in the post-World War I German hyperinflation an additional 
dynamic element of price behavior: “It is clear that with the initiation 
of an upward movement in general prices a .series of lags in individual 
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prices developed, that these lags tended quickly to disappear when 
stability of' general prices was reached on a new level, or when general 
prices fell, but that they were nevertheless progressively eliminated 
even though the general price level continued to rise" (p. 175). This 
observation suggests that unexpected events may have an important 
role in the determination of price dispersion. Individual prices dis¬ 
perse at the beginning of an upward swing in the price level when the 
acceleration is presumably unexpected. As inflation continues the 
element of surprise wanes, and prices tend to converge. 

7'he studies cited above failed 10 offer an economic rationale for the 
observed statistical correlation. Recently, a theoretical explanation of 
the relationship Itetween price level variance and relative price disper¬ 
sion was offered by Barro (1976). Using a localized markets 
framework of the type described by Phelps (1970) and employed by 
Lucas (1973), Barro links the dispersion of relative prices to the 
variance of the money supply. The key elements of this model are, on 
the one hand, individuals possessing incomplete current information 
and, on the other, demand and supply in each market reacting to 
relative prices as they are locally perceived. Thus, agents are con¬ 
fronted with the problem of determining whether locally observed price 
movements arc caused by general inflation or by shifts in relative 
excess demand. The larger the variance of the money supply, the 
more likely are agents to attribute ItKal price movements to general 
inflation rather than to relative shifts. Accordingly, as the money 
variance rises, local price changes induce smaller supply-and-demand 
responses—that is, excess demand becomes less elastic. (Consequently, 
stochastic shifts to Icxal excess demand produce larger changes in 
individual prices, so that the dispersion of prices across markets tends 
to increase with the variance of money. In this sptecificaiion of the 
model, in which all markets have the same structure, dispersion is 
unrelated to the magnitude of realized money shocks. 

I'his paper modifies Barro’s framework by interpreting each loca¬ 
tion to be the market of a specific commodity, characterized by a 
particular excess demand elasticity. Because elasticities vary across 
markets, aggregate shcxks affect each commodity price differently. 
I herefore, in this modified setup price dispersion is positively related 
to the magnitude of these shocks. 

The model also predicts that systematic or perceived money grow th 
is neutral with respect to price relationships. Accordingly, a money 
shock in this model is defined to be the comjjonent of money growth 
thai is currently unobservable and cannot be inferred from currently 
available information. Whereas the quotation from Graham sug¬ 
gested that sudden—presumably unexpected—shifts in money 
grr)wth cause dispersion, in this model unexpected monetary expan- 
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sion disperses prices only if it is, at least partially, currently unper¬ 
ceived. 

The main task of this study is to evaluate this hy|X)thesis with data 
from the German hyperinflation, a period of predominantly mone¬ 
tary disturbances. The period considered runs from January 1921 to 
July 1923. The vertiginous monetary expansion initiated in August 
1923 differentiates the last phase of the hyperinflation, and, thus, it is 
not included in the sample. 

The theoretical framework, presented in Section I, neglects some 
important facets of the hypterinflation. Specifically, it ignores the 
foreign exchange market and the sustained divergence between the 
internal and external values of the mark,' which obviously are related 
to relative prices. Also ignored in the main text are changes in the 
velocity of monetary circulation. However, in a brief discussion some 
general conditions are given under which price dispersion is neutral 
with respect to velocity changes. Finally, the only aggregate exoge¬ 
nous disturbances assumed to be affecting the economy are periodic 
infusions of new money made by the government. Real aggregate 
shocks are ignored. Empirically, they are probably of relatively minor 
importance and can be considered to be part of the error term in the 
estimated equations. 

The testing of the dispersion equation requires two important pre¬ 
liminary steps. First, a price dispersion series is computed in Section 
II using an interesting set of data. It consists of monthly averages of 
68 commodity prices, ranging from foods to metals, for the period of 
January 1921 through July 1923 (31 months). Data were unavailable 
for the months prior to January 1921. Next, a money growth equation 
is set up and estimated in Section III. To do so, both an information 
set available to agents economy-wide and a functional form relating 
this set to money creation are postulated and discussed. The ex¬ 
plained part of money growth in the estimated equation is taken as a 
measure of the perceived rate of monetary expansion. Corre¬ 
spondingly, the unexplained part is interpreted as the money growth 
rate that could not be pterceived from the assumed information set. 
These figures, in conjunction with the price dispersion series, are 
u.sed in Section IV to test the price dispersion equation. 

Parks (1978) has also tested a model of price dispersion using pre- 
and post-World War II U.S. data. In Parks’s model dispersion is 
explained by changes in real income and the unexpected part of 
inflation, as measured by the innovation in the inflation rate. In this 
specification expected inflation and changes in real income are 
treated as exogenous variables. He finds a strong positive correlation 


‘ See, e.g., Bresciani-Turroni 1937. 
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between unexpected inflation and price dispersion. His tests also 
suggest a separate but smaller effect of the actual inflation rate. 

The present paper estimates an equation that relates price change 
dispersion to the exogenous shocks affecting the economy, in this case 
unperceived monetary injections. An additional monetary variable 
that is theoretically relevant for price dispersion is the variance of 
money shocks. An estimate for this variance is obtained from the 
money growth analysis and is included in the estimation. The model 
receives significant support from the empirical analysis. In particular, 
the variable measuring unperceived money growth has substantial 
explanatory power for price dispersion. The results also make clear 
that unincluded variables have important effects on price dispersion. 
Some of these are briefly considered in Section V. 


I. The Model 

The economy consists of an arbitrarily large number of physically 
separated markets indexed by z. In each location a specific commodity 
is produced and traded. At each date t the agents, assumed to be risk 
neutral, exchange money only for the commodity being traded in the 
market in which they are cutrently located. At date / + 1, agents 
change location at random, and the process is repeated. Consider now 
the information set available to the agents. It contains not only lagged 
values of all relevant variables, but also current information which is 
limited to the local market price Pi{z), and some economy-wide shared 
knowledge about current variables related to money creation. .Actual 
money growth, however, includes a random term which is assumed 
unknown. 

The supply and demand for commodity z assume the log-linear 
forms; 


y1{z) = a’{z)[P,{z) - EP,] -(- effz), (1) 

yf(z) = -a''[P,(z) - PPt] + {M, - EP,) -t- effz), a'(z) > 0. a" > 0. (2) 

The operator E is the mathematical expectation taken conditional on 
the information available in market z at time /. For each commodity z. 
P/iz) - EP, is the locally perceived relative price. The expressions rf(z) 
and ef(z) represent relative shifts to supply and demand, respectively. 
The excess demand shift, €,(z) = €?(z) — cj(r), is assumed serially 
uncorrelated, normally distributed with zero mean and variance of. 
This variance is assumed to be equal in all markets. For each z, a'(z) is 
the short-run relative price elasticity of supply. Disparity in the supply 
elasticities of different goods follows from heterogeneous prtxluc- 
lion functions. However, in the long run relative prices are assumed 



JOURNAL OF POLITICAL ECONOMY 


332 

fixed because of perfect substiiutabiiity on the supply side. The long 
run is measured here by one period, after which all suppliers can shift 
to other markets. 

Looking one period ahead, all the markets offer the same mean 
price, but, as shown below, the corresponding variances differ ac¬ 
cording to the excess demand elasticities. Because agents are risk 
neutral, they are indifferent between the markets, and thus they 
choose a market for the ticxt period randomly. There is an additional 
point related to the ex ante variability of the individual prices. Intui¬ 
tively, one would expect a market with more price variance to be less 
desirable because local information would yield price level estimates 
of lower precision. However, as shown below, this turns out not to be 
the case. 

On the demand side, the relative price elasticities are a.ssumed 
constant across markets, 'fhedemand function also includes the term 
M, — EP,, which accounts for a real balance effect. 

At the beginning cjf each period the stock of money in the economy 
is increased by iransfers from the government to the public. T his new 
money is assumed to be distributed equally across the markets. Within 
each market, hoveever, the transfers are allocated randomly among a 
large number of agents. The rate of growth of the money stock, m, — 
M, - obeys 

m, = + m, =g, + m„ 

where the X„’s are variables (past or current) that can be observed in 
all locations and the /3’s are known coefficients. The quantity m, is a 
random variable with zero mean and variance 17 ^. I'hus, g, is the 
expectation about money growth formed from all the economy-wide 
shared information. It can be considered the prior expectation. The 
posterior is formed using the additional information conveyed by the 
local price. Thus, while g, is the same everywhere, the posterior 
expectation Km^ is conditional on location-specific information as well 
and therefore varies across markets. 

From equations (1) and (2) market clearing implies that 

= {1 - L o'*)} EP, + {l/(cd(z) -t- [M, -t- e,(z)|. (3) 

For each z, the sum a'(z) + a** is the relative price elasticity of excess 
demand. Let \(z) = l/[at'(z) + a**]. Each market has a constant X(z), 
but across markets \(z) is distributed according to a given density 
function with average value X and “variance” (t\. (Consistent with the 
assumption that agents ptossess accurate knowledge about the structure 
of the economy, this distribution is assumed to be known. 

Following Lucas (1973) and Barro (1976), the solution for prices in 
terms of exogenous variables is obtained using the methtxl of unde- 
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termined coefficients. Given the log-linearity of the model, the solu¬ 
tion for the aggregate price level has the form 

Pt = + ri-ig, n^m,. (4) 

Namely, the aggregate price level will be related to the current money 
stock, which is divided into its different components. Lagged values, if 
added to (4), yield zero coefficients. Since iW,-, and g, are fully per¬ 
ceived at time /, taking the expectation of both sides yields 

EP, = fljM,-, + + Ilab'm,. (5) 

The conditional expectation of m, is now computed. Rewrite (3) as; 

= gi + m, -I- 6 ,(z), 

where 

8,(z) = [1/A(2 )]/’,(z) - [l/\(z) - I]EP, - AT-,. 

The total disturbance affecting market z. S(z), is partly nr>mindl and 
partly real. Agents perceive S(z) and form their expectations about 
its components. Given the stochastic specification of in, and e,(z), the 
mean of the distribution ofconditional on S(z) is 

Em, = g, + - g,j, 

or 

A'tn, =g( +-2^f-:3h«r + C((z)]- 

(Tm + Ot 

Observe that X(z) does not appear in (6). Since agents lotaicd in z 
know this elasticity, they are able to isolate the composite disturbance 
independently of A(z). Thus, while the ex ante variance of prices 
depends on the particular elasticity (this follows from eq. [26] below), 
the precision obtainable from the Iwal information is independent of 
A(z). Indeed P,(z) would convey less valuable information in higher 
piice variance markets if the differential variability was due to a 
disparity in oj. In this model, however, of is the .same across markets. 

Substitute now ( 6 ) into (5) and the resulting expression for EP, into 
(3) to obtain 

EP, = + Tliff, 4- Ha 17) 

= H - A(z)]|ri,A4,_, + Hag, -t- + €,(z)]l 

0-5, + 0-* j (g) 

-I- A(z)[M,_, + g, + m, + €,(z)i. 
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A new expression for the general price level can be computed from 
(8) by averaging with respect to the densities of A( 2 ) and €,(z): 

P, — {I — X) + Iljg, + Ila ——j- TO,] + X(M,_i + gt +■ TO,). 

(9) 

Since equation (9) is identical to (4), the solution for 11,, IIj, and Ils is 
obtained by equating the corresponding coefficients in the two 
equations: 

n. = 1, 

n* = 1. (10) 


Ha 


ai + (l/X)o-f ■ 


Substituting (10) into (8) and (9) and rearranging terms yields the 
solution foi the individual commodity price and the average price 
level: 


P,(i) = M,-, + g, + 


aj, + k{z){]fX)(T} 
o-*, + (1/X)<r* 


[to, + e,(z)], 


P, = Mt-i + g, + 


+ oj 


a*, + (l/X)or? 


m,. 


( 11 ) 

( 12 ) 


I he resulting actual relative price is: 

P,( 2 ) - P, = (1 - 0)X(z)to, + [0 + X(z)(l - e)]€,(z), (13) 

where X(z) = X(z) — X and = tTm/Io-J, + (l/X)o-f]. 

The hypothesis expres.sed by equation (13) is that only the unper¬ 
ceived part of money growth can affect price relationships. Note that 
the realized values of the unpterceived money growth appear in the 
relative price expression. This follows from the confusion between to, 
and c,(z). Since in general £to, ^ m,, part of the money shtxks is 
mistakenly perceived to be a shift in relative excess demand. The 
ensuing short-run supply reactions differ across markets according to 
a'(z), thus causing dispersion among actual prices. On the other hand, 
gi is correctly identified as an aggregate disturbance and therefore 
cannot be confused with a relative shift of excess demand. The neu¬ 
trality of perceived money follows fromy;(z) andyf(z) being functions 
of the relative price and from the one-to-one relationship between g, 
and the expected price level (eqq. [7] and [10]). Given some value for 
gi, the quantities along the supply-and-demand schedules are the 
same as before, for local nominal prices higher by an amount equal to 
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the adjustment in EPi —which equals g(. Therefore, the market clears 
at a Pt(i) which is higher by the same degree in all markets* 

The variance of relative prices at time t, defined as if = MN S,., 
[P,(z) — /’«]*, where N is the “very large” total number of markets in 
the economy, can be computed from equation (13)* 

T? = {(1 - efal + [<J + Ml - WM + (1 - e)^cdml (14) 

An empirical test of this equation requires a measure of dispersion 
among prices or price indexes of different commodities. Mills (1927, 
chap. 3) discusses problems that the interpretation of this dispersion 
measure presents. For example, long-run differential technological 
changes will cause prices to disperse over time. One would like to filter 
out such effects, because the focus here is on short-run distortions 
caused by incomplete current information. The problem is alleviated 
by using rates of price change rather than price levels. Different trends 
do not affect the variation of price change dispersion over time— 
although alterations in these trends will. Thus, some of the long-run 
relative prive movements effects can be filtered from the dispersion 
measure. What remains can be considered to be captured by the 
random term in the dispersion equation. 

The variance of the rates of change in individual prices is calculated 
using equation (13) and the equivalent for I - 1. This variance, 
defined as ^ 

y' ^ ^ (^ 5 ) 

follows as 

>? = 2(1 - 0)*(ricr‘ + 2(0 + Ml - eW. + d - »)V!(m, - m,.,)*. 

Equation (15) is the final price dispersion equation that is generated 


* In order to consider the effects of changes in the veloc ity of money circulation on 
price dispersion, I worked out a similar simple model in which there is some current 
public information about future money growth. This information may be conveyed bv 
political or military events that are believed to have implications for the future state of 
government finances. The prediction of future monetary expansion, which generates 
inflationary expectations, affects the velocity of circulation in the current pericxl. With 
tespect to relative prices, if the knowledge about future money growth is shared 
C( onomy-wide, they will be unaffected by the change in velcxnty. This neutral effect 
follows from the same mechanism determining the neutrality of perceived money. 
Sint e all agents share the same knowledge and arc assumed to use it in the same model 
to predict its effects, they will equally adjust their EP, according to the change in 
velcK iiy taking place. A “one-time jump" in all prices therefore occurs, without affecting 
their d'spersion. 

’ For this compulation, since t,(i) and (aXi)]* are independent of X(i). UfilP, and k<t), 
the following equalities are used: ( 1/Af)2t,(r)'^(r) = O A = 0, (l/A’lSledrlPlAiz))* «= 
(r.»(r{, and (l/A)5;(t,a)]«(A(i)]« = trlftr} A>). 
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by (he model. Because It deas the magnitude of 

the appropriate monetary-shocks variable 

chani^es in ni*. . \ • c 

Consider next the implied relationship between the variance of 

money shocks and the dispersion of relative prices. Barro s theoretical 
result was that o-f„ is positively correlated with relalive price variability. 
However, the ef fect of cff„ is ambiguous in this extended version of the 
model, since it has different and opposite effects on the three terms in 
the yf expression. I'he second term on the right-hand side of (15) is 
the remainder of the expression when all markets are alike; that is, 
when, as in Bano s case, all have the .same excess demand elasticity (crl 
= 0). This term corresponds to his relative price variance, which 
depends positively on erj, when 0 < \ < 1. This condition is the 
counterpart to Barro’s assumption that substitution effects dominate 


wealth effects. 

I'he first term accounts for the positive interaction between ihe 
diversity in elaslicities and the strength of the relative shifts. A term of 
this sort would be included also in the dispersion expression under 
f ull current information. In the present case of partial information, 
the fraction (I - ff) appears here because agents typically underesti¬ 
mate the magnitude of the relative shifts, thus diminishing their effect 
on price dispersion. Because this underestimation increases with irj,, 
tfie first term is negatively related to the money variance. I’he other, 
more interesting negative effect of o-?„ appears in the third term, 
namely, in the coefficient of (m, — If (rf„ increases—or more 

precisely when the public perceives it doing so—money disturbances 
arc less confused with real shifts, implying that a given shock induces 
smaller dispersion. I his ef fect is a relative pi ice equivalent of Lucas's 
hypothesis about the link between the variance of the nominal distur¬ 
bances and the slope of the Phillips curve. 

In the testing of equation (15), repot ted in Section IV, an attempt is 
made to capture the different effects of aj, and its net influence on 
price dispersion. However, the procedure adopted does not indicate 
that shifts in cr^ have an important effect. 


II. Construction of the Price Dispersion Series 

I his section reports the computation of a measure of price dispersion 
for the hyperinflation in Germany during the period January 1921- 
July 1923. The data .set, consisting of 68 series of monthly averages of 
wholesale commodity prices, is obtained from the German slatislical 
yearbook issues of 1921/22 and 1923 (see Statistisches Reichsamt 1921/22 
and 1923). Other series from this source, some reported only until 
December 1921 (seven commodities) and others beginning only in 
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January 1922 (21 commodities), were deleted in order not to intro¬ 
duce a bias due to changes in the sample size and composition. 

Prices are quoted from commodity exchanges of several German 
cities.'* Each series, however, originates in a single location. The 68 
commodities include 27 foodstuffs, 19 textiles and leathers, and 22 
metals, oils, and coals. They are not finished goods but materials in a 
rather raw state. Because weights for the different commodities are 
unfortunately not available, unweighted rates of price change are 
used. Hopefully, the wide range of commodities in the sample ap¬ 
proximates the general relative price instability during that period. 

The individual price rates of change are computed as the first 
difference of the logarithms of the prices. Average values and vari¬ 
ances are then calculated using 


AT, = 


} AP„, 


y? = ^X(AP„)^- (AP,)^ 

where P,, is the price of commodity i and AP,/ = log P„ - log Pi,.,. 
Table I contains the computed values of AP, and y?. Due to missing 
observations, the actual number of commodities included in the cal¬ 
culations varies slightly from month to month. The third column in 
table 1 indicates the number of commodities for which Ijoih P„ and 
are available. 


III. Estimation of the Unperceived Part of Money Growth 

Determination of the unpcrceived component of money growth dur¬ 
ing the hyperinflation retjuircs a specilication of the information set 
assumed to have been available to the public and the functional form 
lor calculating the conditional exjjectation of money growth. Con¬ 
sider the expectation conditioned on economy-wide or “global" infor¬ 
mation g,. This term was defined in Section 1 to be the prior expecta¬ 
tion and is distinguished from the posterior expectation Irccause it 
does not incorporate the additional information derived frenn local 
price observations. 

Ihis global information is assumed t<» consist of the current gov- 
etnment spending in foreign exchange units,.S’,, the current exchange 
rate, e,, and 1-moitth lagged data on the money stcxrk, price level, and 
all oth“r macroeconomic variables. Not included is government reve- 


’ (iiven tins souicc, these data do not present the problem of repot led wholesale 
price data in the United St,ttes. discussed by Stiller and Kindahl (1970). that thev do 
not always reflect discounts fiom list prices. 
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TABLE I 

Mean and Variance of Wholesale Rates or Price 
Changes, Germany, February 1921-July 1923 


Month 

AP, 

Y? 

Number of 
Commodities 

1921. 

February 

-.09 

.015 

63 

March 

-.05 

.Oil 

66 

April 

-.02 

.007 

66 

May 

- 01 

.026 

66 

June 

.05 

.032 

67 

.July 

04 

.034 

66 

August 

.19 

081 

66 

September 

.19 

.033 

68 

(Xtober 

.24 

.034 

68 

November 

.38 

.043 

68 

DeccmJ>er 

-.03 

.062 

66 

1922. 

January 

.05 

.018 

66 

February 

.12 

013 

68 

Match 

.25 

.011 

65 

April 

.12 

.018 

65 

May 

.04 

.013 

66 

June 

,09 

007 

66 

July 

.36 

.017 

67 

August 

.70 

.097 

68 

September 

.40 

.071 

66 

October 

.68 

.064 

66 

Novembei 

.78 

.038 

66 

December 

.22 

.052 

66 

1923: 

January 

.70 

.064 

66 

Februai y 

69 

.099 

66 

March 

-.17 

.041 

65 

April 

.11 

.018 

65 

May 

.50 

.039 

66 

June 

.82 

.048 

62 

July 

1.25 

.151 

63 


S(n a< r —Baitrd on morithlv prue data from Statviurk^j Jahrhueh fur dai DeuiifMe Rnrh ()9?i/22). pp 

2H2-83. and (I92S). pp 286-89 


nue from laxaiion and other sources because this variable depends on 
the current level of economic activity and is unlikely to be prean¬ 
nounced and to be widely known contemporaneously. It is natural to 
assume that the part of government expenditure consisting of the 
reparations to the Allied Powers was known in foreign exchange 
terms. With respect to the other expenditures, the implication is that 
nominal spending was observable and could be readily converted 
given the exchange rate.* 


A referee of this foumal suggested an alcemacive procedure for isolating unper- 
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The prior expectation of money growth is derived from the gov¬ 
ernment monthly budget constraint, namely, 

Af? - Af?_, = S?e? 

— (other forms of nominal government revenue). ' ' 

The superscript 0 indicates that the variables are not in logs but in 
their original form. Equation (16) indicates that creation of high- 
powered money equals the part of nominal expenditure that is not 
financed in some other way. The expression would correspond 
here to the end-of-month money stock. The other forms of govern¬ 
ment finance are taxes, net sale of bills, gold sold to the public, etc. 

If this other revenue comprised an approximately fixed proportion 
of total expenditure over time, the budget equation above could be 
expressed as 

Af® - Af®_, = AS??® random term, (17) 


where A (0 < A < 1) is the average fraction of the expenditure 
financed by money issue. 

The first attempt to generate a perceived money growth series was 
made using an equation of this type. Dividing (17) through by Af?.,, 
money growth appears linearly related to S?r?/Af?_,. The three vari¬ 
ables in this ratio are assumed currently known, and, therefore, this 
specification is consistent with the notion that the conditional expec¬ 
tation can be formed using only currently observable variables. 

However, a regression of this form.® including a constant, shows 
that A was probably not constant over the period. Specifically, the 
existence and pattern of residual serial correlation,’ plus some addi¬ 
tional considerations discussed below, suggest a nonlinear relation¬ 
ship between money issue and sf>ending during that period. 

Assuming, then, that the fraction A is not constant over time, the 
question is whether something can be said about its determinants. In 

reived money using informadon from ihe Ontral Bank's balance sheer. This informa¬ 
tion is available monthly in Staiistisches Reichsami (1925). When the ('.enlral Bank 
discounted a bill, the corresponding credit appeared in the current account of the state 
or the private borrower. C)nly when these funds were withdrawn did the notes in 
circulation increase. Using information about these credits, agents could forecast 
money growth in the near future. 

• The estimated C)L.S equation is the following (see below for the inclusion of the 
lagged spending variable): 


- Mf-, 
At?., 


-.049 + .317 '^*^ 1 + .581 
(019) (.03.3) (.089) 


(«• = .95, D-W = 1.0. <7 = .067). 


’ 7 he residuals are generally negative at the beginning and the end of the period, 
approximately the low and high values of money growth, and generally positive for the 
rest of the sample. 
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order to suggest an answer to this question, rewrite equation (17) as 


m - MU _. 

Al? ' Af?/?? 


+ random term. 


(18) 


Equation (18) preserves the positive correlation between money 
growth and the ratio of real expenditure to real cash balances, but, 
unlike (17), the fraction k is now allowed to vary over time. It is now 
argued that k, is itself correlated with Al®/r? and S®. 

To examine this correlation, assume first that S® is fixed at some 
value S®. This level of real spending can be financed by different 
mixes of inflationary finance on the one hand, and taxation, debt 
issue, etc., on the other, where the amount to be collected by money 
issue is expressed as A,A®. In the usual diagram plotting the demand 
for real balances as a function of the inflation rale, AfA® is measured in 
steady states by the area of the rectangle defined by /x.—the inflation 
rate—and Af?/c®. 

(Consider now an increment in fi. Real balances decline according to 
the money demand function, and the revenue from inflation, A(A®, 
increases as long as p, is below the rate that corresponds to a unitary 
demand elasticity for real balances. Because real spending is constant, 
k, increases, and, hence, the fraction of A® financed by other means 
declines. 

7 his shift from taxation to money issue can be viewed as the policy 
variable that brings about higher inflation rates, (flassic works on the 
Ciermaii hyperinflation, like those by Graham (1930) and Bresciani- 
I'urroni (1937), describe an opposite direction of effect. Namely, the 
rate of depict iation of the currency had a negative ef fect on the real 
yield from taxatiori due to the interval of time existing between the 
occurrence tif taxable traiLsactions and the actual payment of the 
taxes.'* The present discussion relies on the correlation between the 
fraction of expenditure financed by money issue and the inflation rate 
rather than on a specific mechanism relating these two variables. This 
positive correlation implies that A, and l/(Af{’/('®) move in the same 
direction. However, this coincidental movement does not hold for all 
p. When p reaches the rate that maximizes the revenue from 
inflation. A, also reaches its highest level, and when it rises above that 
rate. A/ must decline. In other words, the correlation between A, and 
l/(Af®/c®) turns negative in that range. 

This decline in A, implies that the revenue from other sources must 
go up. If tax collection and debt issue cannot be increased (e.g., due to 
the negative effect of inflation mentioned above), spending must be 
partially financed by extraordinary means, such as sales of gold from 


"See, e.g. Bresriani-Turroni 1937, p. 66; and Graham 1930, p. 44. 
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the Central Bank’s stock. In fact, the balance sheet of the German 
Central Bank shows that the stock of gold begins to decline 
significantly in April 1923, after being fairly stable since 1920.® 
Given this behavior of k, when S? is constant, equation (18) can be 
approximated by the semilogarithmic form: 


M? - M?-! 


Af? 


= constant + h' log 


MW 


-j + u', 


(19) 


where b' is a positive coefficient, «' is a random term of zero mean, 
and the constant term is affected by the level of .9®. In this specifica¬ 
tion, the implicit fraction/;, increases along with ll(Mflef) at lower and 
middle ranges of this variable but eventually declines when real bal¬ 
ances fall below a certain value. 

Equation (19) acquires more empirical content if real spending is 
allowed to vary. During the period under study, .S'® has a declining 
trend. However, holding constant the real balances term, which has a 
strong correlation with time, movements in .S? tan be interpreted as 
temporary deviations from a “normal” trend. These Huctuations are 
assumed ai.so to be correlated with the fraction k,. The assumption 
here is that given relatively high costs associated with temporal y shifts 
in tax collection and debt is.sue, transitory movements in sp>ending 
would be financed primarily by adjustments in money issue. A posi¬ 
tive correlaiioti between S® and h, would then result. However, a 
sufficiently high value ol'S® could be presumed to require extraordi¬ 
nary finance of the sort previously mentioned, so that k, might even¬ 
tually decline. 

Incorporating an approximation of this effect into equation (19) 
results in the following generalized expression; 


- A/«_, 
M? 




-t- f' log .9? 


which can be rewritten as 


= «’ - b'[M,^, + (M, - Af,_,) 


Af? 


e,] + c’.9, + 


( 20 ) 


where variables without a superscript are again in logarithmic terms. 

In order to proceed with the formulation of the prior expectation, 
it is convenient to replace the logarithmic growth rate (Af, - Af,.,) on 
the right-hand side by the growth rale measured by (Af® — Af?.,)/Af®. 
While this rale is always lower than Af, — the gap widening the 

hight'i the growth rates, this effect can hopefully be captured ap¬ 
proximately by the coefficients in the estimated equatiott. Then, 


"Sec Statistischc.s Rcichsami 1925, p. 53. 
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equation (20) can be solved for (A/? — Af®_i)/A/{* to yield 


M? 

The prior conditional expiectation is defined accordingly as 

«'-(T4T)-(TfF)<"--''> + (TTl)'*- 


( 22 ) 


The unperceived part of money growth mi’** is then computed by the 
difference between actual growth and g,, namely, 

ih, = (M? - Mf-,)/M? - g,. 


The coefficients in equation (22) are those which result from re¬ 
gressing (Af“ — onS, and (M,_, — r,)- However, the exchange 

rate is not in general an exogenous variable in a money growth 
equation. A correlation between e, and the error term u, will exist via 
some unspecified condition for equilibrium in the foreign exchange 
market. Therefore, the coefficients in (22) do not correspond exactly 
to those in equation (21). This property is not a drawback. On the 
contrary, the bias in the estimated coefficients (relative to those in 
[21 j) reflects the part of u, that can be estimated from It therefore 
.should be taken into account in calculating g,.'* 

furn now to the estimation of equation (21). There is a problem in 
matching the available data on money with those on prices for the 
German hyperinflation. Unlike the price series, which consist of 
monthly averages, the available data on the money stock until January 
1923 arc end-of-month figures.'* From January 1923 onward, four 
quotations per month are available. 1 hus, a proxy is constructed for 
the monthly average money stock. For the period January-July 1923 
it contains averages of the beginning-of-month, end-of-month, and 
the three intermediate quotations available. Until December 1922, the 
monthly averages are approximated by linear interpolation of the 
end-of-month figures. The constructed series are shown in table 2. 

A further consideration arises. I'he estimation of equation (21) 
from monthly average data on the money stock, rather than end-of- 


The unf>erceived growth mi does not correspond exactly to the error term u,. On 
this point, see below. 

" E.g,, in the general linear modely = X/3 + u, where the variables in X are correlated 
with u, the estimated vector of coefficients is ^ = {X'X)~‘X'y = (X'Xy'X'{Xp + u),ox 
= g + (X'X)~‘X'u, where {X'Xy'X'u is the regression coefficient of u on X. The 
prediction of y, given the values of the vector*, is, accordingly,y, = *,'(/3 + (X'X) 'X'uJ; 
i.c., it is composed of the systemadc part*//3, plus the conditional expectation of«, given 
*'• 

" Another problem stems from the form of the prior expectation, which, as it stands, 
requires die use of end-of-month money stocks. See the discussion below. 
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TABLE 2 

Values (in Logarithms) of Real Expenditure, 
Exchange Rate, and Money Stock 



s, 

ft 

M, 

Oct. 1920 

5.85 

2.79 

11.24 

Nov. 

6.44 

2.91 

11,25 

Dec, 

6.58 

2.86 

11.28 

Ian. 1921 

6.42 

2.74 

11.29 

Feb. 

6.80 

2.68 

11.28 

Mar. 

6.66 

2.70 

11 29 

Apr. 

6.65 

2.71 

11.30 

May 

6.53 

2.69 

11.30 

June 

6.76 

2.80 

11.33 

.lu'y 

6.43 

2 91 

11 36 

Aug. 

6.76 

3.00 

11.38 

Sept. 

6.23 

3.22 

11.42 

Oct. 

5.95 

3.58 

11.48 

Nov, 

5.54 

4.14 

11.55 

Dec. 

6.44 

3.82 

11.66 

Jan. 1922 

596 

3,82 

n 72 

Feb. 

5.84 

3 90 

11.75 

Mar, 

5,85 

4.22 

n 81 

Apr. 

5.77 

4.24 

11.88 

May 

5,93 

4.24 

11.96 

June 

5.75 

4.33 

12 05 

Jviy 

5.34 

4.77 

12.16 

Aug. 

5.32 

5.60 

12 33 

Sept. 

6.08 

5.86 

12.58 

(Xt 

5.60 

6 63 

12.92 

Nov. 

5.30 

7.44 

13.35 

Dec, 

6.08 

7.50 

13.85 

Jan. 1923 

5.32 

8,36 

14 26 

Feb. 

5.63 

8.80 

14 82 

Mar 

6..55 

8.53 

15.31 

Apr, 

6.15 

8 67 

15.60 

May 

5.65 

9.34 

15.83 

June 

6.21 

10,17 

16,29 

.iuly 

6.16 

11,34 

17.14 


Nott —sS, => log of the monihiv Kovrrnm«n( expcndtfuiT in millions ol gold marks Sourer Brrsciani-Tiitroni 
(1937), pp. 436-37. f, « log of ihr numihiy avrrisgr ntchtf^ me of ihc gold marii tn psper rrurks 
Source Bresciani- furmni (1937), p. 441 M, *■ log of the momhiy average money nock »n nuUions ol paper marks 
Source based on hxed days quotations from Sonderheftr zur Wtrticha/l umi Utahstu (1923). pp 43-47 


month figures, means that both the current month's spending and 
that of the previous month should be considered. Spending financed 
by money issue during the previous month increases one to one the 
current monthly average stock but has, in general, a weaker effect on 
the prior month’s average stock,'® To account for this effect, lagged 

This etfeti can be seen by considering first a case where each month's spending is 
spiead evenly over the month, and, say, it is financed only by the issue of new money. In 
this case the money stock grows linearly at, in general, different rates within each 
monthly period. Then, if denotes the money stock at the end of month I - 2, 
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spending is incorporated into the framework of the semilogarithmic 
function in equation (21). First, define the variables 5/ and e't by 5,' = 
log + (1 - and el ^ log (fS? + (1 - After 

substituting S,' and r,' for S, and e, in (21). f is estimated, simulta¬ 
neously with the other coefficients in the equation, using a nonlinear 
maximum likelihood procedure under normally distributed errors. 
The estimated nonlinear equation is 

/MO _ \/f0 

‘ Afo = -660 - .166 (M,-, - el) + .1 l.'W/ (23) 

(.071) (.007) (.014) 

(I = .75, = .98, D-W = 1.63.0- = .026, 33 observations), 

(. 11 ) 

where the numbers in parentheses are the standard errors of the 
coefficients. With respect to the number of observations, the starting 
month was taken as November 1920 in order to test lagged effects of 
monetary shocks on price dispersions. According to the argument of 
note 13 above, the value .75 for ^ suggests a pattern of spending that 
is biased toward the beginning of the month. In order to proceed with 
the analysis on the more familiar ground of linear e(}uations estima¬ 
tion, ^ is assumed henceforth to equal .75. Given this value, the 
standard errors of the other coefficients, linearly estimated, are not 
materially different from those obtained above. In order of their 
appearance in (23), they arc .068, .005, and .014. Fhe standard error 
of the regression is now 0.0256. The other regression statistics and the 
coefficients remain obviously the same. 

rhe pattern of the residuals from equation (23), which are reported 
in table 4, suggests that their variance increased during the sample 
period as inflation progressed. A method similar to that proposed by 
Glejser (1969) is adopted to correct for the apparent hcteroscedasiic- 
ity by assuming a specific model for the variance of the error term. In 
this procedure the variance is postulated to l>c determined by a set of 
variables {z,} in the linear form: 

a^, -1 W(Z|,. (24) 

the monthly average for / - 1 equals .S” ir" i, and that corresponding to 

month t equals Mf 2.rni + i. r? i + (4 !*?'?■ Thus, the increase in the monthly average 
from! - 1 to t equals'/4 + Vt Sfef. Namely, spending evenly spi ead over each 

month would imply equal weights lot current and lagged spending in the money 
growth equation. Alternatively, if spending is concentrated at the beginning of the 
month, the relative weight ol lagged expenditure would be lower. At the extreme, e.g., 
if all .spending is made only on the first day of each month, both t - 1 and ( monthly 
averages increase equally by the amount of the I — 1 expenditure. In this case lagged 
spending does not belong in the money growth equauon. 

“ This procedure is from the T'SP Regre5.sion Package. 



DISPERSION OF RELATIVE PRICES 


343 


If the values of the true money shocks to?* were available, one could 
use them as follows. Since the expectation of mf^ is cT%a> h follows that 

= <rL + V/. 

where Vf is of zero mean. Combining the last two equations yields 

mf* = SwjZ,, + V ,. (25) 

Estimates of the coefficients in equation (24) could be obtained by 
regressing m,** on the Z| variables. The heteroscedasticity problem is 
also present here, but it will be ignored in what follows. 

The values of mf are, however, unknown; only the estimated re¬ 
siduals m, are available from the OLS money growth equation. Since 
m? converges to mf^ asymptotically, the variance estimated using to? 
values obtained from small samples will be biased. This bias is ne¬ 
glected, as it is probably of relatively small importance. 

In order to proceed with the implementation of this procedure the 
set of the 2 ( variables must be specified. The presumption is that the 
same variables used to explain the growth rales are also correlated 
with the variances. I hus, S/, el, and M,_, are candidates. The lagged 
squared residual to?-, is also included as an explanatory variable. It 
presumably captures the eff ect of serially correlated omitted variables 
and perhaps a direct correlation between iTm-, and the current vari¬ 
ance. 1 he estimated variance equation is 

mf = - .007 - ,00038.S; - .00026c; + .00090.M,., - .281 to?., 

(.003) (.00047) (.00031) (.00047) (.130) (26) 

— .57, D-W = 2.5, cr = .001,33 observations).'^ 

Nothing iti this procedure for estimating the scries of money vari¬ 
ances guarantees that all the fitted values from equation (26) would be 
positive. Indeed, two of the filled values have a negative sign. In order 
to use the estimated series as a measure of variances, these two 
negative values are replaced with the smallest positive value in the 
series. The series of the square rtxits of these estimates are reported in 
column 4 of table 4. 

Usitig this series as weights for the corresponding observations, 
equation (23) is reestitnaied with the following results: 

(Af? - M!>_,)/Af® = .700 - .154(Af,_, - c,') + .0945,’ 

(.047) (.006) (.012) (27) 

(«* = .95. D-W = 1.8.0- = 1.01,33 observations). 

In order to estimate the SS variances needed to reestiinate eq. (34). the m, series 
were obtained from runniitg the money growth equation after .iddmg the additional 
observation of (ictober 1020. 
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Observe that the coefficient of 5/ here is somewhat lower than in the 
OLS equation and that the coefficient of (M/_i — el) is somewhat 
higher. The general form of the equation is, however, robust to this 
transformation of the data. 

The next step is to test the stability of the coefficients in equation 
(27) across two subperiods. Stability of the coefficients has particular 
relevance here. If the equation is approximately stable, it seems easier 
to assume that it was known from the beginning of the period and 
that perceptions about money growth were formed using the same 
equation during the entire sample.'® The jseriod is divided into an 
approximately nonaccelerating money supply period until May 1922 
and an accelerating phase beginning in June 1922. An f-test applied 
to these two subperiods yields the .statistic Fir ~ l'^, with a corre- 
sfxmding .5 percent critical value of 3.0. Therefore, the hypothesis of 
stable coefficients across these two subperiods cannot be rejected at 
the 5 percent significance level. 

A regression in which the coefficients of Af,_i and el are uncon¬ 
strained produces coefficients of similar magnitude for the two vari¬ 
ables. The F-test for the linear constraint of equal coefficients yielded 
the statistic Fia = 2.3, where the 5 percent critical value is 4.2. 


IV. Empirical Test of the Dispersion Equation 

I he tests of the price dispersion model in equation (15) are per¬ 
formed using the dispersion series computed in Section 11 and the 
unperceived monetary shocks as measured by the residuals in equa¬ 
tion (27). 

For convenience equation (15) is rewritten here 

yf = {2(1 - e)^cTl + 2[e -t- Ml - e)]*}(T? + (1 - fl)*oi(m, - 

(15) 

Significantly, this equation has a simple linear form—and can 
therefore be tested by an ordinary least-squares procedure—only 
under constant money and relative shocks variances. However, the 
analysis of money growth in the previous section suggested that 
increased during the hyperinflation. If this is indeed the case, it would 
not be appropriate to test the model with a specification that relates 
price dispersion to money shocks with a constant coefficient. 

Two different procedures are adopted here to deal with the possi¬ 
bility of a changing money variance. The first uses a linear approxi¬ 
mation in which the variance of money—as measured by the &%, series 


Estimation of a money growth equation in a similar context, using the entire 
sample (for U.S., 1941-73), was discussed and pcrlormed in Barro (1977). 
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from Section III—is kept constant by including it additively in the 
equation. As discussed in Section I, aj, has different and opposite 
effects on y*, and, therefore, on an a priori basis the coefficient of 
could take either sign. The other attempt is to estimate (15) as a 
nonlinear equation. The results of this procedure, reported later in 
the section, are quite poor. 

The estimated equation in which is added linearly is: 

yf = .033 + I7.4(m, - - 15.8&1, (28) 

(.010) (3.2) (8.6) 

(ft* = .59, D-W = 1.2, O' = .022, 30 observations). 

The monetary shocks appear to have considerable explanatory 
power for price dispersion. The coefficient of cfj,, is negative and, 
therefore, suggests a dominant Lucas-type effect of the money vari¬ 
ance on price dispersion. That is, the degree of dispersion associated 
with given shocks diminishes the higher their variance. The ex¬ 
planatory power of (Tmi< however, is fairly low; its coefficient is 
significantly different from zero at the 5 percent level but fails to be so 
at the 2.5 percent level. 

Theoretically, the 1-month lagged money variance crS,,-, belongs 
also in the equation. However, when included, its coefficient is in¬ 
significant with a ^-statistic of .8. 

The Durbin-Watson statbtic indicates autocorrelated residuals, 
which may be caused by omitted real variables (like changes in the 
pattern of government spending,'* in income distribution, etc.) that 
are serially correlated or by the fact that the m, variable includes an 
estimation error. In order to check whether the degree of significance 
of the estimated coefficients in (28) is affected by this autocorrelation, 
the equation is reeslimated using the C<Khrane-Orcutt technique. 

The lesulLs are quite similar to those obtained before: 

yf = .038 + 16.8(m, - m,_,)* - 18..5d-*, (29) 

(.008) (2.9) (9.8) 

(ft* = .67, D-W = 2.2, cr = .020, 29 observations, p = .36). 

(.17) 

Including in thi.s regression yielded a /-statistic of only 1.0 for 
its coefficient. The possibility of lagged effects of monetary shocks on 
price dispersion was explored by including the variable (w,-, - t«,-i)* 
in the equation, but the estimated coefficient was found statistically 
insignificant. The /-ratio was 0.3 in the 01.S regression and 1.4 using 
the Cochrane-Orcutt technique. 

” 1 he magnitude of changes in the amount of real government spending, however, 
docs not have any significant explanatory power. 
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The next step is to see whether the dispersion equation is stable 
over the entire period. In fact, the inclusion of the dj,, variable is an 
attempt to control one source of instability in the coefficients. In order 
to carry out this test the sample is divided first after May 1922, 
estimating the equation separately for the two subperiods. This parti¬ 
tion of the sample is the same one adopted previously to test the 
stability of the money growth equation. Then, the exercise is re¬ 
peated, dividing the .sample at the end of 1922. The aim of this 
partition is to see how the model performs after removing from the 
sample the 7 months of 1923, which had a much more unstable 
monetary growth. 

I'he results of these regressions, reported in table 3, can be sum- 
mari/ed as follows. When the sample is divided in May 1922 (first 
column), the results for the first subperiod are fairly weak. Both 
coefficients are insignificant in the OLS equation. Using the 
Cochrane-Orcutt technique the coefficient of (m, — turns out 

significant at the 5 percent level, although that corresponding to tj-J,, is 
still insignificant. During the second subperiod—from June 1922 to 
July 1923—the statistical performance of the equation is much 
stronger. 

1 he second column reports the equations estimated for the periods 
through and after December 1922. Observe that the removal of the 
192.3 portion of the sample worsens the performance of the model as 
judged by the size of the /-ratios. However, the coefficient of the 
monetary shocks still remains quite significant. 

Formal /'-tests fail to reject the hypothesis of stable coefficients 
across the mentioned subperiods. When the sample is divided in May 
1922, the resulting statistic i.s = 1.8, while the 5 percent critical 
value is 3.0. Partitioning the sample at the end (jf 1922 yields the 
statistic of only 0,8, 

The other approach adopted to lest the dispersion equation was to 
treat it as a nonlinear relationship. When (tI, is changing over time, 
equation (15) generalizes to 



+ cjf ) ^ ( + (T* ) 

ka-l,, + Xo-h ^ / Xo-^,-1 + Xo-g y 
AcrL + 0 -?/ I Ao-S„_, + (7? I 


(30) 


+ 0-? 




+ (ri 


m, 


Ao-S,,-, + &i 




As discussed in Section I, the third term in this equation reflects the 
negative effect of erj, on the impact of monetary shocks. The second 
term corresponds to Barro’s (1976) relative price variance expression 
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• , I when 0 < A < /. RecaJJ that A j 

that is affected 7 . ^<t] across markets, anda’(z) + a‘ 

defined as the average of I/[a (z) + a J acr 

is the excess ciemantJ eJasticity of commodtty z. Using again the <r„ 
series, the parameters of equation (30)—A, trf, and trf are estimated 
using a nonlinear least-squares procedure with fairly weak results. 
I'he additional structure given to the equation seems to be rejected by 
the data, as judged by a higher sum of squared errors than in the 


linear equation. 

Observe that a value of zero for X reduces the equation to the linear 
form of equation (15), in which the money variance is constant. The 
estimated value for this parameter was 0.056 with a standard error of 
0.065.'" The interpretation of this result is not that the average 1/ 
la^z) + od] is likely to be close to zero. Instead, it suggests that the 
detailed speciheation of equation (30) is too stringent. For example, if 
the var iance of the relative excess demand shifts, of also changes over 
time: this is more of a problem in this approach, since a? itself is being 
estimated as a constant. 


V. Actual Money Growth and Inflation in the 
Dispersion Equation 

In this section additional variables that were mentioned in the litera¬ 
ture as being ielated to price dispersion are tried in the equation. 
I here is no rigorous theoretical justification for their inclusion. Thus, 
only loose verbal explanations are given. Also, the variables related to 
inflation are clearly not exogenous, and, therefore, any observed 
correlation cannot imply cau.sality. 

.‘irtuni money grotvth and price dupersion .—A variable that can be 
considered exogenous and, if one assumption of Section 1 is violated, 
in principle also relevant for price dispersion is the actual money 
growth, ('-hanges in the money stock can affect the dispersion of 
prices (even when perceived) if the new money is spread unevenly 
across the economy, thereby affecting relative demand in different 
sectors. This type of effect was discussed by Cairnes (1873) with 
respect to gold discoveries. In the framework of the present mcxlel 
this type of effect could be represented by changes in the relative 
excess demand variance erf. Since here the focus is on price change 
dispersion, the corresponding variable in this context is the change in 
the growth rate or the degree of acceleration/deceleration in the 
money stock. 

In order to test this sort of effect, and also to see whether (m, - 

'* was 17.7 with a .standard error of 8.4 and <7? was 0.0008, with a standard error of 
0.0004. 
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is only a proxy for changes in actual money growth, price 
dispersion was regressed on ((A/? - Af?.,)/Af? - 
with the following results: 

y, . ,033 + 2.1 I 
(.006) (.6)' 

(f{* = .33, D-W = 1.5, O' = .028, 30 observations). 

The monetary acceleration/deceleraiion variable has a statistically 
significant correlation with price dispersion. Remarkably, however, its 
explanatory power vanishes when (m, - m,_,)* and 0 ^ are also in¬ 
cluded in the regression. The equation including the three variables is 

y* = .034 + 16.6(m, — m,.,)* — I6.5&^i + 

(4.0) (9.0) 

.25 lMLJdl± - 
(.77)1 Ml MU j 

— .59, D-W = 1.3, O’ = .022, 30 observations). 

This result denies the existence of any effect of relative demands 
following the introduction of new money during this period. It sup¬ 
ports the hypothesis that money affects relative prices only if it is 
currently unperceived. 

Inflatwn and price dispersion. — In his analysis of price behavior dur¬ 
ing the hyperinfiation, Graham describes a positive correlation be¬ 
tween the acceleration of price level and the dispersion of prices. 
Mills’s findings in his study of U.S. prices suggest that dispersion is 
positively correlated with both acceleration and deceleration of the 
price level. 

If the acceleration/deceleration in the price level is related to an 
unperceived monetary expansion or contraction, the theory tested 
here predicts that the correlation mentioned alvove should be cap¬ 
tured by a variable measuring unperceived money growth. I'o test 
whether this is the case here, the variable (ft( — is also included in 
the equation—where fi, is the inflation rale from month t — 1 to 
month t computed from the wholesale price index (see table 4). The 
results suggest that there is a separate correlation between [fi, - 
atid price dispersion. The equation estimated by Ol.S is 

y? = .032 18.6(ni, - m,_,)* - + .074(^, - Mf-i)* 

(.005) (3.0) (8.7) (.031) 

(R* = .67, D-W = .9, o- = .020, 30 observations). 

Given the low D-W statistic, the equation was reestimated by 
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Cochrane-Orcutt: 
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yf = .033 + 19.5(m, - m,-,)* - 25.36-^, + .077(/j., - 
(.008) (2.6) (9.0) (.022) 

{R^ = .77, D-W = 2.00, <r = .017, 29 observations, p = .49). 

(.16) 

Not only does (fi, — itt-if have a statistically significant correlation 
■with price dispersion, but its inclusion in the equation also sharpens 
the performance of the original two variables. Possible explanations 
of this correlatioti could be related, for example, to income redis¬ 
tribution following from unanticipated inflation or substitution be¬ 
tween money and certain commodities as stores of value, when their 
relative costs change. 

Sheshinski and Weiss (1977) consider a model of a monopolistic 
hrm in which costs involved in changing the price of the commodity 
produced generate discrete periodic price adjustments whose mag¬ 
nitude increases with the inflation rate. They suggest that, if the 
timing of these adjustments is independent across firms, higher 
inflation rates imply larger dispersion of individual price changes. 
Hence, this type of argument seems to imply a relationship between 
price change dispersion and /x*.'® 

A role for may be rationalized also on different grounds. 
Chaham mentions the effect of the interaction between government 
regulations and inflation on the dispersion of prices. He refers to 
price controls, such as rent restriction legislation, which generate 
dispeision among the differently affected prices as the general price 
level increases. According to this argument, the dispersion of prices 
will be positively correlated with the price level. Hence, the dispersion 
of pi ice changes will increase with the magnitude of changes in the 
price level—namely with p.?. 

Adding to the equation yields the following results: 


However, ihc empirical implications of Sheshinski and Weiss’s analysis for price 
dispersion do not seem clear to me. An ambiguity anses because of the probably 
po.silive etfccl of inflation on the frequency of price changes that they derived. If the 
length of the observation penod is kept constant, a higher frequency of price change 
may dimmish the measured dispersion of price changes. The possibility of a negative 
effect of inHauon on price dcspersion in this framework can be seen in the following 
example. Assume that the optimal frequency of price adjustments for all hrms is 2 
months and that pat t of the hrms adjust their prices during odd-numbered months and 
the rest during even-numbered months. Using monthly data, disjrersion of price 
change.s will depend on the magnitude of pnee changes corresponding to the group of 
hrms currently adjusting prices. Now as.sume that inflation increases: as a consequence 
the magnitude of price adjustments goes up, and also the optimal frequency is 
increased—say to one per month. Since now all the hrms adjust prices dunng the same 
month, the dispersion of prit e changes collapses to zero, in spite of the larger individual 
price changes. 
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■y? = .031 + 12.2K - - 21.3«r* + .0620*, - M/-.)’ + 036^? 

(.005) (4.1) (8.2) (.030) (.017) 

(/?* = .67, D'W = 1.3, cr = .019, 30 observations). 

The coefficients of /x? and (/*, — ^re both significant at the 2.5 

fjercent level. However, when the equation is reestimated by 
Cochrane-Orcutt, the effect of >** becomes insignificant (I-ratio of 
0.8), while the performance of the rest of the variables improves. If 
the monetary acceleration/deceleration variable is also included, its 
coefficient turns out negative but insignificant at the 5 percent level 
(t-ratio of 1.62 with OLS and 1.55 with Cochrane-Orcutt), without 
materially affecting the other coefficients and standard errors. 


VI, Summary and Conclusions 

This paper tested a model of price dispersion using data on the 
German hyperinflation after World War I. The theoretical frame¬ 
work, outlined in Section 1, predicted that money growth causes 
price dispersion only if it is currently unperceived. 

In order to perform this test, the first task was to construct a 
measure of price dispersion based on a broad enough range of com¬ 
modities to approximate the varying degree of relative price vari¬ 
ability during that period. This calculation was accomplished using 
wholesale price data on a sample of over 60 commodities, including 
foodstuffs, textiles, metals, and fuels. 

The next step was the delicate one of measuring the perceived part 
of money growth. This estimation implied the postulation of an in¬ 
formation set available at each point in time and also a function 
relating the variables in this set to the creation of money. The func¬ 
tion used to isolate the perceived part of money growth was specified 
based on considerations related to the government demand for reve¬ 
nue to finance expenditure. 

A measure of the variance of the unperceived money growth was 
also estimated. This series was used for dealing with the hetero- 
sccdasticity problem in the money growth equation and also as a 
theoretically relevant factor affecting price dispersion. 

The testing of the dispersion equation showed a statistically 
significant correlation between unperceived money—as measured 
here—and price dispersion. Thus, the model is supported to some 
extent by the empirical results. The variance of the money shcKks 
turned out to have a negative coefficient which was only marginally 
significant. Although the money variance has different and apposite 
theoretical effects on price dispersion, this negative correlation is 
consistent with a dominant effect of diminishing the misperceptions of 
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monetary disturbances as relative shifts—and thus reducing the 
dispersion associated with those disturbances. 

In Section V actual money growth was also considered as a price 
dispersion factor, justified as affecting relative demands. However, 
the results indicated no additional effect for the rate of money cre¬ 
ation. The magnitude of changes in the inflation rate does appear to 
have a separate correlation with price change dispersion. The present 
model, however, does not provide an explanation for this association. 
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Exchange-Rate Dynamics: An Empirical 
Investigation 


Robert A. Driskill 

Vmventty of California, Davis 


This paper esiimatcs a redueed-lorm exchange-rate equation whose 
estimates are used to address questions on ext hauge-rate over- 
shttoiing, intermediate-run exchange-rate dynamics, and long-run 
proportionality relationships between relative money supplies and 
exchange rates. Based on Swiss-U.S. data from the period 1973-79. 
the major hndings are that following a monetary shock there is 
short-run exchange-rate overshcKtting by a factor of about two. and 
that subsequent exchange-rate adjustments to a new long-run equi¬ 
librium take longer than 2 years and exhibit nonmonotonic patterns. 


I. Introduction 

Much of the recent literature on floating exchange rates has viewed 
relative money supplies as the bask determinant of exchange rales 
and relative price levels. While this literature is in agreement about 
the long-run relationships between money, exchange rates, and price 
levels, it differs in its predictions about short- and intermediate-run 
exchange-rate determination.' In particular, one strand of this liter¬ 
ature, mainly associated with the work of Dornbusch (1976), has 
emphasized the role of slowly adjusting commodity price levels for 
short- and intermediate-run exchange-rate dynamics. The most 
striking implication of the Dornbusch model is that the exchange rate 
may “overshoot" in the short run: In response to a change in relative 


Thi.s study stems I'roin iny doctoral dissertation, liratehil acknowledgments are due 
It) niy primary advisor, Jiirg Niehans, and also to Bela Balassa, Jeffrey Frankcl, Dale 
Hendeison, George Kanatas, Ed Kane, Jerry Thtirsby. and an anonymous referee. 

' For a discussion of these different approaches, see Bilson (1978), 
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money supplies, the exchange rate may immediately change by more 
than its long-run equilibrium value. Another implication of the model 
is that, following an initial overshoot, the exchange rate then mono- 
tonically approaches its long-run equilibrium value. Both of these 
implications result from the key assumptions of slowly adjusting 
commodity prices and perfect capital substitutability and mobility. 

Within the theoretical framework of slowly adjusting commodity 
prices, an alternative to Dornbusch’s view of exchange-rate dynamics 
has developed, an alternative which emphasizes stock/flow interac¬ 
tions and relative-price trade-balance effects. This literature, devel¬ 
oped by, among others, Bran.son (1976), Niehans (1977), and Hen¬ 
derson (1980), alters the strong Dornbusch conclusions about short- 
run overshooting and monotonic exchange-rate and price-level 
adjustments to long-run equilibrium, instead permitting short-run 
undershooting and nonmonotonic exchange-rate and price-level 
adjustments. 

The purpose of this paper is to examine empirically the predictions 
of these exchange-rate dynamic models against Swiss/U.S. data over 
the period 1973-77. Specifically, the paper addresses the questions of 
whether there is short-run exchange-rate overshooting, whether the 
intermediate-run exchange-rate and price-level adjustments are 
nonmonotonic, and whether the long-run proportionality relation¬ 
ships between money supplies, price levels, and exchange rates hold. 

This is not the first attempt to test a model of exchange-rate deter¬ 
mination based on the Dornbusch exchange-rate dynamics approach. 
Frankel (1979) extends the Dornbusch model by incorporating secu¬ 
lar inflation rates and tests it against the deutschmark/dollar rate, and 
Bilson (1978) provides suggestive evidence about the appropriate¬ 
ness of the Dornbusch approach for the deutschmark/pK)und rate. 
Neither test, however, uses an exchange-rate equation rich enough 
to incorporate trade-balance responses to relative price changes and 
thus exclude the possibility of more complex, perhaps oscillatory 
exchange-rate adjustment paths and exchange-rate undershooting. 

The major empirical findings from this study are as follows; (1) The 
exchange rate overshoots in the quarter in which a monetary change 
takes place by a factor of about two. (2) The exchange-rate adjustment 
path to full equilibrium is not monotonic, as predicted by (he original 
Dornbusch fixed-output model, but rather exhibits periods of ap¬ 
preciation and depreciation. The price level, however, adjusts mono- 
tonically. (3) Purchasing power parity holds in the long run. As noted 
above, there is wide theoretical agreement that this should be the case, 
but it has often been difficult to verify empirically. Furthermore, the 
“long run” is calculated to be approximately 2-3 years. 
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IL The Reduced'Form Equations 

The goal of this section is to motivate the reduced-form exchange- 
rate and price-level equations which are estimated in the following 
section. Reduced-form equations are, of course, consistent with a 
variety of structural models. What is done here is to develop two 
structural models, the Dornbusch model and a stock/flow model de¬ 
veloped by generalizing the Dornbusch model to permit imperfect 
capital mobility. Both of these structural models impose a priori 
constraints on the reduced-form parameters and thus can in principle 
be rejected by the data. Of course, neither structural model may be 
sufficiently complex, and a “true" structural model may impose few 
theoretical predictions about a priori reduced-form constraints. This 
possibility emphasizes the fact that the reduced-form empirical esti¬ 
mates are more general than either structural model. 


A. The Dornbusch Model 

This subsection recasts the Dornbusch model in discrete time. The 
model consists of three basic building blocks: a money-market equilib¬ 
rium condition, a price-level adjustment equation, and an uncov¬ 
ered-interest-arbitrage specification. Assuming that both the domes¬ 
tic and foreign country have identical structural parameters, we can 
specify money demand as: 

mf - p, + (py, - \r, + v„ (1) 

where m is the log of the ratio of the domestic to foreign money 
supply,/) is the log of the ratio of the domestic to foreign price let el, y 
is the log of the ratio of domestic to foreign real income, r is the 
difference between domestic and foreign interest rates, \ is the inter¬ 
est rate semielasticity of the demand for money in each country, (p is 
the income elasticity of the demand for money in each country, andi', 
is a serially uncorrelated random variable with zero mean and vari¬ 
ance o-f.. 

Each country’s money supply is assumed to be exogenously con¬ 
trolled by the respective government. Furthermore, all changes in the 
relative money supply are assumed to be unanticipated; this assump¬ 
tion is important because it implies that this period’s relative money 
supply is also the expected supply for all future periods. Assuming 
equilibrium obtains each period, we can write the equilibrium condi¬ 
tion as: 


m, = (m,)" =p, + <Py, - Kr, + v,. 


( 2 ) 
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In the goods market, relative demand for output depends on rela¬ 
tive real income, the relative interest rate, and relative prices and has 
the following functional form:* 

log D, = yy,-(Tr, + - pi), (3) 

where e is the log of the exchange rate and I) is demand for domestic 
output. 

The rate of relative inflation,, — p„ is proportional to the log of 
the ratio of relative demand to relative supply in the goods market: 

/),+ ,-/», = S(logO, - y,). (4) 

Combining equations (2), (3), and (4), we can write the relative price 
equation as; 


pi = ^nyii + + flae,-,, (5) 

where: 8( 1 “ v) + = 1 — S<t/X — Sa),a 2 = 8al\, anda^ = 

5a>. 

The final building block of the Dornbusch model is a joint assump¬ 
tion of uncovered interest arbitrage and exchange-rate expectations. 
I'he uncovered interest arbitrage assumption can be stated as 

r, - X, = 0, (6) 

where .v, is the expected change in e from f to t + 1. The assumption 
about .X, is that it is a fraction of the gap between the current and 
long-run equilibrium exchange rate. Given the assumption that the 
lelative money supply follows a random walk, this implies the follow¬ 
ing relatiimship; 

X, = 9{m, — e,) + k, 0 < 9 < \, k a constant. (7) 

Combining equations (2), (5), (6), and (7), we tan derive the following 
reduced-form equation: 

e, = TTo -t- TT,?,-, -t- TTjm, + TTam,., -t- iTipt-, + n^y, 

+ TT^y,-, + iTjZ,, ( 8 ) 

where 2 , is a first-order serially correlated random variable and where 
the TTf's satisfy the following constraints;’ 

* Output demand could be specified as a function of the rrtU interest rate. In Ixjth the 
Dornbusch and the slock/flow models, this change makes no difference in the <tualita- 
tive implications derived for the reduced-form equations. 

^ Details are found in Appendix A. 
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I 


TT, = 1 

Va < 0 

TTl < 0 

TTa < 0 

7r .j > 1 

TTs < 0 


TTb < 0. 


The constraint on the sum of the first four 7r,’s simply says that 
purchasing power parity holds in the long run. The constraint that 
iTj > 1 says that there must be short-run overshooting. 


B. The StocklFlow Model 

As noted in the Introduction, the stock/fiow model is developed by 
generalizing the Dornbu.sch model to permit imperfect capital mobil¬ 
ity. To do this, a net demand for foreign assets is specified as a linear 
function of the expected net yield; 


~ Vi.Xi~ n), •») > 0. (9) 

In addition, a trade-balance equation is specified as a linear function 
of the log of relative prices and the log of relative real incomes; 

7', = a(e, - pi) - py, + u„ a,^ ^ 0, (10) 

where w, is a zero-mean, finite-variance, serially uncorrelated random 
variable.'* 

The foreign-exchange market-clearing equation states that net 
capital Hows equal net trade Hows plus all other autonomous Hows 
(assumed constant); 


^B, = r, + A,. (11) 

where A, is a constant. Replacing equation (6) in the Dornbusch model 
by equation (11), we can derive the following reduced-form ex¬ 
change-rate equation; 

= ^0 + + trjwi, + Trim,., 

+ Vtp,-, -t n'^y, + TTsy,^, + ir-z',. (12) 

The tt/’s satisfy the following constraints;' 

* t he leim T, could be .spccitied as a lunclion of ciirrem and lagged relative prices 

where the current-price effect is perverse, t.e.. creates a effect, this speahea- 

lion i.s indistinguishable from (10) in terms of re.strirtions on reduced-form param¬ 
eters. 

* Detatls are tn Appendix A. 
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£ tt; = 1 

I-1 

v'l < I 

ir;>0 Tri S 0 Tri^O. 

While the purchasing power parity constraint holds just as with the 
Dornbusch structural model, the other constraints are quite different. 
In particular, both the lagged exchange-rate and lagged price-level 
coefficients may be positive in this model. The estimated sign of these 
coefficients should provide a sharp test between these two models. 
Also note that ttj need not be greater than one; this model makes the 
question of overshooting or undershooting an empirical question. 


iri>0 


III. Estimation Results 

The reduced-form equations (5), (8), and (12) were estimated for 
Swiss/U.S. data with two changes. First, after some experimentation 
with various proxies for Swiss income (data on which are available 
only on a yearly basis), the income variables)!, and)i,_, were dropped. 
None of the proxies used in the experimentation provided significant 
coef ficients, and the remaining coefficients were hardly affected when 
income was dropped. Second, two dummy variables were added to 
the exchange-rate equation: OIL, which takes the value of I for 
December-January-Fcbruary 1973-74 and zero for all other periods, 
reflecting the improved attractiveness of dollar-denominated assets 
following the announcement of the oil embargo and cuts in Arab oil 
production; and SEAS, which takes the value of 1 every December- 
January-February and zero all other periods, capturing the pro¬ 
nounced year-end demand for Swiss francs by Swiss firms for end- 
of-year "window dressing” of their financial statements.® 

The choice of the Swiss franc/U.S. dollar rate for the period 
1973-77 is based on the relative “cleanliness” of the franc float, the 
substantial variation in the franc-dollar rate, and the independence 
over this period of the franc and dollar from the “snake.”’ 

The estimation uses quarterly average data, where the quarterly 
data were generated by averaging three monthly end-of-period 
figures; that is, March-April-May is the average of March, April, and 
May. This provides a sample size of 19 observations, beginning March 

* Schjltknecht (1976) discijs.scs this jtoim. 

’ Schiltknccht (1976) indicates that the Swiss have no exchange-rate targel.s for the 
■Swiss money supply at least for i>eriods longer than a month or two. 
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1973 and ending November 1977* Even though monthly data would 
have had the advantage of more observations, the quarterly speci¬ 
fication was used since it is more consistent with the theoretical 
models developed, that is, money-market equilibrium and trade-bal¬ 
ance adjustment. While it would be possible to specify partial 
asset-adjustment schemes and longer trade-balance lags and use 
the monthly data, this would introduce substantial multicollinearity 
problems, precluding precise parameter estimates. 

Sources and methods of deriving the data are explained in Appen¬ 
dix B. The price index used was the consumer price index, and the 
monetary aggregate used was M3. Not surprisingly, the results were 
sensitive to the choice of monetary aggregate: Over the period in 
question, U.S. Ml demand has not been very stable in comparison 
with M3 demand.® In the presentation of estimated equations,/-statis¬ 
tics are in parentheses beneath the estimated coefficient, and Durbin’s 
H statistic is presented.*® All estimations were done using White’s 
(1978) SHAZAM regression package. 

The estimated relative price equation is: 

p, — .60 + .76^,., + .37m,_| — 

(1.90) (4.58) (1.96) (.73) 

(adj.) = .96, Durbin’s H = 1.67. 

Hence,rt| = .76, Uj = .37, and Oj = -.03. Both n, and Uj are significant 
at the .05 level for a one-tailed test, and the hypothesis that a, + + 

fij = 1 is not rejected at the .05 level. Since u, in (13) is insignificant 
and the wrong sign, the equation was also estimated with only/>,_, and 
as regressors." The estimated equation is: 

p, — .57 + .78p,-, + .29m,.,, 

(1.86)(4.74) ' (1.89) 

(adj.) = .96, Durbin’s// = 1.65. 

Again, bothu, and (12 are significant, and the hypothesis that ri, + a j = 
1 is not rejected at the .05 level. Finally, the equation was estimated 


"A regression using cncl-of-[)eriod quarterly data 1 rom 1973:11 through 1977.111 is 
al,so reported. More will be said about the rhoke between aserage and cnd-of-()erKKl at 
that point. 

“See Coldfeld (1976) for evidence on this point. 

In regressions with a lagged dependent variable. Durbin's H statistic is the appro¬ 
priate statistic to test serial correlation of the disturbante terms. For a sample as small as 
used here, however, the usefulness is diminished. 

" See Appendix C for a derivation of the condiuons under which a, 0, but the 
trade balance is still a funcUon of relauve prices. Basically it requires ihai the trade 
balance be a very small fraction of total aggregate demand. 
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with the constraint that «,+«,= 1 imposed: 

p, = .57 + .7 !/>,_, + .29m,_,. . 

(1.84H4.47) (1.86) ^ ’ 

4'he estimates of the relative i)rice etjuations, then, ate consistent with 
the model and have good explanatory power.'- 

I’reliminaiy to testing the exchange-rate equation, it is tiseful to see 
whethei the assumption used in both structural models that the rela¬ 
tive monev supply follows a random walk, that is, that w, = + x,, 

where is a serially uncoi related random variable with zero mean, is 
reasonahiv consistertt with the sample data. To do this, two proce- 
dttres were trsed. First, m, was regressed against a constant term plus 
the i)te\iorts laggerl values of wt,. A repr escniativc result is: 

w, = 1.5 • 99w, I - .() 8 /;;,_., + .77>»,_., - . + .25 

(.711(2.88) (-1.08) (-.14) (l.:50) ( .70) (.79) 

( 16 ) 

Ortly the coefficietit on is signifuant, and it is trot significantly 
din'etetu from one. The second proteduic trsed .t lest suggested by 
box aird Jetrkiirs (1970, p. 291) for t.tndomncss in a residtral vector 
from .t tune-series jriocess. For this test, m, w.ts regressed on a ton- 
st.rnt attd ott w, , with the following result: 

w, = .01 + 1.01w,_,. 

(.09)(13.9) ' ^ 

Residrt.rls from thts eqitation ,ind from art eqrration in which the 
(ortst.rrtt ter in .ritd , coeffitietrt were (otrsiiained .it (0. I) were then 
used in the afoternentionetl test lor noniandomness. Nont.indom- 
uess vv.is r ejec ted lor both sets ol r esiduals .it the 10 per t ent level, 
(ionsecjiiently, the assumption that all changes in the relative money 
supply .lie unaniic ipated seems justilied. 

I he csiim.iie of the exchange-rate equ.ition is '* 

'• I he icl.iiivc pine emi.iium cm lie <lis.iK(4ie)'.clecf hy coiinuv .me) wiitleii m rtie 
lollimiMK hum F)" = r ,/,/;}'• r h + c/.mp', r cf-.m)'', II ihe 

stiucuii.il ccjujlions ot S\vit/eiland aiicl the Unitctl .Sl.ites aic imleed idenlicti, thei//s 
shciuld ^.^rl'.l> ihe lolhiwin^ lestiietimw. — I.//, - -a,, ~ This ecpi.ilKm was 

estinialcd, .iiid ihc atiosc liypolhesisahoiit ihedfs was not lejerled at the O.'j level, the 
standaid eiiors. howc'vei. were cpnie lai|;e. 

" .As with the pi ICC eipialioii, the esihaiige-i.tle equation was also esiiinalcd wil)i 
couniiy-dis.ij'giegalcd data to test the assumption ol irlenlic.il sliutiui.il equations 
.irioss (ountiies. The eqii.iiioii is e, ^ A„ - 1 - A.m,'" + Aimf'’ + A.m,'''', + Ajm)’.'', f 
I The roeOuiciit icstrietions consistent with idcniic.il stvut- 
tiiial paianicteis atioss counlncs aic: A, = -A^.A, - -A,,A,= - Aj. Ihc hyitolhcsis test 
ol these icstiictions was not icjeetcd at the .05 level, the assumption ol identical 
structnial pal.iineteiscannot he lejected. Fuidietinoic, the |K)inl estimates wcie lathci 
siinilai to those ol eq. (IH) and salislied all the othei a piioii cocfhcicnl lesiiic- 
tions. 
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e, = -2,22 + + 2.37m, - 2.45m,+ mp,., 

(-2.82)(3.65) (5.73) (5.60) (2.23) 

+ .I50IL - .06SEAS; RHO = .35. 

(7.61) (-5.47) (1.37) 

(adj.) = .99. Durbin’s H = .21. 

where RHC) is the estimated first-order autoregressive parameter of 
the error term.''* 

The overall explanatory power of the equation is quite good. Fur¬ 
thermore, all toefhcients except RHO are significantly different from 
zero at the .05 level for a one-tailed test. The Durbin H statistic 
indicates no serial corielation in the disturbances. 

As far as a priori constraints on the parameters are concerned, first 
note that the sum of rr, through is 1.28, which is insignificantly 
different from one at the .05 level. That is, the estimated equation is 
consistent with the putchasing power parity proportionality predic¬ 
tion al)out changes in rel.ttive money supplies and changes in long- 
run equilibiium exchange rates. To further test this result, equation 
(17) was also estim.ited with this consltaint imposed; 

c, = -2.38 + .55c,_, + 2.30m, - 2.69m,_, + .84/i,-, 
(-2.89)(4.48) (4.92) (5.79) (1.91) 

(19) 

+ ,16011. .OfiSF.AS; RHO = .21. 

(7.25) (-5.11) (.79) 

riiese coeflitieius are <|uite close to the unconstrained ones, thus 
providing fuither evidence that the putchasing power parity princi- 
ple holds. 

Both struc'tur;tl models developed in .Section II imply that the 
purchasing power parity constraint should hold. The two models 
difiered shat ply, though, on their predictions of the sign of the 
lagged e\change-i ate and lagged pi ice-level coefficients. The estima¬ 
tion results of positive coefficients on these variables are consistent 
with the stock/llow model izui inconsistent with the Dornbusch one. 
Thus, the data reject the Dornbusch model. Note, though, that the 
Dornbusch model’s major insight about overshooting is \erified as an 
empirical phencjtnenon. 

’Fw'o more procedures were carried out to examine further the 
lobustness of the preceding empirical results. First, the exchange-t ate 
equation was estimated using end-of-|>eriocl rather than average data. 

Estiiiicition ol (18) w.is clone using .i mcHliliecl ('.cxhi.ine-Oicutt picxwluic in the 
siiAZ.AM computer program developcsd by Kenneth White, ('.rid search and maxi- 
niiiin likelihcxid nonlineai metherds picxluced almost identic.il estimates. 
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(18) 



205 JOURNAL OF POLITICAL ECONOMY 

While using end-of-period data increases the likelihood that any one 
data point is an outlier, it has the advantage of not possibly confusing 
anticipated and unanticipated changes in the money supply. The 
results are: 


e, = -3.16 + .56e,^, + 1.69m, - 2.47m,_, + l.I7/»,_, 
(-2.66) (3.44) (2.44) (-3.50) (2.14) 

+ .210IL - .07SEAS; RHO = -.14, 

(5.63) (-3.44) (-.50) 


( 20 ) 


R'^ (adj.) = .97, Durbin’s 7/ = —.34. 

The estimated coefficients are significant at the .05 level, have the 
same signs as those of equation (18), have similar magnitudes, and 
satisfy the purchasing power parity constraint; indeed, tt, = .94, 
which is not significantly different from one at the .05 level. 

Second, an in-period dynamic simulation of the exchange-rate 
equation was performed. This is important because of the presence of 
a lagged dependent variable and tends to show whether the estimated 
1 egression simply picks up the serial correlation in the dependent 
variable. The forecast error and squared forecast error were then 
regressed against time to see whether there is a tendency for it to 
increase with time. The results are as follows: 


Forecast error = .05 - .0002(time), 
(3.83X-.14) 


( 21 ) 


Forecast error = .003 + .00003(time)^. 
(2.15) (.22) 


( 22 ) 


This provides rather strong support for the empirical results. 


IV. Concluding Comments 

This paper has addressed those issues in exchange-rate determina¬ 
tion raised by the recent work on exchange-rate dynamics. The 
exchange-rate equation developed was flexible enough to be consis¬ 
tent with, among others, the .structural model of Dornbusch (1976) 
and a stock/flow model incorporating features emphasized by Nie- 
hans (1977), Henderson (1979), and others. Both structural mod¬ 
els implied that a purchasing ptower parity a priori constraint on the 
reduced-form parameters should be satisfied; the estimates did not 
reject this hypothesis. Other a priori constraints implied by the Dorn¬ 
busch model were rejected, though. There was, however, empirical 
verification of Dornbu.sch’s overshooting hypothesis. 
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To help summarize the empirical findings on these questions, esti¬ 
mated coefficients from equations (15) and (18) are used to calculate 
an exchange-rate and price-level adjustment path following a one¬ 
time monetary increase. To do this, equations (15) and (18) are 
transformed into final form (suppressing all random disturbance 
terms): 


= Co + ^ Mi»«»-i. 

Co a constant. 

(23) 

m 

1 

Co a constant. 

(24) 


where the fi,’s and e/s are as follows: 

Ml = ^ 2 . 

K 2 = (•n’n + -n-i-n-j), 

Ma = (’Ta + w,7rj)7r, + fljTr^, 

Ms = (rra + rr,'jrj)7r? + + a,). 

Ms = (^3 rr,ir2)fli + a2iT4{iT] + a,Tr, + a]). 


Mn (^3 + fl'irrz)7r';'‘ + + ttJ ’n, + '•a? + 

Ci = a^, 

€2 — fl 2 ®l> 

€3 = 


€, = a^a", 

Using these final form coefficients and assuming the initial exchange 
rate and relative price level are (e = 0, p = 0), then, following a 
one-unit unanticipated monetary increase, we can calculate the ad¬ 
justment path as follows: 
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= 2.30 

A 

1 = 0 

fo -Po 

= 2.50 

f 0 

= .88 

= .34 

fii 

= .29 

= .50 

1 1 

= .59 

= -.16 

II M 

= .21 

.27 

h 

P* '■ 

= .65 

= .75 

-As = 

- p4 = 

= -.44 

■■ -.48 


.39 

Pr, = 

= .82 

Cs -Av = 

-.45 


.4H 

/»« = 

= .87 

Cb ~ pit ~ 

-.39 

c? = 

.54 

pi ~ 

.91 

fl - pi - 

-.37 

Ch “ 

.61 

P» = 

.94 

11 

X 

1 

X 

-.33 

Cs, = 

.67 

P» = 

.96 

- p9 = 

-.29 

^10 ^ 

.73 

Pto ~ 

.97 

fio - pM> = 

-.24. 


The exc hange rate initially depreciates to 2.30, then appreciates for 
three quarters to 0.21, and then depreciates again. After 11 quarters, 
about three-quarters of the exchange-rate adjustment has taken 
place, and 97 percent of the price adjustment has occurred. Notice 
that deviations from purchasing power parity are positive for only the 
first two quarters, after that remaining negative. This means that 
exports are stimulated and imports suppressed because of increased 
relative prices for only one quarter, and then the effects are re¬ 
versed.’’ Note also that the exchange-rate path to long-run equilib¬ 
rium is. as anticipated, nonmonotonic, showing periods of both ap¬ 
preciation and depreciation. 

To keep these empirical results in perspective, some of the more 
serious limitations should be noted. First, the exchange-rate equation 
tested was based on a bilateral model, while in reality the n - 1 
independent exchange rates of an n-country world are functions of 
exogenous variables of all n countries. The equations estimated here, 
then, are subject to omitted-variables specification bias. If both the 
correlation of these omitted variables with the included variables and 
the expected sign of the coefficient of these cjmitted variables can be 
determined, then the bias in the estimated coefficients can be deter¬ 
mined. At present, not even the expected sign of the omitted 
coefficients has been determined.'* 

Second, the equation estimated is a reduced form from whose 


" Note that the cumulative trade balance over the adjustment path must be r.ero, 
since there is no net accumulation of foreign assets. 

'• See Berner et al. (1975) for an attempt to develop an n-country model. 



exchanor-rate dynamics 

estimates structural parameter values cannot be retrieved. While the 
reduced-form estimates seem plausible, they would be more credible 
if they were shown to be consistent with existing estimates of an 
underlying structural model’s parameters. Still, the results provide 
general support for the monetary approach to exchange-rate deter¬ 
mination and specific support for the importance of both slow 
commodity-price adjustments and current-account relative-price ef¬ 
fects within that approach.'^ 


Appendix A 


I. Derwtilwn of Reduced-Form ('oiL\lrntnl\ from the Dornhw^ch .Model 
Combining equation (7) with (6) and (2). we gel; 

= m,(l + \/dK) -i- (Al) 

Suiistiliiting (5) for p, we have: 

e, ~ e,.^{~(i,/0K) + m,(l + l/6\) + -t- p,,,(-a,/9X) <A2) 

+ y,(-4>l6\) 


II. nenvrilion of Reduced-Form Constraint from the Stock/Flow Mode! 

Replacing equation (6) of the L)oi nbusch model with equation (11) we get the 
following diffetenre equation: 


e. 


+ OT,-, 


ndfra^la - nlX) ^ ^ ; 1)6 + T?/X \ 
T)d + a * I ifd + a ) 

/ -71$ - nPs - ai-qlh + a-fit \ ^ I 
\ vO + a I '^'■'1 


(I - a,)ri/h + a«i 

T}0 -t- a 


(A;P 


+ ,v< 


( 


-®—1 
7)9 + a I 


+ yi-> 


ii,i(a - 7)IK) 
7)9 t- IK 


Note that the coelficient on w, may be less than one, the toefficieni ol e, may 
be positive, and the coefficient on must be positive. 


Appendix B 

I. Sunlierland 

!. Exchange rate. — End-of-period data are end-of-monlh single obsen ations. 
Source: OECD Main Economic Indicators, Paris, various issues. Quarterly 
data are averages of daily nvxin New York rates. Source: tederal Resen'e 
Bulletin, Washington, various issues. 

2. Money supply. — The M3 is taken from International Financial Statistics 
o( the IMF. It apfiears to include some demand de|}Osits denominated in 


’’Again, see Bibon (1978) for a dLscii.s-sioii of how the Dornhiisch approach (its 
within the general monetary framewoik. 
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foreign currencies. From figure-s supplied by Peter ^ 

National Bank, it appeals that this component is a small 8 r ,9 

M3 (less than 4 perc'ent) and fairly stable. These figures ^ “ 

nonconsecutive months, which precluded u.sing them for estimauon pur- 

CVirnmncr /.nee inde;(.—Source: OECD Main Economic Indicators. These 
are monthly end-of-period figures. 


II. United Stales 

1 , Money supply. —Source: FederalHe.seri'e Bullehn. Seasonally unadjusted data 
were used lor the quarterly estimations. 

2. Consumer price index. —.Source: OECD Main Economic Indicators. 


Appendix C 


This Appendix shows thatin the price equation may be close to zero and the 
tiade balance still a function of relative prices if the trade balance is a 
relatively small fraction of aggregate demand for a country’s output. LetD be 
the ratio of domestic to foreign aggregate demand, F and E* domestic and 
foreign expenditure, respectively, that is, C + I + G, and P relative prices 
between the two countries' outputs. The question is what implicit assumption 
makes (d log D)l(d log P) close to zero. Now, 


n = 


E + r 

E* - T’ 


(Cl) 


where T is the trade balance. Hence, 

d log n ^ 1 d{E + T) \d{E* - T) 

dlofcP (E+T) d\ogP (E*-T)d\nP 

(CjZ) 

= p d(E+T) _ P D(E*-T) 

(E + T) dP (E* - T) dP 

Now define the trade balance elasticity asPT'/T = e. Assuming thatdE IdP = 
dE^IdP = 0 and that E = E*, we have 

= __2e_ (C3) 

dIogP (EIT)~{T/E)' 

Hence, (d log I) )l{d log P ) goes to zero as E IT goes to infinity, dc.spite the size 
of e. Note that this does not say that trade balance responsivene.ss to relative 
prices does not affect exchange rates; it only does not affect aggregate 
demand. What is more important for exchange-rate determination are the 
relative sizes of trade Hows and capital Hows. 
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The Family as an Incomplete Annuities Market 
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FaniilifS can sell-insure against uncertain dates oi death through 
iniphcit or explicit agreements with respect to consumption and 
ttiterlamily tratisf'ets. Inlerfamily transfers necci have nothing to do 
with altruistic feelings; they may simply reflect risk-sharing behavior 
of completely selfish family members. Although family annuity mar¬ 
kets arc incomplete, even small families tan substitute by more than 
70 percent for perfect market annuities. Given adverse selection and 
transaction costs, family ri.sk pooling may be preferred to public 
market annuities. In the absence of public antiuitics, these risk- 
sharing arrangements provide powerful incentives for marriage and 
fdtmly formatictn. 


rhf institution of the family provides individuals with risk-sharing 
oppcrrtunitics which may not otherwise be available. Within the family 
there i.s a degree of trust and a level of informatican which alleviates 
three key problems iti the provision of insurance by markets open to 
the general public, namely, moral hazard, adverse selecticm, and 
deception. In addition, provision of insurance within the family may 
entail smaller transaction costs than arise in the purchase of insurance 
on the open market. I’here are a number of important risks for which 
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Educauon and the Nauonal Bureau of Economic Research. Any opinions expressed are 
solely our own. We wish to thank Finis Welch. Joe Ostroy, Bryan Ellickson, John 
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sions. 
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the “public” market problems of moral hazard, adverse selection, and 
deception are especially severe. The risk of loss of job or earnings 
because of changes in the pattern of demand or partial disability is 
one example. Here the ability of the public market to determine the 
extent to which the individual actually suffered an earnings loss or is 
simply lying about his backache is highly questionable. Other exam¬ 
ples are the risk of bankruptcy and the default risk on personal loans. 
Many family practices in dealing with these types of risks can be 
explained as implicit insurance contracts made ex ante by completely 
selfish family members. Love and affection may be important for the 
enforcement of some of these implicit contracts, but they need not be 
their sole or even chief determinant. Healthy brother A’s support for 
disabled brother B may simply be the quid pro quo for brother B’s 
past implicit promise to support A if A became disabled instead of B. 

The existing economics literature on marriage and the family (in¬ 
cluding Schultz [1974] and Becker, Landes, and Michael [1977]) has 
not, to our knowledge, explicitly considered the family’s role in 
providing insurance to family members. 

This paper is concerned with family provision of insurance against 
the risk of running out of consumption resources because of greater 
than average longevity. The problem is how fast to consume over time 
when one does not know how long one will continue to live. Too much 
consumption when young may mean relative poverty later on if one 
lives “too long": alternatively, excessive frugality when young involves 
the risk of dying without ever having satisfied one’s hunger. A com¬ 
plete annuity market permits an individual to hedge this uncertainty 
of the date of death by exchanging his initial resources for a stream of 
payments that continue as long as the individual survives. We demon¬ 
strate here that implicit risk-sharing arrangements within marriage 
and the family can substitute to a large extent for the purchase of 
annuities in public markets. Since the number of family members 
involved in the risk pooling is generally small, these family risk¬ 
sharing arrangements constitute an incomplete annuities market. 
However, our findings suggest that even small families can substitute 
by more than 70 percent for a complete annuity market in pooling the 
risk of death. When the economic structure of society is sufficiently 
developed to sustain organized public insurance markets, implicit risk 
pooling within an incomplete family annuity market may well be 
preferred to public purchase of annuities because of adverse selection 
and transaction costs.' When organized insurance markets do not 


' The transaction costs we have in mind here include the omc costs involved in 
negotiating individual specific annuity contracts. As we demonstrate in the text, each 
individual's optimal annuity contract depends on his rate of time preference, his degree 
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exist, the analysis here indicates that implicit risk-sharing arrange¬ 
ments can provide powerful economic incentives for marriage and 
family formation. 

Throughout the paper individuals are assumed to be completely 
selfish: that is, they obtain utility only from their own consumption. 
One implication of this approach is that voluntary transfers from 
children to parents or bequests and gifts from parents to children 
need have nothing at all to do with altruistic feelings; rather, they may 
simply reflect risk-sharing behavior of completely selfish individuals. 
While altruism per se is not required, some level of mutual trust and 
honesty is required since elements of these arrangements are not 
legally enforceable. 

This paper is divided into four sections, the first of which describes 
optimal consumption behavior for a single individual in both the 
presence and absence of a complete annuity market. The welfare 
gains from access to a complete and fair annuity market are calculated 
for the case of the iso-elastic utility function. This welfare gain is, in 
turn, decomposed into income and substitution effects. This decom¬ 
position suggests that an important component of the gains from 
access to complete or incomplete annuity markets is the desirability of 
substituting future for current consumption. 

Section II develops the theoretical argument for Pareto-efficient 
implicit family annuity contracts and explores potential welfare gains 
arising from these arrangements.'^ Although the complexity of the 
calculations precluded analysis of large families, quantitative results 
for families of two and three persons are presented. The analysis 
considers cases in which family members both do and do not have 
identical survival probabilities (i.e., are of similar and dissimilar ages 
and sexes). This framework permits us to ask whether marriage 
between individuals with similar survival probabilities is more efficient 
than marriage between individuals with dissimilar survival prob¬ 
abilities. 

Optimal family annuity contracts involve agreements on the con¬ 
sumption path of each family member as well as a commitment on the 
part of each member to name the other members as sole heirs in his 
estate. Section III discusses the problems of enforcing both aspects of 
these agreements. Section IV summarizes the paper and suggests 
areas for future research. 


of nsk aversion, and his survival probabilities. Some individuals may prefer a constant 
annuity stream, others an increasing or decreasing stream of annuity payments. 

‘ Koilikoff and Spivak (1979) present a pr<»f that family annuity contracting con¬ 
verges CO a complete annuities market as the number of family members increases. 
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I. A Single Person’s Consumption Plans with 
and without Fair Annuide^ 

In the absence of an annuity market, a single individual’s consump¬ 
tion choice problem is to maximize his expected utility, equation (1), 
from current and future consumption subject to the budget con¬ 
straint, equation (2): 

EU (1) 

f-0 

( 2 ) 

f"0 

The P^ s of equation (1) are probabilities of surviving from age zero 
through age t\ P^ equals one. The term D is the maximum longevity. 
For simplicity, we assume the utility function is separable in con¬ 
sumption (C,) over time. In (2) R, the discount factor, is one plus the 
interest rate. The initial wealth of the individual is we ignore 
possible streams of future labor earnings or inheritances.'* 

The budget constraint written in equation (2) is identical to the 
budget constraint that would arise in a certainty world in which 
individuals never died before age D. While individuals will, on the 
average, die prior to age D, equation (2) reflects the nonzero prob¬ 
ability that an individual will live through age D-, that is, equation (2) is 
the relevant budget constraint because the individual may actually live 
through age D, in which case his realized present value of consump¬ 
tion cannot exceed his budget. 

Let us now assume that the single person is free to purchase ac- 
tuarially fair annuities in a complete public annuities market. The 
budget constraint in this case is 

D 

y P,C,R~‘ = W,. (3) 

<•0 


" Yaari (1965) is the pioneering paper on this subject. Sheshinski and Weiss (1981) 
provide an illuminating discussion on the interacuon of annuiucs and social insurance. 
Barro and Friedman (1977) provide an analysis of the risks of the uncertainty of the 
date of death. 

* The gains from access to an annuities market are greatest when the individual has 
all his resources up front. This assumption, then, dramatizes the demand for annuiucs; 
but dropping this assumption would not alter the theoreUcal point that families can 
substitute for annuity markets. For the sake of completeness one can think of the 
individuals dcKribed in this paper as having received all their resource streams prior to 
their current age. In the no-annuity, no-family world involuntary bequests can be 
thought of as being collected by the government and redistributed to individuals at 
their birth. 
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In contrast with (2), (3) requires only an equality between the ex¬ 
pected present value of' consumption and initial wealth. The single 
individual now chooses his optimal consumption path by maximizing 
(1) subject to (3); he then exchanges his initial wealth with the 
insurance company in return for its promise to pay out the C, stream 
as long as the person continues to live. 

The in (3) may be thought of as prices. Since each of the P,’s 

in (3), except P„ which equals unity, is less than one, the consumption 
choice in the case of a fair annuity market is equivalent to the con¬ 
sumption choice without an annuity market but with lower prices of 
future consumption. Obviously, access to a fair annuity market in¬ 
creases utility by expanding the budget frontier; it also alters the 
optimal consumption path because of the income and substitution 
effects resulting from the lower prices of future consumption. 

The iso-elastic utility function (4) is convenient for assessing the 
potential gains from access to a fair public annuities market as well as 
the gains from family annuity arrangements; 

c) 

In (4), y is the constant relative risk-aversion parameter, and a is the 
time preference parameter. By considering different values of y we 
indicate for this family of utility functions how the gains from an¬ 
nuities and family arrangements depend on the specification of tastes. 

In the no-annuities case maximization of (4) subject to (2) leads to 
the consumption plan, (5); 

^ W,(Ray'ypr 


In the case of fair annuities, maximizing (4) subject to (3) leads to 


WoiRaf^ 


( 6 ) 


Figure 1 compares equations (5) and (6) for the case R = a — 1. The 
ability to trade in a fair annuities market may raise or lower initial 
consumption, depending on whether y is less than or greater than 
unity (fig. I). Intuitively, the higher the degree of risk aversion, y, the 
greater the concern for running out of money because of excessive 
longevity and, hence, the lower the initial consumption. At y equal to 
infinity, equation (5) dictates equal consumption in each period. 

Plugging (5) or (6) into (4), we arrive at two indirect utility functions 
for the no-annuity and annuity cases with initial wealth, the interest 
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Consumption 



Kio. 1.—Consumption paths with and without lair annuities 


rate, and survival probabilities as arguments. These functions are 
presented in equadons (7) and (8), respectively. 


Wo(Wo) = 


1 - V 




1/ 

'y' Qf)iyf{Mi-y)iypj/y 


j-o 


(7) 





1TA-’ 




j-o 


( 8 ) 


The increase in utility resulting from access to fair annuities can be 
measured in terms of dollars. Equation (9) solves for the value of M, 
which represents the percentage increment in a single person’s initial 
wealth required, in the absence of an annuity market, to leave him as 
well off as he would be with no additional wealth but with access to an 
annuities market: 


Wo(MH'o) = ygiiVv)- (9) 

For the iso-elastic utility function this calculation is independent of 
the initial level of wealth. Table I reports values of M for different 
ages and levels of risk aversion using both male and female survival 
probabilities. Friend and Blume (1975) estimate the degree of relative 
risk aversion from individual portfolio choices. They conclude that 
risk aversion, on average, exceeds unity. We present our results for 
risk-aversion coefficients of 0,75, 1.25, and 1.75, a range that we feel 
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TABLE 1 


Age 


30 

55 

75 

90 

30 

55 

75 

<J0 

30 

55 

75 

<)0 


PFRCrNTAC.E 


|N( Rf AsE IN Initial Wealth Required to Obtain 
Fair Anni'ities Utiuty Level 


Relative 

Risk Aversion (y' 


.75 

.75 

75 

75 

1.25 

1.25 

1.25 

1.25 

1.75 

1.75 

1.75 

1.75 


Males 


24.5 
46.9 
71 2 
99. H 
30.3 
59.2 
97.0 
152.6 
34.7 
6S 9 
119.1 
199 1 


Females 


18.5 

34.4 

63.0 

100.2 

22.7 

43.4 

85.3 

152.9 

26.1 

50.7 

104.6 

199.4 


Note — I hi<Hi({hout nblc <•* "« 99 <ind W » I 01 . 


encompasses reality. The survival probabilities used in this and all 
subsequent calculations are actuarial estimates from the Social Secu¬ 
rity Aclministration.’ Maximum longevity is taken to be 120 through¬ 
out the paper. 

Table 1 indicates that the utility gain measured in dollars from 
access to an annuities market can be quite large. For a relative risk- 
aversion parameter value of 0.75, the gain to a 55-year-old male is 
equivalent to a 46.90 percent increase in his initial wealth. The utility 
gain is age dependent; for y = 0.75, the 30-year-old male’s gain is 
24.46 percent, while the 90-year-old male’s gain is 99.81 percent. 
Annuities are less important to young people because a large fraction 
of their lifetime utility from consumption is fairly certain due to their 
lower mortality probabilities in the immediate future. Higher levels of 
risk aversion naturally increase the gains from access to an annuities 
market. The male-female differences in the table reflect the higher 
male age-specific mortality rates. The calculation is somewhat sensi¬ 
tive to the choice of a and /?. Raising the interest rate to 5 percent 
while holding a constant increases the age 55 wealth-equivalent factor 
from 46.90 to 55.57 for the case of y = 0.75. The 90-year-old wealth 
equivalent is increased from 99.81 to 115.34. 

'' We u.sc the low mortality male and female probabilities reported on pp. 17 and 19 
of the Social Secunty Administration Actuarial Study no. 62 (see U.S. Department of 
Health, Educauon, and Welfare 1966). 



incomplete annuities market 

Income, Substitution Effects, and Unintended BeqursU, 

Without access lo an annuity market a single, nonaltruislic individual 
will always die prior to consuming all his wealth and, accordingly, will 
make involunury bequests. The level of these unintended bequests 
can be quite large. From equation (5) we calculated the consumption 
path as well as the corresponding wealth path for the no-annuity case. 
By multiplying the probability of dying at each age times the wealth at 
each age and discounting back to the initial age, the present expected 
value of these unintended bequests can be computed. For y = 0.75, R 
= 1.01, and o = .99, the present expected value of unintended 
bequests represents 24.47 percent of initial wealth for a single male 
aged 55. This number means that a 55-year-old male with no annuity 
market will, on average, fail to consume about one-quarter of his 
wealth because he is risk averse. Increasing the risk-aversion 
coefficient to 1.75 raises the ratio of present unintended bequests to 
initial wealth to 0.3583. These large unintended bequests occur de¬ 
spite a fairly rapid rate of consumption. Current mortality prob¬ 
abilities dictate a fairly rapid rate of consumption even for high levels 
of risk aversion. For y = 1.75, a single male who survives to age 85 
consumes at age 85 less than a third of his age 55 consumption level.® 

The homothetic property of the iso-elastic utility function ptermits a 
decomposition of the utility gains from fair annuities into income and 
substitution effects. Suppose a fair insurance company approached a 
single, 55-year-old (y = 0.75) male and offered to pay him 24.47 
percent of his initial wealth in exchange for his naming the insurance 
company as his heir. The single male would take the 24.47 percent 
gain and, because of homolheticity, consume it according to his origi¬ 
nal no-annuity consumption path. This additional wealth would give 
rise to an additional .2447 x .2447 in present expected bequests. By 
letting the insurance company also pay for this second round of 
expected but involuntary bequests as well as further rounds, the 
insurance company ends up paying 32.40 = .2447/(1 - .2447) percent 
of the single individual’s initial wealth. This 32.40 percent figure 
represents the utility gain from the pure income effect. In this 
scenario the individual continues to consume at the no-annuity set of 
prices. Since the total gain from being able to purchase fair annuities 
and thus face lower prices for future consumption is 46.90 percent, 
the income effect represents 69.08 percent and the substitution effect 
30.92 percent of the total gain. Hence, the ability to alter the age 

• The ratio of consumption at age 75 to consumption at age 55 is 0.62. When nsk. 
aversion equals 0.75, the ratio of consumpuon at age 85 to consumption at age 55 
equals 0.06: it is 0.33 at age 75. 
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consumption profile is an important part of the total welfare gain 
from annuities. 

II. The Family as an Incomplete Annuities Market 

Decisions by family members concerning consumption expenditures 
and interfamily transfers may reflect implicit though incomplete an¬ 
nuity contracts. In the case of marriage both individuals commonly 
agree to poo! their resources while both marriage partners are alive 
and to name each other as the major, if not the sole, beneficiary in 
their wills. For each partner the risk of living too long is somewhat 
hedged by the other partner’s potential death; if one partner lives to 
be very old, there is a high probability that his (or her) spouse has 
already died leaving him a bequest to help finance his consumption. 
While each spouse gains simply from the exchange of wills, the two 
can further increase their expected utilities by agreeing on a Joint 
consumption path that takes into account each spouse’s expected 
bequest to the other. The importance of joint consumption planning 
is highlighted in the case of an implicit contract between a parent and 
a child. Here the parent implicitly promises to name the child in his 
will in exchange for the child’s implicit promise to care for the parent 
if the parent lives too long. Although the child may have zero prob¬ 
ability of dying while the parent is still alive, both can gain because the 
child agrees to share consumption resources with the parent. 

I'his view of bequest and consumption arrangements within mar¬ 
riage as an incomplete annuity market becomes intuitive when one 
contemplates increasing the number of members in the family. To 
simplify the issue, let us assume that all individuals within the family 
have identical survival probabilities and that they enter this multiper¬ 
son family with identical resources. In the limit as the family (or 
"tribe”) gets large, the consumption path of an individual within the 
tribe converges to the path a .single individual would choose in a 
complete and actuarially fair annuities market (Kotlikoff and Spivak 
1979). 

Quantitative Analysis of Family Risk Pooling 

In the case of two family members the frontier of efficient marriage 
contracts is obtained as the solution to the following recursive 
dynamic programming problem: 

V,_,(W,_,) = max [u"(C" ,) + ^'(QL,) + aP,„-,(i„-,V',(W,) 

w„c»,/:,'_,?o.t = 7, ,1 (10) 

+ aP„_.(l - Qm-,)f^,iW,) + 0a!i,,_,(l - T,„_,),Sj(W,)]. 
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subject to 


lV,/« + C« , + C?L, = (11) 

where 

~ max m"(C}J) + ^'(Cf). 
cr/:i 

In (10) is the period t maximum-weighted expected utility of 

the two family members with joint wealth W,. In the expression the 
letters H and S denote the two family members, C'l' and are the 
consumptions of the two, u" and «' are their utility functions. Pm-, 
and are their respective period t survival probabilities condi¬ 

tional upon surviving through period t - 1, and H,(W,) and S,(W,) 
are the maximum expected utilities for each member if he or she 
alone survives to period 1. These expressions are obtained from equa¬ 
tion (7) by replacing IVa with W, and applying the appropriate prob¬ 
abilities. The term 8 is the differential weight applied to member S’s 
expected utility. 

The first two terms on the right-hand side of (10) represent utility 
from certain period I - 1 consumption. The third term is the family’s 
expected period I utility multiplied by the probability that both mem¬ 
bers survive to period t. The last two terms represent expected utilities 
when one member dies and the other survives. 

The Appendix presents an algorithm to solve (10). The algorithm 
for solving the three-family-member maximization problem is avail¬ 
able from the authors.' 


The Gains from Family Annuity Contacts 

The solution to (10) permits a comparison of consumption paths and 
utility levels of married people with those of single persons, assuming 
throughout that there is no public annuities market. Both spouses are 
assumed to have identical iso-elastic utility functions in the sense of 
the same degrees of risk aversion and rates of time preference. 

The shape of consumption paths for married couples while they are 
both alive may differ from that of single individuals for two reasons. 
First, even if the two spouses have identical survival probabilities, the 
reduction in risk within the marriage rate acts like a reduction in the 
price of future consumption. If the relative risk-aversion parameter y 
exceeds (is less than) one, the identical survival probabilities marriage 
profile will start above (below) the single person's profile. For y equal 

' In the three-member family we maximize a weighted sum o( the three members’ 
expected utility taking all survival contingencies into account. If one of the three dies 
first, the other two jointly inherit the remaining wealth and consume according to the 
optimal two-person plan. 
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U are identical. In terms of figure 1, the consump- 

ta unity tfie proh between the no-annuity and 

tion profiles for marr ^ different consump- 

compkte anno.ty profiles. I fte se individuals is possi- 

non profiles for earned people Higher su prob- 

bJe differences in spousal survival an old man 

abilities act like lower rates f consump- 

inari'ies a younff woman the slope of tnc p , * l 

r„ proSte rJr„ ,be .„„i.al probabMe. of bo.h .he old ho band 
and the young wife. The two .spouses compromise with respect to the 
rate at which they eat up their joint wealth while they are both alive. 
The old iiusband would prefer to eat up the wealth more rapidly, and 
the young wife would prefer to consume at a slower rate. The formula 
for each spouse’s consumption when married takes both spouses 
survival probabilities into account as well as the relative spousal utility 
weights. To our knowledge empirical studies of consumption and 
savings at the household level have not considered this point—that 
the time preference rate for a household may depend on the age-sex 
composition of the household. 

Table 2 reports the gains from marriage as well as three-person 
polygamy among individuals who have identical survival prob¬ 
abilities and identical initial endowments and who are weighted 
equally in the contract. The marriage and three-person polygamy 
gains are calculated as the percentage increase in a single person's 
initial wealth needed to make him as well off as he would be in the 
marriage or jvolygamous relationship. The table also reports the dol¬ 
lar gain as a fraction of the table 1 total dollar gain from complete and 
fair annuities. Since utility is concave in wealth, the dollar gain from 
these family contracts as a fraction of the dollar gain from fair an¬ 
nuities is smaller than the actual utility gain from these contracts as a 
fraction of the utility gain from fair annuities. Table 2 also reports this 
latter fracijon." 

The figures in table 2 indicate that marriage can offer substantial 
risk-pooling opportunities. Fora SS-ycar-old man using male survival 
probabilities, pooling risk through marriage is equivalent to about a 
20 percent increase in his wealth had he stayed single. The gains from 
marriage increase as one becomes older since the risks incurred are 
much greater as one ages. At age 75 marriage is equivalent to in¬ 
creasing one’s wealth by 30 percent when risk aversion is 1.25. 
Death-risk'pooling through marriage can be quite important even at 
young ages. The table reports gains from 11.7 to 13.6 percent at age 
30 using the male probabilities. 


•This fraction is calculated as ((t + m)'"’’ - Il/((1 + n)'"'’ - IJ, where m is the 
fractional wealth equivalent gain from marriage, and a is the fractional wealth equiva¬ 
lent gain from fair annuities. 
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“--j ::pt: 

X” filon 010 75. marriage aubs..™ea 46.10 pereen. for comptao 
risk asersioi „ - _ a better substitute for fair annuities at 

andfairaoooiUer M"' probability .ba. both 

younger ages becaus - relative to the probability 

spouses will die simultane y 9 addition, there appears to 

,L one rpooae will die ■'■J 

be an interaction etween ag annuities at young ages when risk 

marriage a better substitute for fan annuities at y .® * , 
aversion is low and at old ages when risk aversion is high. 

Over a wide range of ages and parameter values, three people 
appear to he capable of capturing about 60 percent of the gams from 
fair annuities. Whi/e the compfexity of the calculations precluded 


considering a four-per.son arrangement, we can conjecture using 
table 2 how well four people would do together. In the case of a 
55-year-old male with risk aversion of 0.75, adding one marriage 
partner is equivalent to a 20 percent increase in his wealth had he 
remained single. The marginal dollar gain from adding a third per¬ 
son (table 2) is 8.04 percent. If the marginal dollar gain fell at a 
constant rate in this range, the fourth person would add 8.04 x 
(8.04/20.0) = 3.23 percent.®* By adding 3.23 to 28.04, we can roughly 
calculate the extent to which four people can close the utility gap. The 
procedure sugge.sts that four people can substitute by 70 percent for a 
fair annuities market. 

Diminishing returns to risk pooling appear, then, to set in at a fairly 
rapid rate. In this example two people substitute by 46 percent, three 
people by 63 percent, and four people by over 70 percent for full 
insurance. 

Table 3 considers incomplete annuity arrangements between two 
parents and one child and between one parent and two children. In 
both cases we assume equal consumption by all family members but 
permit the initial wealth of the child or children to vary. All individu¬ 
als are a.ssigned the male survival probabilities; the children are age 30 
and the parents age 55. In the case of two parents with one child, if 
the child has an initial wealth of $35,000 and the parents have an 
initial wealth of $20,000, entering into an equal consumption-will¬ 
swapping arrangement is equivalent to a 32 percent increase in wealth 
for each parent and a 10.6 percent increase for each child. For the 
parent this arrangement captures 71.2 percent of the utility gain from 


“ This is probably a lower bound estimate for the contribution of the fourth person; 
the marginal dollar gain cannot fall at a constant 40 percent rate forever, because if it 
did the toul dollar gains would, in the limit, not sum up to 46.9 percent, the full annuity 
gain of table 1. Presumably the marginal dollar gain falls at a decreasing rate, and 3.23 
percent probably underestimates the fourth person's marginal contribution. 
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Gains from In«»mvu.i>, Annv;ity A»iun(;emen rs in the Femh.v 


Two Parents with 
One Child 


Two Children with 
One Parent 


Initial Wealth 
or Each Child 
($) 

Dollar Cain 
to Parent 

(^) 

Dollar Cain 
to Child 

Dollar Cain 
to Parent 

m 

Dollar Gain 
to Child 

{%) 

25,000 

14.4 

34.2 

2.3 

24.8 

30,000 

23.2 

20.4 

16.9 

18.9 

35,000 

32.0 

106 

31.5 

14.6 

40,000 

40.8 

3.2 

46.1 

11.5 


Nont —The lalciiUtioiu assume equal iun3um|>tuMi by all family memliers Initial wraith ol |iareni or {urenu u 
$20,000 W « 1 01. a 99, arkl y = 0 75 


fill! annuities; for the child the arrangement substitutes by 45.4 per¬ 
cent for full annuities. The last two columns of table 3 present the case 
of two children contracting with one parent. When each child con¬ 
tributes $35,000, the gain to the parent is 31.5 percent, while each 
30-year-old child enjoys a 14.6 percent gain relative to consuming as a 
single person. The numerical differences in the table for the two 
different types of families reflect, on the one hand, different mone¬ 
tary contributions of parents relative to children and, on the other 
hand, differences in the rate at which resources are consumed when 
all family members are alive. Resources arc consumed at a slower rate 
in the two-children-one-parentcase than in the one-child-two-parent 
case, since each individual's survival probabilities are given equal 
weight in determining the optimal rate of consumption. 


Is Marrying People of Similar Ages More Efpcientf 

Suppose one had to decide how to pair up four people, two who are 
old and two who are young. Is it more efficient to marry the old 
people together and the young people together than it is to mix ages? 
Intuitively, marrying two 90-year-olds together and two 20-year-olds 
together leaves a large chance that both 90-year-olds will die in the 
immediate future, and resources that they have failed to consume will 
be lost to the 20-year-olds who, on average, will still be alive.'® The 
countervailing argument against mixed-age marriages is that mixing 

We assume here that any involuntary bequests that arise from the simultaneous 
death of both marriage partners or from the death of a surviving spouse are not 
inherited by any of these four individuals. Again the government can be thought of as 
collecting these residual bequests and distributing them each year to the newborn. We 
thank Finis Welch for a helpful discussion on this section. 



g36 JOURNAL OL POUTICAL ECONOMY 

ages involves greater risk to one of the two partners; the utility cost of 
this greater risk may exceed the utility gain from the increase in 
expected resources arising in mixed marriages.'' 

We investigated potential efficiency gains from mixed marriages 
between two 55-year-olds and two 30-year-olds, where each individual 
was risk averse at the 0.75 level. The 55-year-olds were assigned male 
survival probabilities, while the 30-year-olds were assigned female 
survival probabilities. When risk aversion equals 0.75, weights of 1.7 
for the old person yield utility levels for both old and young which 
exceed those in the old-old, young-young marriages of table 2. The 
additional dollar gain to the old person from this weighted marriage 
with the young person is 3.1 percent; the added gain to the young 
person is 1.6 percent. 

These additional gains from mixed-age marriages require, how¬ 
ever, a fairly skewed distribution of consumption within the marriage. 
For this example the young-old weighting scheme that dominates 
old-old, young-young coupling involves the older spouse’s consuming 
about 86 percent more than the younger spouse while they are both 
alive. If it is too costly to negotiate such an arrangement within the 
marriage or if the type of consumption (e.g., housing) within mar¬ 
riage is nonexcludable, then equal consumption marriages of indi¬ 
viduals with similar survival probabilities (of similar ages) will be the 
rule rather than the exception. Of course, we have been discussing 
here marriages in which each spouse has the same initial dowry. The 
old-young marriages can dominate old-old, young-young marriages 
even under an equal consumption arrangement provided the dowry 
of the young spouse sufficiently exceeds that of the old spouse. 


III. Enforcement with and without Altruism 

In the absence of altruism would family members voluntarily main¬ 
tain these implicit contracts as family members age? The answer is 

" To see this consider an old-young marriage in which the young person promises to 
consume less chan the old person in the state of nature in which both spouses survive. 
-Suppose that this promise to the old person of higher consumption in the "both 
survive" state is large enough to exactly compen.sate the old person for the loss in 
expected utility from the state in which his spouse dies but he survives. The old person’s 
expected utility from this latter "bequest’’ .state is lower when he marries someone 
young, rather than someone old, because the probability of the young person actually 
dying IS smaller. While the old person is by assumption no worse off in this compen¬ 
sated old-young marriage, the young person could be worse off than if he had married 
someone young. By entering into the compensated old-young marriage, the young 
person reduces his payoff in the both survive state while leaving the payoff in the 
bequest state unchanged. He also increases the probability of the bequest state and 
lowers the probability of the both survive state. Although cxfiected consumption for the 
young spouse rises, the spreading of the payoffs may lower expected utility, depending 
on the young spouse’s degree of risk aversion. 



INCOMPLETE ANNUITIES MARKET 387 

that there always are ways of structuring payments to individuals 
within the family so that each individual at each moment in time has a 
selfish interest in maintaining the original implicit contract. An equal 
consumption marriage contract between two individuals with the 
same survival probabilities and the same initial endowment is a good 
first example. If each spouse maintains control over his own wealth 
while both spouses are alive and consumes at the same rate as the 
other spouse, then each will separately have an incentive to continue 
the contract at each point in time. A similar type of individual control 
can be maintained in family arrangements; rather than have the 
parents use up all their resources before the children begin con¬ 
tributing to their support, the children can contribute each period in 
return for that period’s expected parental bequest. This scenario of 
parents’ maintaining control over their wealth until the very end, as 
enforcement leverage over their children, may partly explain the 
limited use of gifts as a tax-saving intergenerational transfer device. 

The proof of this proposition is immediate from equation (10). 
Given their initial endowments at time f — 1, IV{L, and IV-JL,, family 
members choose a value 0* such that the contract to consume the 
contingent plan (<)*),C?(0*),.. . ,C?,m,CUe*)] 

is in the core at time < - 1. The consumption plan at time t 

Cf(0*) .represents the period I Pareto-efficient 

contract corresponding to the initially chosen utility weight 6*. This 
plan is in the core for a set S, of individual endowments of family 
members in period I, and W-^, which satisfy + W-^ = fV,. To 
insure that the initial contract remains a core allocation for each 
family member, side payments are made at time t - 1 when con¬ 
sumption in period t — 1 occurs. The side payments leave the period t 
individual endowments in set S,. Since the initial contract [C," ,(0*), 
Cf(6*), .. . , is in the core, each selfish 

family member will have a personal incentive to make or accept these 
side payments. 

There are two additional questions of enforcement to consider. 
One problem is that a spouse may covertly name a third party as 
beneficiary in his will in exchange for the same commitment by the 
third party or in exchange for a particular service. A second type of 
cheating may occur when one or both spouses covertly consume in 
excess of the consumption levels dictated by an optimal implicit mar¬ 
riage contract; while each spouse may correctly believe that he or she 
is the beneficiary in the other spouse’s will, each may try to take 
advantage of the other by increasing his own consumption and thus 
reducing the potential bequest available to the other spouse. 

These two types of cheating will be more problematic for implicit 
incomplete annuity agreements between friends or relatives who are 
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physically separated. The consumption cheating scenario can be 
modeled as a Nash equilibrium in which each partner chooses his 
consumption path by taking the other partner’s consumption path 
and potential bequest path as given. Resources are consumed at a 
faster rate in the Nash equilibrium as each partner fails to consider 
how his consumption will diminish his expected bequest and thus the 
expected utility of his partner. 

Using male survival probabilities we calculated for two 55-year-olds 
the dollar equivalent utility gain from engaging in a Nash 
consumption-cheating partnership. The gains in the Nash equilib¬ 
rium proved to be almost identical to those in the more efficient 
marriage contract. For levels of risk aversion of 0.75, 1.25, and 1.75, 
the percentage dollar increments are, respectively, 19.9, 22.2, and 
23.5. While the rate of consumption is faster in the Nash equilibrium, 
it is not much faster than in the marriage contract. Intuitively cheat¬ 
ing by overconsuming is fine provided one’s partner actually dies; but 
if one’s partner survives, the early excessive consumption will require 
relative deprivation later on. Apparently this latter consideration 
dominates the former, leaving utility in the cheating equilibrium at 
essentially the same level as under a marriage contract. These exam¬ 
ples suggest that consumption cheating does not represent a substan¬ 
tial impediment to consumption-risk-sharing arrangements. 

Another means of enforcing these implicit contracts is simply al¬ 
truism. All of our calculations have involved maximizing a weighted 
sum of individual family members’ utilities. If, however, each family 
member is altruistic toward each other and each weights each family 
member’s utility from consumption in the same way, then all family 
members would unanimously agree on the utility maximand. The 
calculations we have presented tan, therefore, be thought of as re¬ 
sulting from the maximization of an agreed-upon altruistic family 
utility function. Since all family members agree on the maximand, 
there is no problem of enforcement. 

IV. Summary and Conclusion 

This paper has demonstrated that consumption and bequest-sharing 
arrangements within marriage and larger families can substitute to a 
large extent for complete and fair annuity markets. In the absence of 
such public markets, individuals have strong economic incentives to 
establish relationships which provide risk-mitigating opportunities. 
Within marriages and families there is a degree of trust, information, 
and love which aids in the enforcement of risk-sharing agreements. 
Our calculations indicate that pooling the risk of death can be an 
important economic incentive for family formation; the paper also 
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suggests that the current instability in family arrangements may, to 
some extent, reflect recent growth in pension and social security 
public annuities. The methodological approach of this paper can be 
applied to the study of family insurance against other types of risks. 
Of chief interest are those types of risks that are handled very poorly 
by anonymous public markets. Disability insurance and insurance 
against earnings losses are good examples. 

Our approach has been to compare family insurance with perfect 
insurance. It would seem worthwhile to compare family insurance 
with public market insurance where the market insurance is subject to 
adverse selection and moral hazard problems and family insurance is 
not. Realistic specification of the degree of adverse selection and 
moral hazard may indicate that family insurance dominates public 
market insurance even in small families. 

Finally, the paper suggests the empirical difficulty of determining 
whether intergenerational transfers reflect altruism or simply risk- 
mitigating arrangements of essentially selfish individuals in the ab¬ 
sence of perfect insurance markets. Distinguishing between the selfish 
and altruistic models is fundamental to a number of major economic 
questions, including the impact of the social security system on na¬ 
tional saving and the effectiveness of fiscal policy.'* 


Appendix 

Computational Algorithm for the Two-Family-Members Dynamic 
Risk-pooling Problem 

This Appendix indicates the algorithm used to .solve the two-family-members 
dynamic programming problem, copied here as equation (.M). The al- 
goiithin for the case of three (amily memlicrs is similar to that for two 
mcmheis and is available fiom the authors. While we consider the iso-elastic 
family of utility functions, our algorithm can l)c applied to any homothelic 
utility function. 

.I 

+ a/*,,,.,(I “ - P,„-,).V,{IV’,)], 

subject to 

W,Hi + , + (.V_, = IV,,,. (A2) 

Again, the letters H and .S' correspond to the two family members with 
respective conditional survival probabilitiesand fhe expiession 

W, is Joint family wealth, 0 is the weighting factor, and //,(IV,) and .S,(IV,) are 
the expected utility levels for each family member if he alone survives 10 
peiiod t. 


'* See Barro 1974. 
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and are found recuisively starting at period T" and 
We demonstrate that nttty be written in the 


where v, is a constant. 
I,y 




w;-’' 

I -y 


(A3) 


We also show that total family consumption, C,. is given 


r;,_, 


w,-, 


__ 

t4'’' + (aK,n)'i'R^' ’ 


(A4) 


where K, i.s another constatit. (liven total family consumption, consumption of 
the two memheis is 


r« = rs = (■ 0''^ 

I + <?■'”' '''* I + er 


(A5) 


We demonstiate that K, is a function of j», and that v,^, is a function of 
•Staiung then at the initial value tor A',, we can compute tv! t'r in turn 
gives A\, which in turn gives I'j- 1 - Proceeding in this fashion to rreriod zero we 
compute the entire sequence of v,’s and AC,’s. These values can then be used in 
equation (A4) to compute the ratio of consumption to wealth at each period. 
These ratios together with an initial level of wealth plus (A2) and (A5) 
generate the optimal consumption path. The homotheticity of the utility 
function permits us to calculate recursively the shape of the consumption 
path independently of the initial level of wealth. 

Slartmg with period T the maximization problem tor equation (Al) i.s 


I'AfPr) = max —!— (€«)'-’■ + e —— (C);)'-'' 

1 - y 1 ' y 

s.t. 6? + Q « tVr. djl, (:)■ » 0. 

Solving this maximization and computing the indirect utility function for P,-, 
wc have 


yAWr) = Vr 


Wl'’', where I'r = (1 + 


r:? = tVr 




I + 0 '''’ ' ' I + 0'" 

For I < T, (Al) for the iso-elastic case is written as 
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In going from (Al) to (A8) we use the fact that H,(W,) = 1 - y)] and 

~ ~ y)] for the iso-elastic utility function. The values for A, 

and s, are implicitly defined as the bracketed term in equation (7) in the text 
with J = 0 corresponding to time t and with each family meml>er’s survival 
probabilities from time t substituting for Pj. 

It is easy to see from (A8) that for given total family consumption, C„ Cl' 
and Cl will always satisfy (A5). Hence we may rewrite (A8) as 


= max tv Q-> (A9) 

'i-i 1 - y 


wr 

P. Qi 

u + P‘ l 

^1 — ^ -1- (I - 

\ ^ J 

+ a 

1 ~ y 

. Pt~i Qi- 1 

^ p,-, 1 

t 0,-1; 1 p,-, 

/ 0,-1 'J 


Denoting the term in brackets by k, we now have 


V,., (IV,.,) = maxt/r—i—C;.7 + a—^WI'^K, (AlO) 

t;,., I ~ y I ~ y 

s.t. C,., += W,.,. 

Maximizing (AlO) and computing the indirect utility functions yields 


V,., = + (a/^/f,)''’'fl-•]^ 


C,-, = IV,. 
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vfj'’ + (ak,/i ym 
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An Economic Theory of Self-Control 
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The coiKcpt of self-tontrol is imorporated in a theory of individual 
intertemporal thoice by modeling the individual as an organization. 
The individual at a point in time is assumed to be both a farsighted 
planner and a myopic rffier. The resulting conflict is seen to lx? funda¬ 
mentally similar to the agency conflict between the owners and 
managers of a firm. Both individuals and firms use the .same lech- 
nitities to mitigate the problems vshk.b the conflicts create. This 
paper stresses the implications of this agency model and discusses as 
applications the effect of pensions on saving, saving and the timing 
of income flows, and individual discount rales. 


For many years Chri.stmas clubs paid no interest. Members deposited 
money each week but could only withdraw the money cjn December 1. 
The clubs were very popular, although they seemed tt) be dominated 
by simply depejsiting money in an interest-bearing savings account. 

Passbook loans do still exist. I'hese loans allow an individual with 
$5,000 in a savings account earning 5 percent interest to borrow at 9 
percent using the balance as collateral, instead of at 10 percent with 

The authorship of this pajjer was fully collaborative. The order of the names on this 
paper and its companion (Shefrin and Thaler 1980) corresponds to our division of 
lahior. This paper was written while Thaler was a visiting scholar at the National Bureau 
of Economic Research at Stanford, California. While there he received financial sup¬ 
port from the Kaiser Family Foundation and the Robert Wood Johnson Foundation. 
Helpful comments on earlier drafts were provided by George Ainslee, James Bucha¬ 
nan, Tom Russell, and numerous other iriends and colleagues. We thank them all. 


[Journal of Polttual hronomy, IM8I. v«)t 09. n<» 2) 

© 1901 I'he Univnvt)' of Chicago. OO22-38OH/0)/89U2.OOU8^I 30 


39a 



THEORY OF SELF-CONTROL 


393 


no collateral. Obviously the individual could simply withdraw the 
money from his savings account at an (opportunity) cost of only 5 
percent. 

Smoking clinics are a new and thriving business. A smoking clinic 
will help people who want to stop smoking—for a fee of several 
hundred dollars. 

What does economic theory have to say about these institutions? 
George J. Stigler provides the following analysis: 

One can of course explain the participation in a Christmas 
fund by introducing another item of preference: a desire of 
people to protect themselves against a future lack of 
willpower. ... If we stopped the analysis with this explana¬ 
tion, we would turn utility into a tautology: a reason, we 
would be saying, can always be found for whatever we ob¬ 
serve a man to do. In order to preserve the predictive power 
of theotility theory, we must continue our Christntas fund 
analysis as follows. The foregone cost of putting money in a 
C'hristmas fund is the interest one could earn by putting the 
same money in a .savings account. If interest rates on savings 
accounts rise, the cost of buying protection against a loss of 
willpower rises and less of it ought to be bought. . . . (Stigler 
1966, p. 571 

We agree with the remarks above and therefore offer a model that 
says more about these institutions than the fact that the demand for 
their services will be negatively related U) price. We do so by propos¬ 
ing a simple extension of orthodox models, using orthodox tools, that 
permits such behavior to be viewed as rational. This rationalization is 
ba.sed on an analysis of the technology of self-control using the theory 
of agency rather than reliance on ad hoc explanations in which trans¬ 
action costs, taxes, and income effects play a major role. Our model 
can predict, in a nontautologtcal way, the circumstances in which 
these kinds of behavior will be observed. In particular, our new theory 
of iniertempzoral choice has important implications for theories of 
saving behavior. In the last section of the pajrer we discuss some of 
these implications and offer empirical evidence in supptort of our 
ideas. 


I. The Model 

The idea of self-control is paradoxical unless it is as¬ 
sumed that the psyche contains more than one energy 
system, and that these energy systems have some degree 
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of independence from each other. [Donald McIntosh 
1969] 

Why individuals would impose constraints on their future behavior is 
a problem that has received attention from economists since Strotz’s 
(1955-56) classic paper.* Strotz and those who have followed him 
(Poliak 1968; Blackorby et al. 1973; Peleg and Yaari 1973; Hammond 
1976; and Yaari 1977) have analyzed the phenomenon as one of 
changing tastes. In Strotz’s formulation a conflict occurs between to¬ 
day’s preferences and tomorrow’s preferences if the discount func¬ 
tion used today is not exponential with a constant exponent. 

Our framework differs from the changing-tastes literature in that 
we model man as having two sets of preferences that are in conflict at 
a single point in time. This idea is certainly not new. Adam Smith used 
a two-self model much like ours in his Theory of Moral Sentiments 
(1759). More recently Scheiling (1960, 1978) and Buchanan (1975) 
have recognized the importance of simultaneous conflict in under¬ 
standing self-control problems. Outside economics the idea is com¬ 
monplace, with the writings of Freud (1958) and Berlin (1969) de¬ 
serving special mention. 

Nonetheless, to the best of our knowledge our work is the first 
systematic, formal treatment of a two-self economic man. We have 
adopted a two-self model because, as McIntosh says above, the notion 
of self-control is paradoxical without it. Furthermore, we utilize an 
organizational analogy that leads to both insights into human behav¬ 
ior and a rich explanatory model. We will briefly describe the model 
here. We do so in order to make explicit the nature of our two-self 
conceptualization. The model also leads to specific predictions about 
behavior, which are discussed in Section 111 below. Further details are 
available in our more formal companion paper (Shefrin and Thaler 
1980). 

Our model is cast in discrete time. Consider an individual with a 

fixed income .stream y = (y,, yz .yr). Think of period T as 

retirement and let = 0. Let the individual choose a nonnegative 
level of consumption f, int. Callc = (ci,C 2 > ■■■ ,Cf) a consumption plan. 
The conflict between short-run and long-run preferences is intro¬ 
duced by viewing the individual as an organization. At any point in 
time the organization consists of a planner and a doer.^ The planner is 
concerned with lifetime utility, while the doer exists only for one 
period and is completely selfish, or myopic. The period t doer is as¬ 
sumed to have direct control over pteriod I consumption rate c,. The 


' Two im[)oriant contributions by noneconomists are Ainsicc (1975) and Elster 
(1977). 

' These terms ori);inated in an early draft of Thaler’s (1980). 
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doer’s utility function is given by Z,<-); 2, is taken to be independent 
of all components of c except Cj.® Furthermore, suppose initially that 
Z| is strictly increasing and concave in c,. 

In our model the planner does not actually consume but, rather, de¬ 
rives utility from the consumption of the doers. Therefore the plan¬ 
ner’s utility function is given by 1^(2,, Z,,.. ., Zj). Observe that a plan 
which maximizes I' subject to the present value budget constraint, if., 
c, se = Y, is considered optimal from the planner’s point of view. 

However, without some method to control the doer’s actions, this plan 
cannot be implemented. Indeed, under the assumptions above, the 
actual consumption stream chosen would have total lifetime income 
consumed during the first period, when the period-one doer would 
borrow F — y, on the “perfect”capital market. In order to prevent this 
from occurring, the planner requires some psychic technology capa¬ 
ble of affecting the doer’s behavior. Two main techniques are avail¬ 
able for this: (l)The doer can be givenin which case either his 
preferences must be modified or hism«n/n'« must be altered, or (2) the 
doer’s set of choices may instead be limited by imposing rules that 
change the constraints the doer faces. 

We begin by analyzing the case in which no rules are used. We refer 
to this case as pure discretion. While we do not believe that the pure 
discretion case is empirically important (most people appear to use at 
least some kinds of rules), it provides a useful foundation for our 
model, to which rules are easily added. Furthermore, since this case 
corresponds closely to that usually considered in economics, it high¬ 
lights the differences between our model and the standard frame¬ 
work. 

Recall thatZ,(-) was assumed to be unbounded. We now specify that 
Z, depends on a preference modification parameter 9, selected by the 
planner. The choice of 0, allows the planner to alter Z, such that it 
possesses an internal maximum. By appropriately selecting $„ any 
desired c, may be obtained; however, the lower the desired c, is, the 
more modification will be required. Furthermore, dZ,/d0, is negative; 
that is, modification reduces short-run utility and is therefore costly. 
Finally, we assume that the marginal cost of modification increases 
with $. Thus successive reductions in c, require increasing reductions 
in Z,. 

If the planner could exercise direct control over the choice of a 
consumption plan, 6, would be set equal to zero for all / (because 
modification is cosdy) and c would be selected to maiximize V subject to 
the budget constraint. Since under pure discretion this is assumed to 

* The extreme assumption that doers do not care at all about past or future doers is 
adopted just for expositional simplkicy. Other arguments of Z, could easily be added. 
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be infeasible, the planner must instead choose 0 = (0,, 02 , , dr) 

to maximize V. (The solution to the problem is described in Shefrin 
and Thaler [1980].) Essentially, modification is increased until the 
marginal utility derived from additional consumption in retirement 
(period T) equals the marginal loss in utility in earlier periods due 
to modification. 

In the more general case when both rules and preference modifica¬ 
tion are permitted, the planner may also alter the budget constraint 
facing each doer. This allows the planner to reduce c, without incur¬ 
ring modification costs. However, since available rules are imperfect 
(see next section), the planner will have to trade off modification costs 
with the opportunity costs associated with using second-best-type 
rules. Rules are formally incorporated into the planner-doer model in 
Shefrin and Thaler (1980). 

II. Techniques to Reduce Conflicts in Individuals 
and Organizations 

We have characterized self-control as an internal conflict resembling 
the principal-agent conflict between the owner and manager of a firm 
(see Ross 1973; Jensen and Meckling 1976). In this section we de¬ 
scribe the actual technicjues used by individuals and firms to mitigate 
these conflicts. The techniques fall into the two categories highlighted 
iti the model: rules and incentic'es. We provide many illustrations of 
the methods individuals use because these are in essence part of our 
model. When individuals use rules it is impossible to characterize their 
behavior sitnply with first-order conditions. The limits on the kinds of 
rules which individtials will find feasible lead to the specific predic¬ 
tions about saving behavior discus.sed in Section III. In this section we 
also compare the techniques individuals use with those used in firms, 
'fhe close correspondence we find lends intuitive support to our 
principal-agent model. 

A. Metknd-s to Alter lncentive\ 

Individuals u.se three basic technicjues tct alter the doer’s incentives. 
First, the doer's preferences can be modified directly. Some individu¬ 
als consider saving a good in and of itself.'' In this case doer myopia 
does not inhibit saving. Second, inputs to a saving or dieting program 
can be explicitly monitored via weekly budgets or calorie counting 


^ This idea hd.s been suggested by Scitovsky (1976). The importance of norms in 
controlling individual behavior is ako strcs.sed heavily by Adam .Smith (I7)r9, p. .S26) 
and by Irving Fisher (1930). 
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(customers of diet clinics and credit counselors are advised to do this). 
Simply keeping track seems to act as a tax on any behavior which the 
planner views as deviant. Third, incentives can be explicitly altered; 
Alcoholics take the drug Antabuse which makes them ill if they take a 
drink; academics agree to give a paper at a conference to provide 
a proximate incentive to write it. 

Firms use the same three methods. First, profit-sharing plans are 
quite popular even though they offer only trivial financial incentives 
to all but the highest executives. We believe firms adopt them because 
they help create an atmosphere in which the employees’ preferences 
are more similar to those of the owner. Second, firms monitor de¬ 
partmental inputs through cost accounting and then tie comp>ensation 
to these input measures. Third, departmental profits are measured 
and used as performance measures for managers. 

B. Methods to Alter Opportunities: Rules 

If the costs of monitoring and persuasion are high, individuals will 
resort to rules that restrict the doer’s opportunities. In the extreme, 
all doer discretion can be eliminated using what Strotz referred to as 
the strategy of precommitment. Such behavior is rational in our 
model if the rule can approximate the choices that the planner would 
select. Market precommitment institutions are observed, as we would 
predict, in such areas as saving and dieting. For example, people pay 
to go to “fat farms" which essentially are resorts that promise not to 
feed their customers. 

Less extreme rules can limit the range of doer discretion, usually 
through the use of self-imposed rules of thumb. In the savings con¬ 
text several such rules appear to be commonly used. These rules alter 
the budget constraint faced by the doer in much the same way as 
credit limits imposed by lenders do.^ A simple first departure from 
pure discretion is a ban on borrowing, the so-called debt ethic. A 
somewhat weaker rule which seems common is to prohibit borrowing 
except for specific purchases, like houses and automobiles. Another 
rule ol thumb is a prohibition on dissaving combined with limits on 
borrowing. Using this rule of thumb, a person might borrow and lend 
simultaneously in spite of a substantial difference in the interest rates, 
as in the case of the passbook loan. The loan allows him to transfer 
consumption across time periods while it provides a regimented re¬ 
payment scheme. 


‘ While it is difficult to diKUment the extent to which these precise rules are used, 85 
percent of Cagan’s (1965) sample of Consumers I'nion members re|)orted using one of 
these or similar rules to determine monthly saving. 
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Rules can also eliminate discretion over a specific class of decisions 
for which the conflict is particularly acute. Dieters try not to keep 
cheesecake in the refrigerator and will refuse invitations to lavish 
dinner parties; problem gamblers avoid Las Vegas. Also, many smok¬ 
ers pay more for their cigarettes by buying them by the pack instead 
of the carton—it helps enforce a self-imposed ration such as one pack 
a day. 

Again, the same types of rules are observed in organizations.* 
“Pure" rules are observed most often in bureaucratic organiza¬ 
tions because the costs of monitoring output are so high. Rules that 
limit managerial discretion over a particular range are frequently in 
the form of guidelines (e.g., a plant manager can adapt any invest¬ 
ment that exceeds some stated rate of return). Similarly, rules may 
prohibit discretion over a specific class of decisions; loan officers, for 
example, might need approval for loans to relatives or friends. 

We wish to make three other points about internal rules of thumb. 
First, it is useful to consider these rules as learned as much as 
chosen. Rules like the debt ethic are learned from parents and other 
models, which suggests that there will be differences in the use of 
rules depending on social class, education, and age. Second, rules of 
thumb are likely to become habits. By establishing a routine, the doer 
decision process can be avoided. Third, to the extent that the rules do 
become habits, there will be rigidities built into the individual’s be¬ 
havior.^ The implications of these observations are discussed in the 
next section. 

III. Implications 

We now turn to a discussion of the implications of the planner-doer 
model. What predictions about behavior can be made with our model 
that are inconsistent with the standard model? Some of these predic¬ 
tions are obvious. We predict that people will rationally choose to 
impose constraints on their own behavior. Furthermore, we predict 
that such precommitments will occur primarily for those goods whose 
benefits and costs occur at different dates. We present here some less 
obvious predictions based on our model. 

Because our framework is richer than the standard theory, vari- 

‘ for a discussion of firms imposing constraints on their future financial policies, see 
Myers (1977). 

’ flabits can be formally introduced in two stages. At first stage 2, can also be made a 
function of 0„ i < I, to reflect the fact that self-control at the early dates renders 
self-control at later dates less costly. At the second stage, Z, can also be parameterized 
on the adopted rule. In this case the function Z, could exhibit an internal maximum inc, 
when 9, = 0. It is interesting to note that modification would now be required in order 
to break the rule. This would tend to explain miserliness, for instance. 



THEORY or SELF-CONTROL 


399 


ables that the standard theory treats as irrelevant differences in form 
we model as differences in substance. In Sections IIIA and IIIB we 
consider how differences in the form of payment (holding the level 
constant) affect saving decisions, and in Section IIIC we present 
variables other than borrowing and lending rates that determine 
individual marginal rates of time preference. 

A. Pensions and Saving 

Consider two identical individuals with the same total income and 
wealth. Assume that they both save some fraction s of their income. 
Now give one a mandatory pension plan that forces him to savep < t. 
What will happen to total saving? Though it is difficult to get a specific 
prediction from the standard model (see Feldstein 1977), a first-order 
prediction would be that total saving is unaffected. Other forms of 
saving should fall by the approximate amount of the pension. 

Our model has a different prediction: The pension plan produces 
saving at no psychic cost. Modification costs occur only when saving is 
voluntarily withheld. This is what we call “discretionary saving." Thus 
since the marginal cost of saving is lower at the old saving level, we 
expect total saving to go up. (In other words, the offset in other saving 
will be less than the size of the pension.) 

Furthermore, in two cases the offset will be essentially zero: (1) The 
individual uses a saving rule such as “save s percent of disposable 
income" which is not changed when the pension is introduced (total 
saving increases by (1 - s]py,), and (2) the individual uses discretion 
but treats saving as a good for its own sake rather than as a transfer to 
future consumption. In this case discretionary saving and retirement 
saving are not perfect substitutes as in the standard model; in fact, 
their cross-elasticities of demand could be zero." 

The effect of pensions on saving has been investigated with indi¬ 
vidual data by Cagan (1965), Katona (1965), and Munnell (1974, 
1976). All obtained similar results. Cagan used a sample of Consum¬ 
ers Union members. He found that for those members with a man¬ 
datory pension plan, other saving actually was higher than for those 
without pensions (see Cagan 1965, p. 21). Munnell (1974) replicated 
Cagan's study using the same data source. She used a different mea¬ 
sure of saving, replaced before-tax income with after-tax income, and 
restricted her analysis to a subset of observations she thought to be 
more reliable. She then regressed the nonpension saving to income 
ratio on several variables including a pension dummy. Her basic result 

• A fornuii treatment of the effect of pensions on saving appears in Shefrin and 
Thaler (1980). 
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was that the pension had no effect on other saving (i.e., a zero offset). 
The coefficient of the pension dummy was never significant. Its high¬ 
est/ value occurred for the 55-65 age group (/ = 1.2), for which those 
with pensions saved 3 percent less than those without ptensions. 
Though other explanations (such as selectivity bias) have been offered 
for these re,sults, we find that they lend support to our model. 

B. Saving and the Timing of Income Flows 

The importance of current disposable income (as opposed to perma¬ 
nent income) yields another prediction from our model that dif fers 
from the standard theory. Consider two identical individuals, S and 
B; S receives a salary of $12,000 per year paid in 12 monthly install¬ 
ments of $1,000, while B receives a salary of $10,000 per year paid in 
monthly installments plus a guaranteed bonus of $2,000 paid in 
March each year. Standard theories of .saving behavior would predict 
that these two individuals would save the same amount. Our model 
predicts that, on average, B will save more. 

Although we know of no test of this hyf)othesis,“ there is one bit of 
circumstantial evidence. In Japan, where there is a very high saving 
rate, bonus schemes are quite common. We think this is no coinci¬ 
dence. A test would be possible, given the right data. We predict that 
indi\iduals who are paid a portion of their salary via a lump-sum 
bonus will have higher saving rates than those who receive their 
compensation in a smooth pattern. How does this follow from the 
model? 

We have characterized saving behavior primarily as a set of self- 
imposed rules of thumb and externally enforced .saving plans. For an 
individual like B, those rules and plans will be based on his regular 
monthly income. Contributions to pension plans, payments on whole 
life insurance policies, mortgage payments, and so forth must be made 
on a regular basis. Furthermore, most individuals prefer to have their 
monthly inflows and outflows roughly balance.'” For B to act like S, he 
could deposit the bonus in the bank and draw it down gradually 
during the year (as if his salary were $12,000), or he could borrow the 
$2,000 over the course of the year and repay the loan when the bonus 
is paid. However, we feel that neither of these behavior patterns is 
likely to be widely observed. Notice that they violate either the ban on 
borrowing or the ban on dissaving. To the extent that these rules of 
thumb are used, imitating the behavior of S will be difficult. We 

* However, a related issue is investigated by Landsberger (1966). 

This explains why many teachers sacrifice interest by electing to receive their 
academic-year salary paid in 12 monthly installments (Septeml)cr-Augiist) rather than 
10 (September-June). 
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believe that for the typical individual much of the bonus will end up 
being saved, especially through the purchase of durable goods. This 
amounts to the use of an auxiliary rule regarding the disposition of 
bonuses received. We expect that total saving for B will exceed that 
for S because of the technology of self-control. By paying the bonus 
the firm is acting as an external self-control device (much like parents 
who tell children that money gifts at Christmas must go into a savings 
account). Temptations to spend during the year will be overcome 
because the smaller monthly salary will make them seem beyond the 
individual's means. This technology seems to have been recognized by 
the millions of taxpayers each year who claim too few exemptions in 
order to assure a tax refund." Obviously some self-deception (doer 
deception) is necessary for this device to work, but doers can appar¬ 
ently be deceived quite easily. How else can one explain the not 
uncommon practice of knowingly setting one’s watch a few minutes 
ahead (“in order to get to places on time’’).^ 

As the analysis in this section suggests, the shape of the income 
stream will af fect the type of saving strategy adopted. Those individu¬ 
als with variable and uncertain incomes will find a discretionary rule 
such as saving ,s percent each month difficult to enforce in the low- 
incorne months. Without a mandatory saving plan, they would have to 
adopt some more complex strategy to save effectively. Similarly, those 
individuals whose incomes are expected to decline (such as profes¬ 
sional athletes) would prefer to save a large proportion of their high 
current incomes and a smaller (perhaps negative) proportion of their 
lower future incomes. For both, a mandatory pension plan is particu¬ 
larly likely to increase total saving. A more sophisticated analysis of 
the effect of pensions on saving might detect differences of this sort. 

The case of athletes points up the extreme dif ferences in behavior 
that self-control can produce. Their declining income stream creates a 
difficult self-control problem in the high-income years. Some athletes 
hire agents to invest their incomes and limit their current spending, 
and many of them become rich. Others rely on discretionary strate¬ 
gies and end up bankrupt. Both types of behavior are possible in our 
model, depending on the degree of planner control and the types 
of precommitment strategies available. The best predictors of which 
individuals will fall into which groups are probably related to family 
background, since the family is the most likely place for the individual 
to learn (or not learn) the rules and norms necessary to overcome the 
self-control problems," 

“ E.g., in 1969, 55 million taxpayers received refunds while IH million owed taxes. In 
dollars, overpayments exceeded underpayments by 39 percent. 

The last two paragraphs were prompted by a suggestion from Sam Pellzman. 
Irving Fisher also discussed some of these issues. He notes that some individuals spend 
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C. Individual Marginal Rates of Time Preference 

The orthodox theory of intertemporal choice as formulated by Irving 
Fisher produces a very strong result. Each person should equate his 
marginal rate of time preference (MRTP) with the relevant after-tax 
interest rate.'^ Thus the theory predicts that all individuals who face 
the same after-tax interest rate will make the same marginal inter¬ 
temporal choices; Specifically, they will act as if they u.sed the after-tax 
interest rate as their discount rate. This follows because individuals 
are assumed to use capital markets to arbitrage away any difference 
between what Fisher called their “rate of impatience” and the interest 
rate. It should be noted, however, that this result can fail to hold if 
capital markets impiose quantity constraints on borrowers (capital 
rationing). In this case borrowers may be forced to stop borrowing 
even though their rate of impatience exceeds the interest rate. 

In exactly the same fashion, self-imposed borrowing constraints 
such as those di.sciissed in Section IIB prevent the complete internal 
arbitrage from taking place. Thus in our model, in which such con¬ 
straints play an important role, the presumption that individual 
MRTPs will equal the interest rate no longer holds. Indeed, we expect 
to observe behavior that implies an MRTP greater than the interest 
rate and at the same lime an unwillingness to engage in additional 
borrowing. Two points need to be raised about this implication. First, 
once rules are incorporated into the analysis, the failure to equate the 
.MR IT to the interest rate may not violate an optimality condition. If 
t itles are used it is because they lead to higher levels of utility than 
pure discretion would. The inequality created is costly, but the cost 
arises from the necessity of using a second-best technology. Because 
rules by nature must be simple, rules that select the precisely correct 
consumption bundle in every situation are infeasible. Second, notice 
that if a quantity constraint is binding, whether internally or exter¬ 
nally imposed, observed MRTPs will be equal to or greater than the 
interest rate. 

Attempts to measure individual MRTPs appear in studies by Kurz, 
Spiegelman, and West (1973) and Hausman (1979). Hausman studied 
families’ purchases of room air conditioners. The trade-off between 
initial outlay and operating costs permitted him to estimate implicit 
MRTPs. The mean MRTP in his sample was about 25 percent, clearly 
above any relevant interest rate. Kurz et al. obtained similar results 


therr weekly paycheck at the “grog house.” (Others, we believe, avoid the grog house 
precisely on those days.) He attributes much of the observed differences in behavior to 
social class. Our difterences with Fisher are discussed in the next section. 

We define the MRTP to be the marginal rate of substitution between tomorrow's 
consumption and today's consumption minus one. 



THEORY or SELF-CONTROL 


403 


by asking hypothetical questions of participants in the Seattle and 
Denver Income Maintenance Experiments. They asked a sample of 
participants a series of questions of the follotving sort: What size 
bonus would you demand today rather than collect a bonus of $100 in 
1 year? Several different forms of this type of question were asked, 
and the results were striking. For whites the mean rate of time 
preference implied by their answers varied between 36 and 76 per¬ 
cent. For blacks the rates varied between 40 and 122 percent. Of 
particular interest for present purposes is the fact that this sample 
included only those respondents who said they could borrow either 
$500 to make an installment purchase or $1,000 in cash. Further¬ 
more, 81.3 percent of this subsample reported that they would not 
borrow $1,000 at current interest rates. (The mean perceived rate was 
generally less than 20 percent.) This strongly suggests the use of 
self-imposed borrowing constraints. 

Once individual differences in MRTPs are anticipated, it becomes 
interesting to ask what factors determine those differences. A detailed 
examination is beyond the scope of this paper, but the model predicts 
that those factors determining individual rates of impatience will (for 
the reasons stated above) also affect observed MRTPs. These factors 
are discussed at length by Fisher (who draws on the writings of John 
Rae and Eugen Bohm-Bawerk). Fisher believed that age, income, and 
marital status affect the rate of impatience (in obvious directions). He 
felt that the shape of the income stream as well as the level was 
important; if income were expected to rise, the rate of impatience 
would be higher. Six “personal characteristics” were also deemed to 
be important: foresight, self-control, habit, expectation of life, con¬ 
cern for the lives of other persons, and fashion.'^ 

In principle, the planner-doer model can be tested against the 
standard model quite simply. Observe some intertemporal choice that 
implies a discount rate (such as Hausman’s study of air conditioners). 
Then regress the implied discount rate on the factors above and the 
individual’s borrowing rate. The standard framework implies that 
only the borrowing rate will be a significant predictor. Our model 
implies that the other factors will also be important. Hausman did test 
one such variable. The implied rate of discount was computed for six 
income classes. His results appear in table 1. Clearly tbe variation in 
discount rates cannot be attributed solely to variations in borrowing 
rates. 

We would also expect age and social class to be important in pre¬ 
dicting individual intertemporal choices. The young behave impa¬ 
tiently in part because they have yet to master the techniques of 


'* Fisher 1930, p. 81. On this topic also sec Mailal and Mailal (1977). 
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TABLE I 

Estimated Discoi'Nt Rates Usino Mean Popi'lation Estimates 


Income Class 
($/>t) 

Observations (N) 

Implied Oiscouru 

Rate (%) 

600 

6 

89 

10,000 

15 

39 

1 .>>,000 

16 

27 

25.000 

17 

17 

35,000 

8 

8.9 

50,000 

3 

5.1 

Stii'RCf—Hau»nian l‘J79 



self-control. To the 

extent that these techniques 

are learned from 


parents, class differences will be observed. Most important, our model 
stresses the theoretical admissability of these variables. Only further 
empirical work can establish their relative explanatory power. 

IV. Summary and Conclusion 

We now briefly recapitulate our argument. We have investigated 
intertempoial choice as a problem in the economic theory ol self- 
control. As the quotation aj the beginning of Section I states, the 
concept of self-control is paradoxical unle.ss some kind of multiself 
model of man is adopted. We have introduced self-control into a 
formal model of intertemporal choice by modeling man as an organi¬ 
zation with a planner and many doers. Conflict occurs because the 
doers are myopic (i.e., selfish). This conflict is fundamentally similar 
to the agency relationship between the employer and the employee, 
and individuals use many of the same strategies that organizations 
adopt to deal with their “conflicts of interest.” These strategies can 
involve doer/employee discretion while their incentives have some¬ 
how been altered, or they may entail the implementation of precom¬ 
mitment (a rule) to avoid the doer/employee decision process al¬ 
together. 

The close correspondence between the solutions to control prob¬ 
lems adopted by organizations and individuals provides strong sup¬ 
port for our model. Although our model is nontraditional, our tools 
are strictly traditional. Formally, our model closely resembles that used 
by Ross (197.')) in his study of the theory of agency. Finally, we note 
that ours is a theory of rational behavior, just as Ross's theory is of 
profit-maximizing behavior. 

Many applications of the model are possible, and we have discussed 
a few briefly. The most important applications are in the study of 
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individual saving behavior. Our hypotheses are quite different in 
spirit from the permanent income and life-cycle hypotheses that cur¬ 
rently dominate the literature. On the basis of the evidence presented 
here, we feel these theories of saving should be reevaluated. 
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A Note on Loss of Control and the Optimum 
Size of the Firm 

Antonio Camacho and William D. White 

University oj /llinnis at Chteago Circle 


This note has been inspired by a recent paper in this journal by Calvo 
and Wellisz (1978). Its main purpose is to suggest that, in constructing 
models to study the size of firms, more attention needs to be given to 
the conditions which bring abttut their existence. 

In Section II of their article, Calvo and Wellisz prove a proposition 
that given certain assumptions about supervision, if a firm of any size 
exists, then loss of control will not impose finite limits on its size. 
Although their result is correct, they, as well as some other research¬ 
ers working in this field, fail to give explicit reasons for firms to 
exist. However, as we will argue below, there are not convincing 
reasons, under their assumptions, for firms of any size to exist, thus 
making their case for the appearance of unbounded firms rather 
weak. 

This result prompts us to suggest, as stated above, that more atten¬ 
tion must be given in this area of research to the very conditions which 
bring about the exi.slence of firms. For instance, models which, in the 
spirit of Williamson’s (1967) paper, explicitly consider problems with 
information and communication, or models where it is shown that the 
activities of workers or agents may be coordinated more efficiently 
through extra market arrangements, may both produce realistic and 
interesting results regarding the size of firms. 

We proceed as follows. First, we prove that, under Calvo and 
Wellisz’s assumptions, firms will never exist if workers can be self- 
employed. Second, we show by means of a “setup" cost example that. 

We are grateful to Joseph Persky for his comment.s on an earlier \ er5ion ol this note. 
Any errors which remain are, of course, our own. 

{Journal Political Economy, 1981, vol. 89. no. 2) 
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even if conditions which limit the opportunities of workers for self- 
employment exist, Calvo and Wellisz’s assumptions regarding skills, 
utility functions, and productivity of workers seem to imply that there 
are more efficient arrangements to carry out production activities 
than the formation of firms. The note concludes with an intriguing 
conjecture and a suggestion. 


I. No Limitations on Opportunities for Self-Employment 

We will proceed now to prove that, if there are no limitations on 
workers’ opportunities for self-employment, then under Calvo and 
Wellis/’s assumptions firms will never exist. We will obtain this result 
by showing that, under their assumptions, a worker can always 
achieve at least as high a level of utility by working for himself as he 
can by working as an employee for any firm which makes a positive 
profit out of hiring him. 

In their model, Calvo and Wellisz assume that the (Von 
Neumann-Morgenstern) utility index, U, of every worker is the same 
and depends on consumption, c, and effort, e\ more specifically they 
assume: 

C = «(c) - r(e),c s 0, 0 « e ^ 1, (1) 

where u' ^0, v' ^ 0, n" < 0, andt'" < 0. (The last two conditions rule 
out risk-loving behavior.) Effort is assumed to be at a maximum when 
e equals one, while the worker is assumed to be completely idle when r 
is equal to zero. 

Workers employed in firms are subject to monitoring where f is the 
probability that a worker will be detected shirking. Workers who are 
not detected shirking are paid a wage of w. Workers who are found 
shirking are paid we. 

For any P and w, a worker will select that level t)f effort which 
maximizes his expected utility z: 

z = P[u(we) — v{e)] + (1 — P)lu{w) — elc)]. (2) 

Let e be the value of e which maximizes z. 

Calvo and Wellisz assume also that the jtroductivity of every worker 
is the same and equal to ;8r (0 « p ^ 1) when working at the level of 
effort e. Given this assumption, a firm’s expected revenue from em¬ 
ploying a worker at a wage w and with a level of supervision P will be 
/3 p. The expected cost to the firm of employing the worker will be the 
expected payments to the worker, P{we) + (1 - P)w, plus the ex¬ 
pected cost of providing a level of supervision P. Thus, the total 
expected cost of employing a worker will be greater than or equal to 
P (we) + (1 — P)w. A necessary condition for the firm to make a 
positive expected profit is then 
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^ > P (we) + (1 — P)w 
or 


w < 


P(e - 1) + 1 


(3) 


Let E(0) be the maximum expected utility which a worker can 
obtain by working for a firm at a wage rate w and under a level of 
supervision P: 

E[lJ) = P[u(we) — + (1 “ f‘)[i‘(u>) - (4) 


The expected utility of a worker when self-employed and applying 
himself to his work at the same level of effort e as in (4) is 

E(U) = uipe) ~ v(e). (5) 

Taking into consideration condition (1) and inequality (3), we can 
easily obtain 

E{0) u({3e) - v(e), 

which in view of (5) proves our assertion. 


II. Limitations on Opportunities for Seif-Employment 

We will now discuss conditions that may limit workers’ opportunities 
to be self-employed. To facilitate this discussion we will first enumer¬ 
ate the main assumptions of Calvo and Wellisz’s model: Everyone has 
the .same skills; everyone has the same utility function and, therefore, 
the same attitude toward risk; there is no specialization of labor, no 
need for communication of instructions or for coordination of ac¬ 
tivities; there are constant returns to scale when everyone applies the 
.same level of effort to production activities. 

Given these assumptions and the additional assumption that there 
are no institutional restrictions on sell-employment, it appears that 
the only condition which might limit opportunities for self-em¬ 
ployment for part of the population is the existence of some factor 
of production, say land, whose ownership is unevenly distributed. 
This situation might allow owners to pay workers (those who lack any 
endowment of this factor) a wage below their marginal product. But 
let us analyze this situation in more detail. Consider two cases; (i) only 
one individual owns all the land; (ii) more than one individual owns 
land. 

In case i, an individual who is a monopolist may be able to start a 
firm and make total profits greater than by hiring workers at a wage 
below their marginal product. Then, as Calvo and Wellisz rightly 
prove, the owner can increase his profits by adding layers to the 
hierarchical structure of his firm until he employs everybody. 
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But this is a rather cumbersome arrangement. Suppose that the 
total amount of land is L, the total number of workers is k, and the 
maximum profit the monopolist can extract by employing k workers is 
Tr{k). By renting each worker a plot of land Uk at a price of iT(,k)lk, the 
monopolist can obtain the same amount of money as he would by 
establishing a firm, while workers will achieve a level of utility greater 
than or at least equal to the level they would obtain by working for the 
monopolist. Under these circumstances, firms are unlikely to be es¬ 
tablished. 

Similarly, firms are not likely to be established in case ii either. Any 
owner who might start a firm always has the simpler option available, 
like our monopolist in case i, of renting his land to the workers of his 
hypothetical firm and charging them a rent equal to the maximum 
possible profit he could obtain from establishing a firm divided by the 
number of his employees. Under this arrangement, the owner will 
receive the same amount of money as he would receive if he set up a 
firm, while workers will obtain a level of utility which is greater than 
or at least equal to the level of utility they would obtain by working for 
the owner as employees. 

The results obtained by Calvo and Wellisz in section II of their 1978 
paper and our own discussion above lead us to make the following 
conjecture, which we pre.sent as a still unproved theorem; that for any 
set of conditions under which there is no limit to the size of firms, 
there is no reason for firms to exist. And this conjecture, together with 
the fact that firms do exist and are of limited size, in turn suggests to 
us that the optimal size of firms may be determined by those very 
conditions that bring about their existence. 

References 

C.alvo, t’ruillerrno A., and Wellisz. Stanislaw. “Supervision, Loss of Control, 

and the Optimum Size of the VWm." J.P.E. 86, no. 5 (October 1978): 

943-52. 

Williamson, (Oliver E. “Hierarchical Contiol and Optimum Firm Size." 

75, no. 2 (April 1967): 123-38. 



Prior Information and the Observational 
Equivalence Problem 

Warren E. Weber 

Virginia Polytechnic Institute and State Univerdty 


The observational equivalence problem pointed out by Sargent (1976) 
consists of two parts. The first is whether regression analysis of so- 
called classical and Keynesian reduced-form equations can discrimi¬ 
nate between the competing hypotheses, using data from a single 
policy regime in the absence of any prior information concerning the 
policy rule being followed. The answer to this question is no and can 
be shown as follows. 

Consider the reduced-form equation for a “goal” variable (e.g., real 
GNP). The classical version of this reduced-form equation would be 

a(L)y, + = n„ (1) 

where m, is a policy variable (e.g., the nominal money supply), and 
fif is a white-noise error term. This equation is classical, since only 
innovations in the policy will affect the goal variable. As a conse¬ 
quence, the time path of the goal variable will be independent of the 
policy rule chosen, so long as the rule implies m, = + e,, where 

€( is a white-noise error distributed independently of /x,. 

The Keynesian version of this reduced-form equation would be 

a(L)y, -t- b(L)m, = »/,. (2) 

Assuming that (2) is invariant with respect to interventions that 
change the m, process, it is broadly "Keynesian," since it would be 
possible to solve (2) for a policy feedback rule which would minimize 
the deviations of the goal variable around some desired level. 


Professor of economics, Virginia Polytechnic Institute and State University. I am 
indebted to Anatoli Kuprianov, Bennett McCallum, and an anonymous referee for 
useful comments on this paper. 
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To show that estimation of (1) and (2) will not distinguish between 
the Keynesian and classical positions, suppose that the policy variable 
follows the rule 


y(L)m, = e, (3) 

over the entire observation period. Then substituting (3) into (1), we 
obtain 


a(L)y, + ^{L)y(L)tn, = fi,. (4) 

In the absence of any prior information on the order of the 
polynomials y(L) or ^(L), it would be impossible to distinguish em¬ 
pirically between (4) and (2). That is, if both equations included the 
same number of lagged terms, they would have the same sum of 
squared residuals, since l3{L)y(L) would be an arbitrary polynomial in 
L. Further, since (4) has an identical error to (1), the sum of squared 
residuals obtained from estimating (1) would be the same as that from 
estimating (2), regardless of whether the Keynesian or classical hy¬ 
pothesis were true. Thus, in the absence of any prior information 
on the lengths of the distributed lags in the model, it is impossible to 
empirically distinguish between the Keynesian and classical positions 
using data from a single policy regime. 

The second question is whether such a test can be performed when 
prior information on the order of y(/.) or/3(/.) is available. McCallum 
(1979) has pointed out one such case. Our purpose is to generalize 
McCallum’s result and to relate it to Sargent’s original paper and the 
empirical tests of the cla.ssical and Keynesian hypotheses performed 
by Barro (1977, 1978). 

Let t (L) be the polynomial in the lag operator L of the regression 
coefficients on the m, terms when equation (4) is estimated; that is, c (L) 
= /3(L)y(L). Thus, (4) can be empirically distinguished from (2), if 
and only if there exist testable restrictions on the relationship between 
y(L) and c{L). To examine the nature of these restrictions, first note 
that a typical element of the convolution r(L) = p(L)y(L) is 

k= - 30 

Thus, c(L) = l3(L)y(L) can be written in matrix form as 


C = H{y)B, 


( 5 ) 



COMMENTS 


4*3 


where H(y) is the (n + 

m + 

1) X (n + 

1) matrix 
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. . 0 
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. . 0 

0 


yz 

V. 

Vo 

. . 0 

0 

H(y) = 

yn 

V»-i 


• • Vi 

Vo 


yn+ 1 

Vn 


• ■ V2 

Vi 


0 

0 

0 

. . 0 

Vm 


( 6 ) 


B = (/3o, y3../3„)’ is the vector of coefficients of/3(L), and C = (Co. Cj. 

. . . , is the vector of coefficients of c(L). Then by the result of 

Rothenberg (1973, p. 37), knowledge of m or n will yield restrictions 
relating the coefficients of y(L) znd c(L) if rank [H{y)] < (n + m + 1). 

With this result, we can now interpret Sargent’s original argument 
and McCallum’s result. Sargent considers the case in which n = m = *. 
In this case, the restriction that the rank of // {y) be less than n + m + \ 
is not satisfied, and no restrictions between y(L) and c(L) are ob¬ 
tained. McCallum considers the case in which n = 0, m = ». In this 
case rank [H (y)] = 1 < », and the prior information given by knowl¬ 
edge of y(f,) implies restrictions on c(L). 

Rolhenberg's result, however, allows a more general statement of 
when knowledge of n or m implies that restrictions between y{L) and 
c{L) exist. This general result is that knowledge of n or m will imply 
that testable restrictions between y(/-) and c(L) exist, iinle.ss n = » or n 
is finite and m = 0. This genet alization follows directly from the rank 
condition stated above. It implies that unless y, depends on all past 
innovations in policy or policy is white noise, the classical reduced- 
form equation (4) can be tested against the Keynesian reduced-form 
equation (2). The test is whether imposing the restrictions of equation 
(5) leads to a statistically significant decrease in the explanatory power 
of the regression equation. 

The test of the classical model by Barro used the policy equation 

y(L)m, + X{I.)z, = e, (3') 

in place of (3), where z, is an exogenous variable (such as the min¬ 
imum wage or level of unemployment compensation) and X(L) is 
a polynomial of order r in L with Kg = 0. Substituting (S') into (1) 
would yield the new classical reduced form 
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a{L)y, + fi(L)y(L)m, + )3(L)\(L)z, = /x, (4') 

to be compared with the appropriately modified Keynesian version 
a(L)y, + b{L)m, + d(L)z, = V,. (2') 

Now let/(L) be the polynomial inL of the regression coefficients on 
thez, term when (4') is estimated, so that/(L) = /3(L)X(L). Then using 
the same arguments as above, (4') can be empirically distinguished 
from (2') if there exist testable restrictions between y(L) and c(L) or 
between \(L) and f(.L). The case of c(L) was considered before. 
Following the reasoning used for that case, we find that knowledge of 
r will yield testable restrictions between \(L) and f{L) if and only if 
rank [//(X)] < (n + r + 1). Since H(k) is (n + r + 1) X (n -f 1), 
knowledge of r will not imply that testable restrictions between \(L) 
and f(L) exist when n = «>. Thus, Barro’s method of including exoge¬ 
nous variables in the policy rule which do not appear in (1) will not 
yield testable restrictions to distinguish classical and Keynesian re¬ 
duced forms when y, depends on all past innovations in the policy. 
However, when n is finite and m = 0, so that no testable restrictions 
between y(L) and c(L) exist, the knowledge that r S' 1 implies that 
testable restrictions between k(L) and f(L) exist. Thus, in this case, 
Barro’s method will permit the classical and Keynesian reduced-form 
equations to be distinguished when they could not be if z, were 
omitted from the policy rule. 

The practical usefulness of these results, of course, depends on the 
existence of reliable theoretical restrictions on the order of the lag 
distributions of ^(L), y(L), or k(L). The order of /3(L) depends on 
the extent to which past errors in the expectation of policy affect the 
current level of the goal variable. One source of restrictions on the 
value of n can be found in the models of aggregate supply developed 
by Lucas (1972, 1973) and Leiderman (1979) in which producers in 
informationally separated markets face the problem of distinguishing 
relative price changes from absolute price changes, given current 
information only on the prices they receive. In these models, output is 
affected only by the current error in the expectations of the price level 
implying n = 0. The same restriction on the value of n arises from 
Sargent’s (1978; 1979, chap. 16) models of employment. McCallum 
also argues in favor of such a restriction on the grounds that “it is 
hard to imagine ways in which past expectational errors could have 
direct effects on current behavior—bygones are, after all, bygones” 
(1979, p. 398). Thus, there appears to be a solid theoretical basis for 
imposing the restriction that n = 0 on /3(/-).' 


' This conclusion is not altered by the consideration of adjustment or search costs. 
Such costs may lead to past expectational errors having indirect effects on the current 
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Given n = 0, the rank conditions above indicate that testable re¬ 
strictions between c{L) and y(L) or between [{L) and A,(Z.) will exist 
unless policy is white noise (i.e., unless w = r = 0). As yet, there is no 
satisfactory theory of how policy functions are determined, which 
would allow us to theoretically determine that policy is not white 
noise. However, the evidence from empirical attempts to estimate 
monetary policy functions does support such a conclusion.* 

Thus, we would argue that reliable theoretical and empirical evi¬ 
dence exists to allow testing of models with classical implications 
against models with Keynesian implications, using data from a single 
policy regime. However, the lack of theoretical restrictions on m or r 
does caution that such tests should check for their robustness to 
changes in the length of the lags assumed in the policy functions 
underlying the test. 
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The Enrhtsure of Oteati Reioutres: Eronomtcs and the Law of the Sea. By Ross D. 

Eckert. 

Stanford, Calif: Hcxjver Institution Press, 1979. Pp. 390. $16.95. 

This IS a clearly written book about the Law of the Sea, applying Coase/ 
Deinset/. economic ideas to regulation of, and property in, six or more uses of 
ocean re.sourccs fhe cetural theme is that national “enclosure" by the exten¬ 
sion of base lines, and widening of territorial seas and various types of 
exclusive economic 7ones, are analogous to the historic “enclosure" of com¬ 
mon lands and the appropriation of frequencies and so on. The book c loses 
with chapters on UNCLOS (the United Nations Conference on the Law of the 
Sea) as a venue in which entlosure is debated and arranged. 

Written for a more general audience than academic economists, the bcxik is 
sometimes less than rigorous in its analysis of rents, profits, and tran.sactions 
costs and may also be given to sliding over some difficult details of law, 
geography, engineering, and the other fields in which expertise is needed to 
write such a book satisfactorily to all. But the dcKumented attempt to put the 
claims of an econotnics/law property rights approach is worth pondering. 

The author fully realizes that “watertight" exclusivity arrangements for 
some ocean uses would be astronomically costly or even technically impossi¬ 
ble. Consider fisheries, to which he devotes a long chapter. Enclosure of 
coastal fisheries leads to internalized control over local stocks but is obviously 
less reliable in controlling the harvesting and growth of migratory species. For 
the latter, the author faintly recommends (as he does for some types of 
oil-spill problems) a mixed international-coastal regime. But he warns that, 
for such arrangements, the transactions costs between nations are high; 
he conjectures that costs for ccxiperation, enforcement, and decision making 
increase seriously with the number of countries participating; and he comes 
out in favor of single-coastal-nation fisheries' regimes. 

In this recommendation he is in distributive deep water, as he recognizes, 
fhe alternative to single-nation control is treaty-organized “communal” con¬ 
trol, giving some power to all nations. Eckert argues that this system is inferior 
to that by management by individual states. The evidence of inferiority is only 
cjne of ccjst and ef fectiveness: that transactions costs do seem high, and success 
does seem incomplete, in such Ixidies. On such cost evidence, the authoi 
seems inclined simply to argue that single-nation regimes should prevail. But 
this alternative is not one that is available. Not only are many cxean resource 
uses unenclosable physically, but many claims to conflicting fishing or other 
rights are of ancient vintage and cannot be cancelled merely by an economist 

[JourryU <?/ PolUiral Economy. 1Q8I, v<»l 89. no 2| 

(gj 1981 ^ The University of C.hKaf{o 

416 




BOOK REVIEWS 417 

asserting that fishing success is higher and costs lower when one country takes 
all. 

There ts an answer, involving multinational equity ownership of the rights 
to use certain ocean resources, actual management being entrusted by all to 
one nation or to a private firm. The Pacific fur seal treaty provides an 
example. Some UNCLOS proposals having to do with minerals and other 
resources could be interpreted or remolded in this form, but Eckert does not 
do so. His adamant opposition to large numbers is therefore unconvincing. (A 
side benefit of this discussion is his analysis of the process of national adjust¬ 
ment to the likely future outcome of UNCLOS bargaining. A country about 
to make a bargain that depends upon its initial position tries to magnify its 
initial position. Eckert notes this several times.) 

A .second answer to the many-nation difficulty is to offer compensation to 
those who arc excluded. If it is too costly to let all nations fish or mine, then 
Eckert should provide for some way to buy them out. Although the prtKess of 
doing this would be costly, it might work, and the UNCXOS pursuit of the 
dogma of the oceans being "the common heritage of mankind” could be 
utilized here. But Eckert seems to me to give little if any weight to the view 
that UNf’.LOS is a constitution-making body from which all must depart 
better off than they arrived. 

In other words, his quickness and clarity in making points against UNCLOS 
prot.edurcs do, it seems to me, blind him to the difficulty that distributional 
imperatives do call for multination ownership or benefit. Otherwise the injus¬ 
tice of exclusion would match that of the original enclosure movements, 
which Eckert seems to ignore. 

In economists’jargon, Eckert seems strongest on clficiency and weakest in 
the realms where distribution and public choice meet. This is a pity, for his 
book ends in frustrated anger at the "hypocrisy” of bargaining at UNCLOS, 
when his earlier treatment, clean and informative, somewhat extended be¬ 
yond the domain t)f operating transactions costs, would suggest the limits of 
what is possible given the costs of constitution making and the conflict in every 
nation's breast of distributional and efficiency objectives. 

Ani hony St.orr 

U7ihifT.\ity oj Bnluh (Uilumhia 


Isities in Financial Regulalum. Edited by Krankcin R. Edwards. Regulation of 
American Business and Industry fieries. 

New York: McGraw-Hill Book Co., 1979. Pp. 526. $42.30. 

T his book can best be thought of as an intellectual sampling of approaches to 
regulatory ])roblem.s by a diverse group of economists and lawyers. For those 
who want such a sampling, reading 11 would be useful. All economists do not 
think one way about regulation, and all lawyers another. There is a diversity 
o( opinion in both cam]js. As an economist, I will venture the opinion that the 
economics ptofession can offer two criteria that should be applied to analyses 
ot regulatory problems; (1) Are sound theoretical notions applied to the 
ptobleins? and (2) Is goocl empirical methodology used? The first criterion 
involves not merely an understanding of theory but an understanding of the 
problem, how it arises, and the market forces that surround it. The second 
criterion involves not merely the gathering and analysing of information but 
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asking the right questions about the type, amount, and quality of information 
needed to analyze the problem. It is useful to look at the papers in the book in 
this manner. Lawyers will undoubtedly have other criteria, but in my opinion 
a great number of our regulatory problems have arisen because one of these 
two criteria has been violated. 

The papers in this book indicate that some thinkers pay attention to these 
criteria, others do not, and the quality of the papers is influenced accordingly. 
Kenneth Scott analyzes the dual banking system as a model of competition 
among regulatory agencies. By dual banking system, he means the system of 
alternative chartering at the state and national level and the system of divided 
regulation at the federal level that goes along with it. However, he only 
analyzes one-half of the problem. Competition among regulators lets banks 
prevent the imposition of unduly burdensome regulation by using the option 
of a change in chartering status to play one regulator off against another. 
Scott says this is good. But unless there are other checks on the regulators, the 
logical end of such a process is no regulation at all—that is, competition in 
laxity. But other checks do occur. Bank regulation and supervision is a 
technical business in which the best sy.stems and procedures are far from 
obvious even to a trained professional. Regulators, like the rest of us, do not 
like failures, and they learn from each other’s mistakes. As in the mar¬ 
ketplace, alternatives in bureaucratic regulation promote innovation and effi¬ 
ciency, a form of “competition in excellence.” In their periodic oversight 
hearings on the condition of the banking system, the Congressional banking 
committees are not just as.se.ssing the credibility of the reports of one 
monolithic regulatory agency. They are comparing the performance of sev¬ 
eral agencies. It is not obvious that the system produces overregulation or 
iinderregulation. But there are inherent checks and balances against either 
tendency. 

Bank regulators have the dual responsibility of preventing extensive bank 
failures which would bring about unwanted contractions in the money supply 
and of insuring low-cost delivery of banking services to the public. This dual 
responsibility should be kept in mind when analyzing proposals for change. 
Scott presents an extremely complicated organizational chart to depict the 
structure of the system that regulates commercial banks. The chart displays 
the interrelationships between the Comptroller of the Currency, the Federal 
Reserve, the Federal Defjosit Insurance Corporation, and the 50 state regu¬ 
latory agencies. The chart is incomplete. It does not include the Securities and 
Exchange Commission, the Federal Trade Commission, the Treasury De¬ 
partment, the Department of Housing and Urban Development, the Com¬ 
merce Department, the Labor Department, and the Energy Department— 
some of the other agencies that have gotten into the business of regulating 
banks. In addition, the Federal Home Loan Bank Board, the National Credit 
Union Administration, and the Farm Credit Administration—all omitted 
from the chart—arc also regulators of deliverers of banking services and 
interact with the other bank regulators. The chart was constructed by Senator 
Proxmire’s staff for the purpose of justifying consolidauon and simplification 
of federal bank regulation. If we wish to design bank regulatory systems for 
the administrative convenience of the regulators, the senator’s proposed 
reforms merit consideration. But if the purpose of the regulatory system is to 
insure continuous and efficient delivery of banking services in the mar¬ 
ketplace, the failures of the current system arc not obvious. 

Failure to apply both criteria is illustrated in Roy Schotland’s paper on 
conflicts of interest. He relies extensively on anecdotal evidence and con- 
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structed situations with very little analysis of market structure or economic 
incentives. On page 125 he says that most conflicts occur because com|>etition 
is not effective, an assertion that is not justified with any evidence. On page 
128 he hastens to add that most conflict situations are not exploited. Could it 
be that competition is effective? He says that confidentiality requirements 
make “marketplace type shopping” unfeasible for doctors, lawyers, and in¬ 
vestment managers. No evidence is offered on the extent to which consumers 
actually do shop around for the services of such individuals. 1 make an 
attempt at it, and 1 do not think I am that unusual. Schotland makes an issue 
out of the conflict an investment banker incurs because of his obligation to 
“the buying public, to corporate issuers, and self-interest.” He does not say 
why this is any different from the problem faced by the president of a 
manufacturing firm who has to mediate the competing claims of customers, 
suppliers, and stockholders, or why it should require any more regulation. 

The paper by Robert Shay, “Consumer Protection in Credit Markets; An 
Analysis of Regulatory Reform," is an excellent example of Judicious applica¬ 
tion of both criteria. Shay uses the economic analysis of property rights to 
assess the regulatory reform movement affecting consumer credit markets. 
He develops an analytical framework and applies it to one limitation of a 
creditor’s remedy to recover debt; the prohibition of wage assignments. He 
then compares his analysis of the problem to that used by the Federal Trade 
Commission (FTC). The comparison is fascinating and revealing. Although 
he does not do any empirical testing, he develops a precise way of getting 
answers to the questions needed to implement his theory. 

Shay believes a problem in consumer credit markets today is that the locus 
of bargaining is on rates and terms and not on the remedies and rights of 
creditors and borrowers. Rather than prohibit wage assignments, parties 
should be permitted to bargain over whether or not they are to be allowed in 
the contract. The author notes that when wage assignment provisions are 
allowed they arc not uniformly found in all credit contracts. This suggests 
that when they are included, the reason is to lower the cost of credit to 
high-risk applicants. 

To assess the need for government interference Shay suggests using Dem- 
setz’s principle of compensation. The government should try to buy permis¬ 
sion to allow the activity from those who feel they would be harmed by 
allowing it, and it should try to buy permission to restrict the activity from 
those who feel they would benefit from allowing it. The lower<ost option 
should be purchased. 

Shay’s analysis indicates that the adversary relationship is not between 
debtor and creditor but between those who will be harmed if there is freedom 
to contract for remedies and those who will be harmed if there is not. The 
creditor is best viewed as an intermediary. Only a modest (xirtion of the funds 
being lent are his own funds. Nevertheless, the creditors’ adjustments to 
governmental intervention determine who gets helped and who gets hurt. 
The need is for marginal benefit/cost comparison between two groups of 
debtors, not between the debtors vis-a-vis the creditors. The rather crude 
balancing process used by the FTC is between the creditors and the debtors. 

Two of the papers dealing with electronic funds transfer (EFT) propose 
radical policy changes without using either of the criteria mentioned above. 
Almarin Phillips claims that regulation and institutional arrangements— 
interest rate ceilings, deposit insurance, etc.—cause difficulties in the admin¬ 
istration of monetary policy. Undoubtedly this is true. But he then goes on to 
assert, in language phrased to make the dissenter feel antediluvian, that EFT 
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makes all of this substantially worse and will bring a crisis. He offers no 
evidence or analysis to support his conclusion, and I, for one, dissent from it. 
To be sure, many regulations cause diffictdties for private bankers, central 
bankers, and the public. But it is the regulations that cause the problem, not 
the ii.se of electronic forms of communication. 

Similarly, Alan Westin .says that EFT creates the need for new statutes to 
spell out individuals’ rights. He says that when information is merged from a 
variety of sources and not used for one-time purposes a new trustee relation¬ 
ship is created. Of course, such information gathering is as old as banking 
it.self. But when it is done electronically, Westin believes new laws are called 
for. He IS not exactly sure about what the new laws and regulations should be. 
He proposes a decade or two of controlled experimental regulation. He gives 
no discussion of current abuses to support this recommendation. 

Arnold Heggestead does a survey ol studies of bank competition and 
performance. Most of the articles surveyed, and the survey itself, suffer from 
a lack of ajjplication of the criteria mentioned here. This is best illustrated by 
Heggestcad's ambiguous and contradictory conclusion. He starts out by stat¬ 
ing that IcKal banking markets are “more monojiolistic than competitive.” He 
ends by stating Hve major areas where lurther empirical research is needed to 
clear ii|i anomalies. 

If he had started out tiy analyzing the way markets operate he could have 
explained some of the anomalies himself. Intel-institutional competition is a 
fundamental fact of today’s marketplace. If one examines all of the various 
asset powers, liability powers, and fee income powers of commercial banks, 
there is only one privilege that commercial banks have that is not granted to at 
least one other financial institution, riiis is the power to offer demand 
deposits, a privilege whose value is rapidly being eroded by market innova¬ 
tions and changing product mixes. In 1947, commercial banks held 56.3 
peicent of the assets of all financial institutions. In addition to thrift institu¬ 
tions, this latter categoiy includes life insurance companies, private pension 
funds, state and local employee retirement funds, money market funds, and 
finance companies. In 1977 the corresponding figure was 39.6 i)ercent. In 
1947, tliey held 81.7 percent of the total deposits of all depositoi y institutions. 
In 1977, It was 59.3 percent. In 1947 commercial bank deposits were 56.2 
percent of the total liabilities ol all financial institutions. In 1977 the corre¬ 
sponding figure was 32.4 percent. IJcmand deposits, the product for which 
commercial banks are sometimes said to have a ’’monopoly,” were 40.3 per¬ 
cent ol total financial institution liabilities m 1947. In 1977, the coriesponding 
figure was 10.9 peicent. This is not the picture of a monopolistic industry. 
W'hen they had reason to do so, bank customers were highly sctphisticated at 
finding alternative ways of meeting their financial needs. 

.Some of the other papers are good examples of applications of the criteria 
.set forth here. Others are not. But enough has been said to illustrate the 
problem. 

The book suffers from a lack cjf cli.scussion of the political aspects of 
regulation. I refer to the process whereby inlormation is gathered through 
hearings and testimony, and laws are pa,s.sed: and more information is 
gathered through hearings and testimony, and regulations are written. With 
increasing frequency, outsiders are having a hard time matching the laws with 
the regulaticvns, and Congress and the regulators are pointing their fingers at 
each other. I have no hnal answer to this prcjblem, but I will suggest two 
partial cau.ses. First, the process inherently favors anecdotal evidence over 
econometric work and surveys which, because they are subject to 
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methodological criticisms, inevitably present a more balanced view of the 
world. Second, when the banking system works well, the bankers get the 
credit, not the regulators. When it goes awry, the regulators share a good 
portion of the blame. This inevitably promotes regulations designed to pro¬ 
tect inefficient banks, to the detriment of the customers of well-managed 
banks. 


American Bankers Association 


P. Michael Lai b 


New Challenges la the Role of Profit. Edited by Benjamin M. Friedman. 
Lexington, Mass.: D. Ci. Heath & Co., Lexington Books, 1978. Pp. viii-(-127. 

$14.95. 

In 1976, Professors Samuelson, Arrow, and Lundberg presented lectures in 
the third series of the fohn Diebold Lectures. Along with an introductory 
chapter by Professor Friedman and a concluding round-table discussion, 
ifiese lectures comprise Neiv Challenges to the Role of Profit. Samuelson gives a 
broad-brush irealment of the trend of profits, and intellectual opinion about 
profits, in the modern mixed economy. He also includes a brief history of 
profit theory. Arrow links criticism of profits to general attacks on private 
property, but he emphasises the conceptual differences between profit, 
property income, and private property. Lundberg reports on the Swedish 
■‘middle way"—now dubbed "fund .siK-ialism”—and attempts to draw conclu¬ 
sions for Western countries generally. 

All things considered, the book is a disappointment from almost any per¬ 
spective. Even for a work of this genre, Neii< (Challenges lacks focus. Though 
edited by a perspicacious scholar who contributes an introductory essay, the 
txKik wanders. The subject remains unclear. The principal lectures were 
delivered by academics, with comments by a mixture of academics and 
nonacadeniics. At a minimum, tine would have hoped for tareful delineation 
of a profit concept, distinguishing profit from equilibrium returns to owners 
of tapital, land, and t'lher scarce resources (i.e., interest and ectinomic rent). 
Though any number of allusions and some specific references arc made to 
the distinction between ititerest (or ordinary profit), rent, and pure profit, 
these various incotnes are constantly conflated, fhe analysis is further con¬ 
fused by unwarranted shifting in the lectures between static and dynamic 
analysis. For instance, in any essay containing a number of fruitful insights, 
Friedman observes that “real economics are dynamic, not stationary" (p. 4): 
he then notes the importance of this reali/,ation for our understanding of 
entrepreneurship, innovation, development, and industrial structure. This 
distu.ssion is immediately followed, however, by a reference to the intertem¬ 
poral allcKaiion role of profits (more properly an interest problem). Profits 
arc then treated as accruing to owners of capital, thus entangling pure profits, 
rents, and interest. .Shortly thereafter, Friedman considers the trade-off Ix*- 
tween equity and efficiency—or distribution and allocation—arising in a 
profit system. Almost every contributor eventually refers to this trade-off. 
1 he discu.ssions of the "trade-off" remind one of those concerning the "cruel 
choice” between inflation and unemployment. There niav indeed be different 
combinations of income distributions and allcKaiion efficienev, depending on 
the taxes and other constraints placed on profits, but it is by no means clear 
that any stable social choice set exists from which one could pick and choose 
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combinations. Further, if one considers the behavior of economies over time, 
then the question is not whether profits (however defined) contribute to 
income inequality, but whether the profit .system produces more or less income 
equality compared to other,/easiAfe economic systems. The latter question 
is not really addressed in this volume. 

Along with Arrow's, Friedman’s is one of the better contributions to the 
book. 'The shifting from dynamic to static analysis and between profit con¬ 
cepts, even in the introductory chapter, is indicative of problems in the rest of 
the book. No fruitful analysis of “profit," or of any other concept, is possible 
without a shared understanding of the nature of the problem being discussed. 
Such a shared understanding is lacking in New Challenges. The contributors 
could surely have adopted a number of profit concepts or definitions. But 
regardless of the concept or definition chosen, reasonable consistency in 
analysis is necessary. In this book there is almost no analysis of profits as the 
essential signaling and motivating force in a market economy. Despite oblig¬ 
atory references to Schumpeter, no approach treating profit as an income 
emerging from unexpected changes in economic data is seriously considered 
or analyzed. The work of Kirzner, identifying genuine profit as a return to 
superior entrepreneurship—alertness to altered conditions—is totally ig¬ 
nored (Kirzner 1973, 1979). Yet if the related theories of Kirzner, Mises, 
Schumpeter, and Knight are at all correct, then profit—not some fictitious 
auctioneer—is responsible for any tendency toward plan coordination and 
equilibrium in decentralized economies. Coordination is, of course, always 
incomplete, because change is always occurring. But without appropriable 
profits, the decentralized market system cannot work. 

The Kirznerian profit theory clarifies a number of conceptual problems. 
Profit, being a return to a special labor skill, is no longer conflated with 
interest, a return associated with capital accumulation. Nor is profit a rent or 
quasi rent accruing to a fixed or temporarily fixed factor, as in standard 
textbook analysis. Since profit disappears as quickly as it is publicly identified, 
and since it is not an economic rent, profit is not a taxable surplus. Arrow and 
Lundberg suggest that the ownership of property and the receipt of property 
income can be separated without serious allocational consequences. As Arrow 
pul it; “This justification of the charging of price for the use of propierty is not 
necessarily an argument that the reward should go to the owner rather than 
to someone else” (p. 53). This line of argument is more plausible if profit is 
considered as a residual or rentlike income. In Kirzner’s analysis, however, 
profits are the driving force of economic activity. They are actively and 
consciously strived for. They neither accrue to passive resource owners nor 
are they a residual. Thus, profit income and private property are inextricably 
linked and intertwined. As Marxists have always understood, the fundamen¬ 
tal issue is the private property system and the very existence of markets. 

The latter point helps indicate the deficiency of the book. Most of the 
challenges to profits discussed are quite old. Yet the contributions do not 
build on the long literature on the subject (Arrow’s essay again is the excep¬ 
tion). Newer critiques of profit and the market system derive from “highbrow 
opinion” and not from popular concerns (to which the book is ostensibly 
addressed). The ordinary fjerson is not concerned that market economies do 
not attain global maxima and are never in general equilibrium. (Whoever 
suggested they would be?) The ordinary citizen is becoming aware, however, 
of the tendency for governmental policy in a mixed economy to substitute a 
system of monopoly rents and arbitrary incomes for that of earned profits 
(see Hughes 19'77). He may lack the theoretical background to explain the 
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process, but the man in the street is surprisingiy on the mark in his judg¬ 
ments. Lundberg tells us that Western economics will, like Sweden's, become 
more mixed (p. 91). If so, monopolization and socialization of earned income 
will become more widespread. 

In the mixed economy, or fund socialism, the criticism that profits do not 
motivate innovation or lead to plan coordination changes from an erroneous 
critique to an essentially correct one. Inter alia, as measured profits are taxed 
at higher rates and supplanted by explicit and implicit subsidies, their moti¬ 
vational and allocational importance is diminished. Yet this process will re¬ 
main largely hidden if viewed from a perspective in which profit and eco¬ 
nomic rent cannot be clearly distinguished in an analytical sense. In the latter 
view, "profit" is a type of surplus that, at least within some range, can be 
altered with little impact on resource allocation. 

If, in fact, this book were to help focus attention on the distinctive operation 
of a mixed, as opposed to a market, economy, then it would have made a 
contribution. Were the book to help spark a public debate on this question, its 
faults could be overlooked. But if such a debate were to occur, it would 
probably be more in spite of than because of such efforts as this book. 

Gerald P. O'Driscoll, Jr. 

New York University 
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Economic Equality and Fertility in Developing Countries. By Robert Repetto. 
Baltimore; Johns Hopkins University Press, 1979. Pp. 206. $14.50. 

This work is best viewed as a collection of pieces in which a model of the 
relationship between income and fertility is developed, applied, and in turn 
tested through these applications. The questions raised by the author are 
crucial ones for population ptolicy. his answers are well reasoned, and if there 
is still room for disagreement regarding his policy conclusions it is neverthe¬ 
less much to his credit that the evidence and the issues are well laid out in the 
book itself. 

The book opens with a statement of its central theme: that fertility is a 
nonlinear negative function of income such that the marginal effects of an 
income change are greater for poor people (although pterhaps not for the 
abject ptoor) than for the rich. It follows from this nonlinearity that a given 
change in income level would have little impsact on fertility for families in the 
middle and uppier end of the income distribution but might very well have a 
large impact on fertility for poor families. The ptolicy implication is that 
redistribution of income from rich to ptoor would generate a decline in 
*8K*'cgate fertility and hence pxipulation growth. 



424 


JOURNAL OF POLITICAL ECONOMY 


Having staled this hypothesis in chapter 1, chapter 2 develops a cross¬ 
country model for testing its implications. Here it is necessary to assume that 
the basic structural relationship between fertility and income at the family 
level is essentially the same in every country (ceteris paribus). If so, it follows 
quite easily that any redistribution of income from the relatively rich to the 
poor, including transfers from rich countries to poor countries, would reduce 
aggregate fertility. It also follows that the relationship between aggregate 
fertility and income is not fully specified by the level of national income 
without some measure of its distribution. Chapters 3-5 are country studies 
establishing the robustness of the basic relationship. Chapter 6 concludes the 
book with an analysis of worldwide fertility along the lines suggested above. 

Each of the country studies uses some multivariate analysis based on 
household data as well as more aggregated descriptive statistics. In general, 
the regressions arc less satisfying than the descriptive analyses. In part this 
may be because of diffi< ulties obtaining data for the appropriate concepts and 
interpreting results based on imperfect data. Largely, however, it is because 
the .separate country studies have not been well integrated into a coherent 
whole. Although the relationship between income and fertility is the only 
point at issue, a fairly tomplex country-specific model is needed in order to 
isolate that relationship. Despite the tiaim that chapter 2 develops a general 
model, ea< h chapter contains its own model which is not clearly related to the 
"general" one. Moreover, the deveittpment of these models, the discussions of 
the data, and the ituerpretation of results are all t(H> cursory to be satisfying, 
even though the limited purpose of this study may not warrant a full country 
analysts for each case. I'he author’s analysis is much more convincing when he 
ts using hts model to tell us a story about country experiences. We are thus left 
with the impression that he may know more than he has revealed in his 
lountry studies. 

1 he case study of Puerto Rico has relatively little descriptive analysis; it u.ses 
1970 census data to estimate the magnitude of an income effect on fertility for 
households at different income levels and to provide empirical confirmation 
of the nonlinear relationship. The second case study shows how such a 
relationship can be used to interpret the Korean experieiue: a major redis¬ 
tribution of income followed by both growing income and declining fertility 
This is the most thorough of the case studies, and it is perhaps no coincidence 
that the regression model is limited to a few pages at the end. 

By far the most disturbing part of this bcK>k is the chapter entitled "Intel nal 
Policies for Income Distribution m Rural India." I'he data from Malar Taliika 
(a rural area in India’s Gujarat state) are fascinating in that they resurvey an 
area for which data were al.so collected in 1930 and so permit an analysis of 
long-run changes in income distribution. These data are used as a point of 
departure for an analysis of the benefits of land reform; fertility tomes up 
only once, in passing, when the author a.sscrts that redistribution through 
land reform will have the desired income effect. The discussion of land 
reform which takes up fully half of this chapter is well presented and argued, 
but its relevance to the main theme or to anything else in this fxxik is 
established tenuously at best. First, an income regression equation is esti¬ 
mated for Malar I'aluka in which both human and nonhuman assets are 
included as independent variables, and it is found that variation in the 
nonhuman asset variables accounts for by far the largest portion of the 
explained variation in income. However, the author tells us that Malar I'aluka 
is an agricultural region with large variation in the size of landholdings and 
that the survey year was a disastrous agricultural year for most of India; 



BOOK REVIEWS 


425 


ceteris paribus, this should lead us to expect a wider-than-usual distribution 
of income correlated with each farmer’s harvest within the region. At the 
same time, we are told why the human capital variables will not predict 
income; because there is no variation in schooling within the region and 
because age pertains to the household head instead of the individual earner. 
Thus the empirical results for Matar Taluka may tell us what was going on 
during the survey year but certainly cannot justify choosing land reform over 
human capital development as a l<»ng-tcrm strategy. 

Repetto follows his analysis of the effects of land reform with some argu¬ 
ments why education cannot be effectively used to redistribute income 
through human capital stocks, even though that is in fact what he says has 
happened in Korea. His arguments often conlu.se inequality measures with 
the distribution of income. For example, since the initial distribution of 
schooling in Matar Taluka is very equal, the first-round impact of increasing 
the education level of younger cohorts will be to int rease itiequality of earn¬ 
ings, presumably ht'cause education improves earnings. Even ignoring poten¬ 
tial intergenerational transfers, this is surely a most extraordinary argument 
against progress, and one which I doubt would impress many low-income 
families. In the same vein, the following paragraph contains the suggestion 
that since improved education would affett only new entrants to the labor 
force, it would be decades before their numbers could be great enough to 
appreciably affett the average income Then, having observed that an im¬ 
portant value of education is that it "prepares students . . . for jobs outside the 
traditional sector,” he says that land reform is needed because education "has 
more value for a future landholder than for a future agricultural laborer" (p. 
149). 

The irony of this analysis of the relative merits of human and nonhuman 
a.s,sets as sources of increased income for the pcHir is that although it is a small 
part (jf Repetto's bcKik it is strategically placed at a point where policy implica¬ 
tions are being drawn from research findings. The fact that the discussion of 
this point is at best superficial is especially puezling in the light of the com¬ 
paratively careful reporting of evidence in the earlier chapters. Indeed, on 
the whole this book contains a u.seful perspective on the impirlance of inc ome 
effects on fertility and provides convincing evidence that the improvement of 
incomes for the poor, especially in developing countries, is necessary and 
even perhaps sufficient as a condition for reduced aggregate fertility. 

('.ARMKi U. Chiswick 

Univrrsity 0 / Illinois al C.hicago Cinle 


Tho Economics 0 / the Prrjorming Arts. By C. D. Thkosbv and Ci. A. Withers. 
New York; St. Martin’s Press, 1979, Pp. 348. S27..50. 

BcKiks on the ec onomics of the performing arts are no longer a rarity. But this 
volume contains material quite differeni from that of its predecessevrs. It goes 
over and brings up-to-date the statistics on costs, revenues, and public sup¬ 
port which have been analyzed before. But, in addition, it .seeks to go as far as 
the subject allows in applying the .standard lechnicjues economists have used 
to analyze other industries—adapting theoretical micro models and econ¬ 
ometric analysis to the problems of the arts. 

Predictably, the success of this venture is mixed. But this is no discredit to 
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the authors, who are obviously masters of both the subject area and the 
techniques, and who display consistent good judgment on issues of public 
policy, which is the book's ultimate focus. 

The volume deals with ail of the subjects which one would expect it to 
consider, starting with the organizations which present performances (“the 
firms”), going on to consumer demand, industry structure, and then turning 
to public assistance to the arts. 

Most subjects are treated in four different ways. Each is discussed in 
general terms to describe the most pressing issues to which it gives rise. 
Second, extensive and often illuminating descriptive statistics are provided. 
Third, where piossible. a formal theoretical model is constructed. Finally, an 
econometric model is formulated and the values of its parameters are esti¬ 
mated. 

One can quibble here and there with the more discursive portions of the 
bcKjk, but, by and large, they are accurate and illuminating. Of particular 
value is the extensive material comparing pterforming activity and its financ¬ 
ing in Australia, Canada, Great Britain, New Zealand, and the United States. 
The value of such international comparisons has often been asserted, but in 
the past the materials provided for the purp>ose have generally been quite 
spotty. The major conclusions that emerge from the comparisons should, 
however, hardly be surprising to anyone with some acquaintance with the 
field. They show remarkable similarity in the economic problems faced by the 
arts in the different countries, in the characteristics of the audiences, in 
the nature of the organizations, etc. For example, it is “striking” that “proportions 
of the population expwsed to the performing arts are substantially higher for 
the middle-aged, high income, high education, professional, managerial and 
white-collar groups” (p, 96). About the only major difference seems to be the 
heavy reliance on private foundations and individuals for financial suppiort in 
the United States, while elsewhere such income flows almost exclusively from 
government. Far more remarkable are the figures on the growth of govern¬ 
ment support in the five countries studied. For the 8 years 1969 through 
1976, in every one of the countries government assistance grew faster than 
price level, population, average hourly earnings of labor, GDP, and (with the 
exception of Canada) aggregate government expenditure. In Australia and 
the United States the degree by which government outlays on the arts out¬ 
paced these other indices was particularly remarkable. All of this is notewor¬ 
thy because in the past periods of inflation have been times of financial 
difficulty for the arts. As a matter of fact, more recent data for the United 
States indicate that government outlays on the arts have continued to rise in 
real terms, so that while the arts have experienced financial difficulties stem¬ 
ming from other sources, at least so far, and with some noteworthy excep¬ 
tions, real cuts in government funding have not been among them. 

The authors’ discussion of policy deals with such issues as justifications ol 
assistance, the determination of the appropriate assistance levels and of the 
allocation of the funds by size of organization, art form, geographic location, 
etc. In discussing justifications for support the authors build their conclusions 
on the economists’ usual grounds for government intervention; externalities, 
redistribution, merit goods, option demand, etc. The discussion strikes one as 
sensible, though no surprising conclusions emerge. They reconfirm what 
economists had noted before—how difficult it is to provide a very strong 
justification for government support of the arts in these terms except perhaps 
on the merit goods argument. But, then, the term “merit good” merely 
becomes a formal designation for the unadorned value judgment that the arts 
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are good for society and therefore deserve financial support. In other words, 
the merit good approach is not really a justification for support—it merely 
invents a bit of terminology to designate the desire to do $ 0 . 

As already noted, the unique contribution of this book lies in its theoretical 
and econometric models, and these are, therefore, worth considering in a bit 
more detail. These models deal with a variety of subjects; consumer demand 
for leisure (following Buristam Linder); willingness to offer philanthropy; the 
theory of merit goods; propensity to provide philanthropy; and the behavior 
of the arts firm and its response to subsidy. The last of these is perhaps 
representative. Taking the nonprofit arts firms to be managed by utility 
maximizers constrained by a zero-loss condition, the authors proceed to 
formulate a utility function whose variables are attendance and quality, a 
production function which cakes attendance to be a function of number of 
performances, number of distinct productions, capacity of auditorium, and 
quality. This is related to a cost function, whose variables are attendance and 
quality, and a demand function deptendent on price and quality. From this, a 
standard partial-equilibrium diagram emerges, showing the equilibrium of 
the utility-maximizing firm and its relation to that of a profit maximizer. The 
conclusions which emerge are not particularly surprising—the utility max¬ 
imizer’s output (audience size) will be the larger, and its output will be in¬ 
creased by subsidy—even if the subsidy is a lump sum. 

The econometric models also cover a wide variety oi subjects: theater cost 
functions, consumer demand functions, voter characteristics, magnitude of 
government assistance, and the effects of grants on attendance. The relatively 
simple forms taken by the relationships used in estimation are fully explained 
by the complexity of the subjects and scarcity of data. More difficult to explain 
is the rather persistent failure of the authors to provide interesting and 
testable hypotheses. It is as though the statistical calculations were undertaken 
as ends in themselves rather than as means to get at more fundamental issues. 
Nevertheless, some very interesting results emerge. For example, the study of 
the effects of grants indicates: 

Firstly, grants to drama have been more effective at the margin in 
.securing greater attendances than have grants to opera, ballet and 
music. Secondly, the marginal unit of subsidy in drama has had a 
relatively higher effect on aliendanrcs at low levels of grams per 
company, declining as the levels of subsidy increased, although this 
diminishing productivity at the margin is not s<j evident m the case of 
the other art forms. 

It must be remembered that these results obtain at the margin m a 
situation where the existence of grants already accouni.s for a major 
part of attendances, especially in the case of opera, ballet and music. 
... In the case of theatre, however, the wider range of levels of 
activity and the more profound cITects of a relatively small amount 
of support on a small company give greater scope for grants to lead 
to increased attendances in the lower ranges, although at the other 
end of the scale (i.e., for the major subsidized companies) the margi¬ 
nal unit of grants is less effective in increasing attendances for 
I easons similar to those discussed above in relation to opera, ballet 
and music. (Pp. 268-69) 

To summarize, this b(K>k is a very useful addition to the literature. It will 
prove indispensable to anyone conducting research on the subject. The au- 
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thors have given us up-to-date information, have contributed valuable in¬ 
sights, and have expanded the armory of weapons shown to be helpful. But 
their attempt to fit the analysis to the procrustean bed of standard economic 
techniques seems to have been pushed beyond the point of diminishing 
returns. 


Pnnrrton anri AVre York Unniersitifs 
Mathematifn, hu. 


William J. Baumol 
Hilda Baumol 



The Deterrent Effect of Antitrust Enforcement 


Michael Kent Block and Frederick Carl Mold 

Hruwer /mtitutwn 


Joseph Gregory Sidak 

Stnn/ord Imw Srhiml 


In tills paper we lormulate and lest a mnde! nf tolluMvc pricing in 
the presence of antitrust enfVircement. We show that a cartel's opti¬ 
mal price is likely to Ix' neither the competitive price nor the price 
that the tartel would set in the absence ol antitrust enfortement but 
rather an intermediate price that depends on the levels ol antitrust 
enlorcement ef forts and penalties. ()ur empirical results reveal that 
increasing antitrust enlorcement in the presence ofa credible threat 
of large damage awards has the deterrent effect of reducing mark¬ 
ups in the bread industty. 


Soon after the passage of the Sherman Act. the Supreme C’.ourt 
determined that hoi i/onlal minimum price fixing was so inherently 
injurious to consumer welfare that it should l>e illegal per se. Hori¬ 
zontal collusion has since becctme a major fexus of federal antitrust 
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enforcement.' Through the use of criminal and civil sanctions the 
Department of Justice (DOJ) has sought not only to remedy specific 
instances of price fixing, but also to achieve general deterrence of 
[K)teniial price fixing. 

I'his paper is the first systematic attempt to estimate the impact of 
antitrust enforcement on horizontal minimum price fixing. We de¬ 
velop a simple theoretical model of the collusive pricing decision and 
then, using data on the bread industry, assess empirically the deter¬ 
rent effect of public and private antitrust enforcement on the decision 
to collude. 


I. A Simple Model of Collusive Pricing in the Presence of 
Antitrust Enforcement 

.-1. Dvfinilinns and Assumptions 

We construct a relatively simple model to consider explicitly the ef fect 
of antitrust enforcement on the decision of firms within an industry to 
fix prices collusively. The model uses the following variables and 
definitions; 


p = price of output; 

Q. — Q.(p) - demand for the industry’s output; 

r s f;(Q) = total cost of industry output, including a 
normal rate of return; 
c = C{Q)IQ = average cost; 
me = dCldQ - marginal cost; 

X = (/I — mc)lnu:= markup over marginal cost; 

TT = total profits of colluders; 
y- level of enforcement efforts directed toward 
detecting horizontal collusion; 

/z == t{p — rru:)Q— the combined civil and criminal penalty for 
price fixing, where t is the anticipated dam¬ 
age multiple; 

= <i(X, y) = the probability that a collusive pricing 
scheme will be detected. 

The definitions of /•' and d are essential to our analysis and warrant 
further discussion. 

Our specification of the penalty function, F = t{p — mc)Q^, reflects 
that under current statutes a price-fixing conspiracy is sutyect to both 
criminal and civil sanctions.* The Clayton Act’s private treble dam- 

' Pissncr (1970, p. 398, table 23) reports that 989 of the 1,551 Department of Justice 
antitrust ta.ses between 1890 and 1969 contained charges of horizontal conspiracy. 

* The maximum criminal sanctions for price bxing are imprisonment for 3 years, a 
hne of SKKI.tlOO per individual, and a fine of $1 million per corporation. We do not 
consider injunctive remedies in this model. 
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age remedy is the most formidable civil sanction and clearly relates to 
markups, as our specification of F reflects. If criminal sanctions are 
small—as they usually are—then civil sanctions provide most of the 
sting of antitrust enforcement, and our specification of F accurately 
describes the true penalty for price fixing.® Moreover, if, when 
sentencing price fixers, a district court judge considers the size of the 
markup to be an index of the cartel’s perniciousness, then F is still 
reasonably descriptive on the rare occasions when criminal fines and 
sentences are significant relative to private treble damage awards. 

In the presence of uncertain antitrust enforcement the decision to 
collude depends not only on the magnitude of the penalty but also on 
the probability of detection, which we assume increases with the 
markup, [dd^k, y)]lidy) > 0. The closer colluders come to the price 
that they would charge if no antitrust enforcement existed, the 
greater their chances of being discovered. Two observations support 
this hypothesis. First and most obvious, higher markups make cus¬ 
tomer complaints to the DOj more likely.^ Second, as Stigler (1968, 
pp. 268-70) has noted, a positive relationship probably exists between 
a conspiracy’s detectability and its ability to prevent cheating by indi¬ 
vidual colluders. f or example, a joint-sales agency would assure a set 
of colluders strict adherence to a monopoly price, but so visible a 
collusive device would be nearly assured of detection. In general, we 
hypothesize that the more efficiently a collusive device produces mark¬ 
ups, the more likely it will be detected and the colluders convicted.* 
Hence, as we have assumed, the very technology of collusion makes it 
likely that the probability of detection increases with the markup.* 

We also assume that the DOJ never charges noncolluding firms; 


’ A comparison ot the rnagniiude of criminal ami civil penalties appears in Set . 11. 
though a suhsiantial increase in potential criminal sanctions was legislated late in 1974 
under the Antitrust Procedures and Penalties Act. there is little evidence of its elTec- 
tiveness in increasing realired sanctions (sec Burnham 1978). 

* Hay and Kelley (1974) find that complaints by customers are the third most numer¬ 
ous method of detection among IX5| price-lixing cases. 

* Stigler (1968) presents .some indiret t evidence on this point. In a sample of [K>J 
cases he divides the types of collusion between what he considers rffinml and mrffinrnt 
collusion, fhe average time from inception of alleged collusion to complaint is 
sigiiifitanlly shorter for the efficient forms of collusion. 

•Our study of the bread industry revealed, as tentative empirical evidence of the 
relationship between the probability of detection and the markup level, that the prob¬ 
ability of being investigated for price fixing was positively related to the markup level. 
We assumed that a lag exists between the price-fixing derision and its detection; then, 
using a one-period lagged value of our markup indicator as an explanatory variable, we 
estimatetl a logit model of the probability that DOJ would initiate an iiivesugauoii for 
bread price fixing. The estimated intercept for our 2()8-observauoii sample was - 2.49, 
and the estimated coefficient on the lagg^-tnarkup indicator was 5.69 with a standard 
error of 2.43. When the logit function is evaluated using the sample mean values, this 
coefficient implies that a 1-perccnugc-point increase in the markup increases the 
probability of an investigation by 5 percent. Sec. II elaborates on our specification of the 
markup and the nature of our sample. 
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d{0, y) = 0. We assume further that DOJ does not necessarily detect a 
pure cartel price with certainty—that is, ■/)<!, where X*" is the 
markup that colluders would choose if no antitrust enforcement 
existed. 

B. Antitrust Enfurcement and Optimal Collusion 

The objective of collusion is to set a price that maximizes expected 
cartel profits.^ To model this price-fixing decision simply, we impose 
three additional assumptions. First, firms produce output under con¬ 
ditions of constant marginal and average ctist so that mr = r. Second, 
all firms take part in the joint-pricing decision. Third, the most 
significant cost of any collusive device is its impact on the probability 
of detection. For example, the major cost of a joint-sales agency is its 
visibility, not its resourc e costs. This third assumption implies that, in 
the absence of antitrust enforcement, firms could duplicate perfectly 
the price and output levels of a monopoly." 

Formally, the colluders set the price p by maximizing the expected 
value of their objective function, En. Fhat is, the colluders 

max Ett = (1 — ci)7r, + diti = Z, (1) 

where tti ^ {p - f )(l{p) is the profit level if the colluders avoid 
detection, — [p — (+ X.0K^(/t) is the profit level if the colluders 
are detected, and \ = (p — c)/c is the markup under constant costs. 
The necessary condition for an internal maximum is 

Z^ = - UOSP) + IQ(f’) + (p- ry)\p)K\ ~ dt) = 0. (2) 

oX 

Ftptalion (2) has a simple yet intriguing implication: If, as we have 
assumed, the markup significantly affects the probability of detection, 
then neitfier the competitive price (X = 0) nor the pure cartel price 
markup (X = X*’) can satisfy equation (2).“ Antitrust penalties, there¬ 
fore, do not necessarily eliminate price fixing. I'hey are, however, 
likely to reduce the optimal markup.'® 

' I his as.sumcs that ihc- lolUiders arc risk neutral. An analysis that allows lor risk 
aversion is presented in Blixk. Nold, and Sidak (1978) The results of that analysis do 
not differ substantially from those presented here. 

" As long as we restrict our attention to industries with a history ol conspiracy, 
considering intraindustry cheating would complicate the model without signiheandy 
adding to our understanding ol collusive pricing in the presence of antttrusi enforce¬ 
ment. 

* The pure cartel markup is the markup X'' saUsfying the condition Q_(p) + (p - c) 
OJtp) = 0. 

‘“If the detection probability were not sensitive to the markup level, then the decision 
to price fix would be a simple eilher/or proposition. In this case, as long as (I - dt)> 0, 
the collttders would set the price at the pure cartel level. Clearly no price fixing would 
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As long as price fixing is a favorable gamble for some markup, (1 — 
dt) > 0—and our assumptions assure it will be—then risk-neutral 
colluders obviously will not choose the competitive price. Also, al¬ 
though the “unfairness” of the price-fixing gamble at the pure cartel 
markup, [ 1 - y)t\ < 0, is sufficient to dissuade the colluders from 
choosing \ = A'', it is not necessary. In general, colluders will stop 
marking up when both [Q(p) + (/» - cK2'(/')] and (1 - dt) are positive. 
I'o deter colluders from chcmsing the full cartel markup is rather 
easy; to deter all collusive pricing in a conspiracy-prone industry is 
nearly impossible. 


C. The Effects of Changes in Enforcement Efforts and Penalties on 
Collusive Markups 

I'o assess the effect on markups of increases in enforcement efforts, 
we consider the effect of changes in y on the optimal A, or dk/dy. 
Totally differentiating etpiation (2) with respect to A, we obtain 

where is the second-order derivative. Only in the rather perverse 
case where an increase in enforcement actually reduces the impact of 
the markup on the detection probability—that is, where {d'^d)l(dkdy) 
< 0—does an increase in enforcement have an ambiguous effect on 
the markup. In most cases of practical importance we expect that l)OJ 
efforts designed to increase the probability of detection also reduce 
the markup. 

Increasing price-fixing penalties is often advocated as an efficient 
means to achieve deterrence." The effect of such an increase is 
expressed by the comparative static derivative; 


dk 

dt 



kit + <i[il{p) +(p 



(4) 


which indicates that a higher penalty unambiguously decreases the 
optimal markup. In summary, an increase in either the probability of 


incur if (1 - dl) < 0. Enforcement efforcs would determine siinpl) whether or tiot it 
paid lot a cartel to fix prices. If it did. colluders would set markups at the full cartel 
level; if it did not. they would not collude. ITiis mixtel, in which the probability dices not 
flepend on changes in would make the empirical analysis somewhat more 
sliaightforwatd. However, such a fitrtnulation would be less descriptive t>l the actual 
siitiaiion confronting colluders. Formally, our assumptions do not preclude a corner 
soUiiion at X = X''. If the deicciion probability displays a limited respirnsiveness to 
markups, then cq. (2) may not be saiishcil as an equality. However, it is unlikelv. given 
the technology of collusion and the existence of punitive sanctions for price fixing. i.e..t 
^ I, lor (1 ~ dt) to be positive at X = XC 
” Sec Sec. Ill; see also U..S. Office of the Presidcni 1969. and Elzinga and Breil 1976. 
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detection or the penalty for price fixing can be expected in most cases 
of interest to reduce the collusive markup. 


II. Empirical Findings 

Our theoretical model suggests that increases in enforcement levels or 
penalties for price fixing generally reduce collusive markups. Though 
straightforward to derive, this implication is hardly trivial to test. 
Unfortunately, we do not have a set of national, or even regional, 
industries having identical products, costs, and demand conditions 
but varying levels of antitrust enforcement. We must assess the impact 
of antitrust enforcement in a more problematic environment. 


A. C.hoice of Indxishy 

I'o test the implications of our deterrence model, we analyzed the 
market for white pan bread, a homogeneous commodity not only 
regionally produced and consumed but also well represented among 
DOJ price-fixing cases. During our sample period, bread cases were 
the most common among DOJ’s food price-fixing cases.'* In addition, 
the bread industry has well-recorded annual input and output prices 
compiled for selected cities by the Bureau of Labor Statistics (BLS) 
(see U.S. Deparimetit of Labor, Bureau of Labor Statistics 1964-76, 
1965-76).'^ Hence, we constructed a sample that enabled us to use 
Ixilh cross-sectional and temporal variations in product prices, costs, 
and antitrust enforcement. 


B. Esltmaling (Aty-speafic Markups 

Developing an indicator for price markups in the bread industry 
involved two steps. First, we examined a standard recipe for a loaf of 
white bread.''* We subtracted from each bread price observation the 

rhc FI t: (ibserved that the bread-baking and dislribulioii industry has “the essen¬ 
tial fharacieristic of a conspiracy-prone industry—relatively few sellers in individual 
markets" (U-S. Federal Trade t^ommission 1967, p. 135). 

1 he BTS publK'atJOns contained comparable price data for white bread for 20 
major tiucs. For 12 of these cities the data went back as far as 1955, although for the 
remaining eight they extended only to 1968. Input pnee data, though not so complete, 
proved adequate to construct a sample of 228 observations: 12 major cities for 
1964-76, and eight additional major cities for 1968-76. Clities included throughout the 
entire period (1964-76) were Baltimore, Boston, Chicago, Cleveland, Detroit, Los 
Angeles, New York, Philadelphia, Pittsburgh. St. Louis, San Francisco, and 
Washington, D.C. For 1966-76, we were able to add Atlanta, Cincinnau, Dallas, Hous¬ 
ton. Kansas City, Minneapolis, San Diego, and Seattle to the sample. 

“ The cost of ingredients (IC) for a l-pound loaf of white bread is IC = .bSSOPf + 
.057IP, + .0026Po + -OSSP,,, where Pf is price/pound of flour, P, is price/pouncl of 
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I ABLE 1 

Effect of Noninorediem' Input Costs 
ON Adjusied Price (PADJ) 


Independcnl Variables 


PELEC .256* 

(4.62)t 

PGAS 2.49 

(3.15) 

LABOR 1.07 

(7.14) 

Constant 7.05 

Observations (A/) 228 

«« .49 

F-statistic (3.224) 72.93 


* ] he csltnidlnl uyefHuent 

t [ he vahie (if (he euiinated CdcIfKieiK (ItvKiot hy its estinialcit standard emtr 


component atiribuiable to the cost of the ingredients, 1C, calculated 
from the recipe and from BLS ingredient prices. Formally, we 
defined the recipe-adjusted bread price, PADJ, as: 

PADJ,, = p„ - 1C„. (5) 

where i is the city and t the time index.'* Differences in profits and 
other noningredient input costs cause PADJ to vary across cities and 
over time. 

As a second step in developing a measure of markups, we estimated 
the amount of variation in PADJ attributable to variations in energy 
and labor costs, fable 1 contains the results of this analysis; PELEC 
and PGAS are mea.sures of electricity and natural gas prices, and 
L,\BGR is a measure of wage rates for truck drivers. We assume this 
wage to be a proxy for general labor costs.'® fhe results in table 1 
suggest that variations in energy and lalror input costs account for a 
reasonable amount of the variation in adjusted bread prices.*' 

su({ar, P„ ihe pritc/ptmiid of vuokiiig oil. and /*» is ihe prite/poiiiid of dry milk. All 
prite.s are retail, except dry milk, which is wholesale, (kioking oil is used as a proxy for 
shoriening (U.S. Executive Office of the Presicicni 1977, p. 7). 

'' The sources of price data were U.S. Depanmcni of l.abor. Bureau of I.abor 
Siati.slics (196.5-76; 1969). 

Specifications u.sing foexi and kindred workers’ wages as well as aliernatiye func¬ 
tional forms were also estimated. 1 he sc different specifications )ieldc‘d similar results 
(see Bl.xk ct al. 1978). 

" Of course, measures of gcxxlness of fit for this regression subsianiiallv understate 
the iiripoilance of variations in input prices since the dependent variable. PAUJ. is 
already net of the recipe or ingredient. Our entire prcxredure, taking account of bcKh 
1C and the estimated cost function, accounts for approximately 82 percent of the 
variation in the price of bread (/)). In general, prices for all signiheant inputs arc 
UK luded in cost functions. Our asymmetric treatment of input prices amounts to using 
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l o construti a measure of the markup on bread, we then used the 
recipe costs and the results of this regression to estimate M, the 
markup indicator: 

M = 1/1 - (NIC + IC)J/(N1C + IC), (6) 

where all subscripts are suppressed and NIC refers to the fitted values 
generated by the equation in table I.’* Equation (6) estimates the 
markup by first subtracting from the market price the sum of the 
known ingredient costs (IC) and an estimate of other noningredient 
costs (NI(i), and then dividing this difference by estimated unit costs. 
The higher the residual as a proportion of unit costs, the higher the 
markup. 


C. EsUmaltrig the F.lfect of DOJ Antitrust Enforcement 
on Cnlhisive Markups 

Antitrust enfortemcrit and penalties are only two of the many fac¬ 
tors that actually determine markup levels.** Certainly, all factors 
that influence the price elasticity of demand also influence the mark¬ 
up level. Moreover, although we have a.ssumed antitrust penalties to 
be the only cost of collusion, the resource cost of managing and 
policing a cartel also influent es the optimal markup in the presence of 
antitrust enforcement. Assessing the effect of antitrust enforcement, 
therefor e, requires a tvorkable method of tontrolling for such outside 
influences. Influences generated by aspects of market structure that 
do not change rapidly—or at least that change significantly less 
rapidly than the antitrust variables—tan be controlled by considering 
nf>t the markup lesel but changes in the markup level.We thcrefor'c 
u.sed first differences in the markup level, or 

AM,, = M„ - A4„_„ (7) 

in testing the effectiveness of aniitrusi enforcement.'*' 


a prion iiilorniation—(tie rcci|)e lor breat!—to supplant csliitiatioii of some 
coeffitiems It tnay be argued ifiac tfiis prcxeclure increases the efficiency ol the 
estimation, but it precludes substitution among raw materials and other inputs. 

'* Because a simple icgression generates NIC, approximately half die observations 
foi NIC exceed PAU| and thus generate a negative M. For our purposes, these 
negative values pose no prciblem.s. 

We siiuetured our lest ol anlitriiM enlorteinent aiound markups rather than the 
iiaditional—but indiiec I —lest of overall profitability or rale of return. One rea.son for 
using the markup was the very directness c,( the lest. Also, the availability of regional 
and c ity data enablcxl us to coiiduc ( more powerlul te.sis of the ef feetiveness of antitrust 
enlorteinent than could have been peiformed with available data on rales of return. 

“’Controlling for these laciors by actually assembling the relevant information on 
market slrutlurc for this industry at the city or SMSA level was not feasible. 

This prexedure also facilitated our use below of a straightforward measure ol 
regional antitrust enforcement. 
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1. Measuring Detection Probabilities, DOJ Enforcement Efforts, 
and Remedial Ef fects 

We assume in our formal model of collusion that for any markup level 
the probability of detection is a function of DOJ enforcement efforts. 
Unfortunately, this straightforward theoretical proposition does not 
suggest a unique empirical counterpart for measuring either the 
probability of detection or DOJ’s enforcement efforts. This formula¬ 
tion does suggest, however, that for any markup level the probability 
of detection is related to the capacity of the Antitrust Division to 
initiate cases. To the extent that the Division’s litigation capacity 
relates to its expenditure level, the Division’s annual budget should 
provide an indirect measure of this enforcement capability.^* 

A more direct measure of enforcement is simply DOJ price-fixing 
prosecutions. A price-fixing pro.secution is rare enough for colluding 
firms in the affected industry to take special notice.^* We have as¬ 
sumed, therefore, that each price-fixing case that the DOJ brings 
against a bread producer increases, for any markup level, the prob¬ 
ability of prosecution perceived by other bread producers operating 
in the same DOJ region.*^ 

We had sufficient price data to estimate changes in markups for 
1965-76. We constructed a regional antitrust enforcement variable, 
DOJREG, for this period by setting the variable etjual to one for each 
t ity within a region where the Antitrust Division filed ati action that 
year—except for the city incurring the action—and by setting the 
variable equal to zero otherwise.** In other words, DOJREG is a shift 

A pdsilive and sigiiifkant rclaiioiiship exists between the .Aniiirusl Division’s an¬ 
imal budget and the number of prue-fixiiig <ase.s brought by the Division. L’sing data 
on priic-fixing lases during lt)64-7tj supplied by the Antitrust Division’s Eionoinit 
Policy Otfice. we estimated the relationship: 

DOIPK = -11.39 + .OOSBUDGhI, 

(2.42) 

between the number ot pike-bxing cases hiotighi anniially by the Division (1W>JPF) 
and its budget (BUDGE. 1) measured in thousands of 1967 dollars (2.42 is the (-statistic). 

“ We ran provide a formal rationale lor this observation by assuming that colliiders 
use Bayesian methods to estimate the probability that they will be apprehended in a 
p.ti licular |)eriod. In this formulation, whenever colliiders are apprehended, colluders’ 
estimate of the probability n( apprehen.sioii increases, and that inrrease is ilramalic if 
their a priori dislnbuiiun is diffuse and has a small mean. Also, after an iniual nnpacl, 
the effect of the case on the apprehension prolrability estimated by the colluders 
deteriorates if no new rases are brought. 

I'he Antitrust Division operated seven regional offices during the years of our 
sample; New York, Philadelphia, Glcveland, Ghicago, San Krancisio, lajs .Angeles, and 
Atlanta. The Antitrust Division established a Dallas office in 1976 ami realigned the 
areas covered by the regional offices. This study used the earlier seven regions for 1976 
rather than realigning all regions for that single year. For coverage of regions, see U.S. 
Department of Justite, Antitrust Division (1973). 

” 'rhis variable was constructed using data from the Gonimerce C'.learing House 
(19.'S5-7.5; 1966-76). A summary of those data appears in Blitck et al. (1978). Filing 
dates and other summary information were cheeked against a special listing of bread 
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variable designed to capture the changes in the perceived prob¬ 
abilities of cartel failure. Its form presumes that the colluders’ esti¬ 
mate of the probability that DOJ will initiate a case for collusion 
increases after a DOJ price-fixing action within the same region.^® 
The variable DO JREM measures the effect of the antitrust action in 
the city where the Antitrust Division actually prosecuted a bread 
producer. In constructing DOJREM, we assumed that the impact of 
the antitrust action on the firms specifically prosecuted would differ 
significantly from the impact on firms in other cities in the same 
region. Basically we assumed that, for strategic reasons, the timing of 
the reduction in markups by prosecuted firms would not coincide 
with the reduction made by firms in nonaffected cities. To capture 
this effect, we set DOJREM equal to one in a city 1 year after the 
Antitrust Division had filed an action there. 


2. Estimated Detenent Effects 

In table 2 we present our estimates of the ef fect of DOJ enforcement 
variables on markups in the bread industry. Again, the dependent 
variable, A VI, is the annual change in the markup on white bread for 
the cities in our sample. The results in table 2 strongly suggest a 
deterrent effect of DOJ enforcement efforts.*^ First, the coefficient on 


pnce-tixing cases picpared for us by tile Economic Policy Office, Antitrust Division, 
U.S. Depaitmcnt of Justitc, for the years 1963-76. 

An altcrnaiive hypothesis would be that, since DOJ has limited resources, a prosc- 
cuiioii in a reypon signals that an additional prosecution in the same region is unlikely. 
Hence a firm actually would reduce its estimated probability of deteclioii after a DOJ 
lase m its own region. This hypothesis is cquivalenl to predicung that drivers on a 
highway accelerate when they sec a tar already being ticketed. 

Our foimulaiion, which [xisits that increases in the markup im reasc the detection 
probability, generates the structural equations: Ad — ( + ^AX + tABUDGET + 
TyDOJREti and AX = i/( + aAd + fllK)|REM. 1 here fore, the reduied forms are 

Ad - 01 +- - -ABUDGE r + -—3-DOIREC; + —DOJREM 

1 - a/3 I - a^ ' 1-0/3 

and 

AX = p + —A BUDGET + . - DOJREG + -DOJREM. 

Our estimated reduced-form coefficients in table 2 are of the form (ot)/( 1 — a/3), 
(ai 7 )/(l - afi), and (9)/(l - o/3). We arc mainly interested in deducing the sign of o 
given the sign of the reduced-form coefficients. The usual approach would be to 
estimate the other reduc ed-form equation in the system which takes Ad to be a function 
of the exogenous variable. Unfortunately this approach cannot be applied directly to 
this problem. We do not have a proxy for Ad or measures of the detection probability at 
a regional level. Even at a national level measunngd is problematic. However, given our 
formulation of the determinants of detection, if we assume that DOJ expenditures have 
some efficacy—i.e., t > 0—then the condition necessary to deduce the nonpositivity of 
a from the reduced-form coefficients is that (1 — o/3) > 0. This condition is required for 
Walrasian stability in the collusion market and in conjunction with our reduced-form 



ANTITRUST ENFORCEMENT 


439 


TABLE 2 

Estimated Effects of Changes in DO] Enforcement on 
Changes in Markups in the Bread Industry, 1965-76 


Independent 

Variables 

ABUDCET 

-.015* 

-.015 

-.024 

-.020 


{-2.74)t 

(-2.68) 

(-4.06) 

(-3.65) 

DOJREG 

-.025 

-.026 

-.025 

- 027 


(-2.05) 

(-2.21) 

(-2.09) 

(-2.26) 

DOJREM 


-.046 

-.046 

-.044 



(-2.32) 

(-2.41) 

(-2.32) 

AFOODM 



+ .058 





(2.33) 


AGENM 




-.010 





(-1,60) 

Cionstant 

.01 ] 

.013 

.014 

017 

R' 

.055 

.082 

.113 

101 

F-statistic 

5.93 (2,205) 

6.04 (3,204) 

6.47 (4,203) 

5.68 (4,203) 


Norc —F^ch r^greMion t« ba*ed on 208 ob*ervauon«. 

* rhta coefficMtK is csumaied per million <lomn 

t The value of the estimaicd coefTiaeni divided by lU ettinuued sundard error 


our general measure of changes in enforcement capacity—the change 
in the real value of the Antitrust Division’s budget, ABUDGET—is 
negative and significant. In other words, an increase in the enforce¬ 
ment capacity of the Antitrust Division appears to reduce markups on 
white bread. Second, the coefficient on our direct measure of DOJ’s 
enforcement activity (DOJREG) is negative and significant, suggesting 
that a price-fixing case against bakers in one city induces bakers in 
neighboring cities to reduce markups. I his result comports both with 
our formal theoretical results and with the related conjecture in the 
Stigler report that "every victory” in seeking out price fixing “weakens 
the efficiency of undetected collusion.”** Finally, the coefficient on 
DOJREM, the variable measuring the remedial effect of a price-fixing 
case, is negative and significant. Once discovered and prosecuted, 
colluders apparently "remedy" their price fixing by reducing their 
markups in the following year.** 


results implies that there is a deterrent efIeiT. i.e.. a < 0. In addition to this theoretical 
restriction on the sign of (1 — a/3), we noted previously dial there is a positive 
relationship between the total number of price-fixing < ases brought by the .Antitrust 
Division and the Division’s budget level. If we can assume that other collusion-prone 
industries made derisions about collusion in the same way as bread producers, then 
this aggregate regression gives information about the second set of reduced-form 
coefficients. Again, if t > 0, the sign of the reduced-form rocfficicm on ABIIDGET 
indicates that (1 - afi) is positive. 

*• See U.S. Office of the President. President Nixon’s Task Force on Prtxluctivity and 
fompetition 1969. 

” It is only necessary to assume that DO] capacity is produedve (t > 0) to infer from 
our empiric^ results that ij > 0, or that filing of a D<^ J case in the region increases d. 
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The AFOODM and AGENM variables in table 2 control for general 
year-to-year variations in manufacturing markups that might not be 
adequately controlled by the first-difference procedure. I'he variable 
AFOODM is an annual series of the first differences in markups of 
food-and-kindred-product manufacturers; AGENM is an analogous 
series for all manufacturing firms.^® The coefficient on AFOODM is of 
the expected sign, suggesting that, holding enforcement constant, 
markups in the bread industry move in the same direction as markups 
in other food-related industries. Althotigh using the general man¬ 
ufacturing markup (AGKNM) in the equation does not alter the 
estimates of the coefficients on the enforcement variables, the sign of 
the coefficient on AGKNM is curious. 

D. ('.lass A(lions and the Effect of Antitrust Enforcement on Maikups 

Historically, trial judges have ptinished price fixers leniently.'” The 
cases collected for this study were no exception. All but two rtf the 17 
bread price-fixing ca.ses between 19.57 and 1976 involved nolo conten¬ 
dere pleas, and in only one case did a defendant serve an actual prison 
sentence; moreover, total fines as a percentage of the pretax profits of 
the colluding fit ins averaged only 7 percent.®'^ Neither imprisonment 
nor monetary penalties posed a credible threat to colluding fil ms. We 
hypothesized, therefore, that the deterrent effect of DOJ’s enforce¬ 
ment efforts came not from the threat of publicly imposed fines or 
impri.sonment, but fiom the increased likelihood of an award of 
private treble damages to bread consumers or distributors.“ 


Boili senes wei e < oinputecl Irom data reported by the Federal 1 rade (’.otnmission 
111 (huirterlf Finrmnat Kepart firr Manufcuhmng, Mining, nml Tnuir C.iiipiiraUons, in U .S. 
('ouncil of htonornic Advi.sers, Efonomic Rfp<rrt of Frestdenl (annual). Of (ourstc. boih 
maikup measures arc iiifluciKcd by aniiirust <*iilon<‘n)cnt cfforis to the extent that 
c<illusion is important in these tnatuifailuring industries. 

F g.. during 1966-7() over 85 penent of all price fixers who were convicted or who 
pleaded nolo amU'ndErr did not serve prison terms (sec Block et al. 1978). 

For (he 16 (riiiiinal cases involving cither a noh pica or a lorivutioii the actual 
( riminal fines for prue fixing were: 



Mtiiimum 

Maximum 

Average 

Individuals (S) 

1.031 

9,966 

4.025 

Funis ($) 

Fines, as ^ of defen- 


50,63H 

20,690 

dants' annual sales 

09 

1.0 

.31 


All fines are in J 976 dollars. Data on the rauo of fines to sales were derived frfim court 
d<Kumenis relating to 13 cases (76 firms) during 1957-75. I he overall ratio of net 
profits to sales was obtained from a study by the U.S, Executive Office of the President, 
Council on Wage and Price Stability (1977). 

“ Civil atiiotis alleging horizontal price fixing arc po.ssible but. certainly in this 
industry, uncommon without a preceding criminal case. 
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Since bread price fixing generally causes a small injury to many 
individual distributors and consumers, private damage recovery is 
usually feasible only through a class-action suit, Class actions enable 
plaintiffs who are numerous, and whose independent damage claims 
are too insignificant to justify litigation, to maintain a single action for 
their aggregate darnages.^^ District court documents revealed that 
settlements in class actions for price fixing in the bread industry were 
almost 10 times greater than government-imposed fines. 

To test the hypothesis that private class actions actually provided 
the effective penalty in price-fixing cases, we partitioned our sample 
into the periods before and after class actions became a credible threat 
in the bread industry. Since only one class action in our sample did 
not follow a DOJ case, we assumed in partitioning the sample that 


“ t hese adions, brought under Rule 23 of the Federal Rules of tuvil Piocediire, 
have beioine far more fiequent sinre the Supreme (k>un amended that rule in 1966. 
t he amended Rule 23 r oiuribuied to this greater frequency by c hanging the procedure 
for liectiniing a tneinber ot the ( lass. Originally, Rule 23 required persons to "opt into" 
the class before they could benefit from the adjudication of the < lass's cause of aciion; 
the amended rule instead prestiinctl (lersons 10 be class members unless they affirma¬ 
tively “opted out" of the class. Although surprisingly few historical data exist on class 
actions, the data that do exist—particularly the information on docket entries for the 
.Soiitheiii District of New York collected by the American College of I nal l-awyers" 
Special Ciommuiee cm Rule 23—suggest that the 1966 amendments to Rule 23 made 
die class action a much more aiiractivc legal device (American taillegc of t rial Ijwvers 
1972) A recent alternative to the antitrust consumer class action is the parens patntu 
device by which a .state attorney general may sue on Irehalf of the consumers in his slate 
For the purposes of deterrence,/wretu patnae actions arc virtually identical to class 
actions (see BIcxk et al. I97H), 

“■'A .search of Commerce tilcaring House (1955-75; 1966-76), Mcl.aughlm (1976), 
Newberg (1977). and the I.F.XIS computer file (1978) revealed seven major class actions 
lor price fixing in the bread industry since 1966. Act 01 ding to di.strici court dexumenis, 
clas.s-ac iiun setliemenl.s or damages during 1971-76 obtained for price bxing in the 
bread industry were. 


Minimum Maximum .Average 


Cases ($) 

Firms ($) 

Damages as '/< of defen¬ 
dants' annual salcrs 


1,197.810 6,100,000 

39.562 1,220.000 

.41 19 68 


1,998,646 

293,919 

2 87 


.\ll (mes are in 1976 ciollar.s. fhese figures are based on data for five of the seven 
recorded cases. One of the seven original class actions had no settlement or award since 
the district court refused to certify the class No data at all were available on another 
case. All hvc cases were settled rather than litigated to judgment Details of these class 
actions appear in Block el al. (1978. appendix table VI). Whereas average damages as a 
percentage of defendants' sales were only 10 limes average fines as a percentage ol 
defendants' sales, average damages per firm were almost 15 times average fines per 
firm, i'hi.s occurred because the average sales per defendant differed significanth 
between the twn samples: $5,795,271 for the fine calculations and $10,‘254,205 in the 
cla.ss-action calculations. 






JOURNAL OF POUTICAL ECONOMY 


442 


TABLE 3 

Esi IMATKl) Em,CT OK IX)J ENFORCEMENT AND CLASS ACTIONS 

ON Markups in the Bread Industry, 1965-76 


Independent 

Variables 



ABUDGE n 1 

-.018* 

(-.622)t 


ABUDGF.ri2 

-.014 

(-2.68) 


ABUDGE12I 


-.002 

(-.158) 

ABUDGF.r22 


-.019 

(-3.53) 

DO|REt;i 1 

+ .004 
(.160) 


nOJRFGl2 

-.037 

(-2.70) 


IKyjRF('.21 


-.019 

(-1.03) 

nOJREG22 


-.029 

(-1.87) 

IK)I REM 

-.046 

-.042 


(-U.43) 

(-2.19) 

t'.onstanl 

.013 

013 

Observations (.V) 

208 

208 


.0926 

.1024 

F'.siatisnc 

4.13 (5,202) 

4.60 (5,202) 


• This toethcicTit i« cstmMitnl per iiHlIton (iolLtrs 

t Ihe value ol ifie eMima(e<l <<tefhctrcti dividt^l by lu evtim<aed Mandard error 


( lass actions affected primarily the penalty cost of detection, not the 
probability of detection.^® We considered as partition dates the year 
(hat a district court first certified a class in a bread case (1 970) and the 
year that the Administrative Office of the Courts began reporting 
class-action activity (1972). Before 1970 class actions probably were 
not a credible threat in the bread industry: by 1972 no doubt re¬ 
mained that class actions had become prevalent and important. To 
accomplish the actual partitioning of the sample, we split the en¬ 
forcement variables ABUDGET and DOJREG around 1970 and 1972. 
In table 3, ABUDGETl 1 and DOJREGl 1 are ABUDGET and DOJ- 
RKG for 1965-69, while ABUDGET12 and DOJREG12 are the vari¬ 
ables for 1970-76. The comparable variables partitioned around 


To deiermine when, after the 1966 amendments to the class-action provisions in 
the Federal Rules of Civil Procedure, ihe class action became a credible threat in the 
bread industry, we searched lor both recorded and unrecorded class actions involving 
horizontal price hxing of bread products. None of the seven class acunns we found was 
hied belore 1968, actually certified before 1970, or settled before 1971. 





ANTITRUST ENFORCEMENT 


443 


1972 instead of 1970 are ABLIDGET21, ABUDGET22, DOJ- 
REG21, DOJREG22” 

The estimates in table 3 are consistent with our hypothesis that class 
actions represent the effective penalty in price-fixing cases. This re¬ 
sult does not seem to depend upon whether we partition the en¬ 
forcement variables around the date of the first class certification or 
around the date of the first reporting of class-action activity by the 
Administrative Office of the Courts. For either partition, only in the 
latter period, when class actions represented a credible threat, did a 
significant deterrent effect result from either an increase in the Anti¬ 
trust Division’s resources or from the actual prosection of a horizon¬ 
tal price-fixing conspiracy.** 

I’he pattern in the bread industry is clear. A successful federal 
prosecution signals to consumers that a treble damage suit has be¬ 
come feasible. Private plaintiffs subsequently provide the effective 
penalty in the form of class actions for treble damages.** For price 
fixing in this conspiracy-prone industry—and, we suspect, for price 
fixing in general—deterrence has been a product of both public and 
private enforcement efforts.*® 

*'Formally, the definitions of the variables are: ABL'DCETil = ABL'DGE l', 
1966-71, zero otherwise; JlBUDGEr22 = ^BUDGET. 1972-76, zero otherwise; 
IK)JREG21 = DOJREG, 1966-71 , zero otherwise; DOJREG22 = DOJREG, 1972-76, 
zero otherwise. 

** While the e.sciina(e.s in table 9 do not include the markup controls, AF(X)1)M and 
ACEN M, regressions with these < ontrols yielded similar results. Several factors possibly 
confound this analysis of tlie deterrent effect of class actions. First, the average criminal 
fine imposed by district courts might have increased over the pericxi, or might simply 
have been larger when class actions became a relevant concern. To rhet k for this 
possibly confounding influence, we estimated time trends for several measures of 
criminal fines. Overall, the evidence suggests that an increase in criminal penalties dwf 
not confound our results on the deterrent effect of class-action suits. Second, the early 
1970s included a period of pnee controls, and iX>JREC12 and/or ABUDG.ET12 possibly 
proxied for the effect of these controls. We lest^ whether the enfortement variables 
performed this role by including directly in the markup regression a dummy variable 
for price controls. The results of this procedure suggest that, although price controls 
had a significant depressing effect on markups, the enforcement variables were not 
merely a proxy for price t ontrols. Finally, we controlled for pure time trends asscx iaied 
with changes in demand or market structure within each city. This was accomplished by 
regressing the dependent variable (AM) against city dummies along with the deterrent 
variable. The effect of the enforcement variables remained significant and unchanged 
and nearly all of the city dummies were statistically insignificant (sec Blixk et al. 1978). 

** A nolo plea apparently is a sufficient signal. Although guilty verdit is are pnina facie 
evidence in a treble damage suit and should induce private enforcement, they are rare 
in price-fixing cases. In our sample of 17 price-fixing cases, one case ended in an 
aiquittal and another in a conviction; the other l.'i involved nolo pleas. Yet even a 
governmenl case that ends in a nolo plea signals to potential private plaintilTs that they 
I>| oljably would prevail in a damage suit against the alleged price-fixing conspiracs. In 
fact, all the reported class actions in our sample that followed a government acuon and 
received class certiflcation eventually obtained a settlement award 

The amendment of Rule 23 in 1966, ol course, was the event which iadliiaied class 
■letioris, thereby increasing the power of the federal government to deter price fixing. 
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In this paper we formulated and tested a simple model of collusive 
pricing in the presence of antitrust enforcement. We showed that if a 
cartel’s probability of detection increases with its markup, then the 
cartel’s optimal priie is neither the competitive price nor, in most 
cases, the price that a cartel would charge in the absence of antitrust 
enforcement, but rather an intermediate price that depends on the 
levels of antitrust enforcement efforts and penalties. 

Our empirical results revealed that increasing HOJ’s enforcement 
capacity or filing a DOf price-fixing complaint had the deterrent 
effect of reducing markups in the bread industry. We noted that 
government-imposed price-fixing penalties were trivial and found 
support for the proposition that the effective deterrent to price fixing 
was the < redible threat of large damage awards to private class actions 
that followed IfOJ’s case against the same conspiracy. Consequently, 
only after class actions became a credible private remedy did the 
Antitrust Division’s enforcement capacity or its filing of a bread 
price-fixing case deter collusion in the conspiracy-prone bread in- 
ciustiy. 
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The Market Evaluation of Human Capital: 
The Case of Indentured Servitude 


David W. Galenson 

Ihiii'mity of C.lniaj’o afut (iahfomta Institute of Technology 


This paper examines the market for human capita) created by the 
institution of indentured servitude in colonial America. The inden¬ 
ture system allowed English emigrants to obtain passage to the 
colonies by selling claims on their future labor. With the size of the 
debt approximately equal for all emigrants, the length of the term 
for whu h a servant was bound is predicted to have varied inversely 
with expected productivity in the colonies. Analysis of two collections 
of contracts made in the seventeenth and eighteenth centuries sup- 
pot ts the prediction. Age, skill, and literacy were negatively related 
to length of indenture. Women received shorter terms than men at 
young ages, white servants bound for the West Indies and those 
Ixiund in periods of high colonial demand for labor also received 
rediKtiotis. 


I. Introduction 

During the colonial period of American history, two institutions 
existed which provided for the explicit valuation of stocks of human 
capital in the market. One of these was slavery, under which blacks 
and their progeny were held in service for life. The other was in¬ 
dentured servitude, under which whites were bound to service for 
limited periods of time. 


I am gratelul lo Stanley Engerman for discussions of many of the issues treated in 
ihi.s paper and comments on an earlier draft. I would also like lo thank Andrew Abel, 
(lary Becker, I.ancc Davis, Roliert Fogcl. Richard Freeman, Russell Menard, Frederic 
Mishkin, .Melvin Reder, Sherwin Rosen, T. W, Schultz, GeorgeStigJer, and participants 
in seminars at Columbia University, the University of Chicago, the University of 
Illinois, the Newberry I.ibrary, the University of Southampton, and the 1980 Cliomet¬ 
rics Conference for their suggestions and comments. 
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As a result of both its direct importance for the history of the 
nineteenth century and its indirect importance for the history of the 
twentieth, slavery has long been one of the central concerns of Ameri¬ 
can social and economic historians. However, in the labor markets o( 
many regions, for substantial periods during the first half of the 
colonial era, indentured servitude was of greater quantitative impor¬ 
tance than slavery.* In the eighteenth century the quantitative 
significance of indentured servitude declined in most of these areas, as 
slaves were substituted for servants in the sugar fields of the West 
Indies, in the tobacco fields of the Chesapeake colonics, and in the 
rice fields of South Carolina.^ Yet indentured servitude nonetheless 
continued to perform an impiortant role in the colonial labor market 
by providing skilled craftsmen and managers to the large plantations 
of the West Indies and the .southern mainland colonies.® 

The essential difference between servitude and slavery was that it 
was the labor of the servant, rather than the person, which was bought 
and sold. Becau.se less attention has been devoted to the study of 
indentured servitude, we are less familiar with the history of the 
institution. An economic investigation of its functioning, based on the 
analysis of quantitative data generated by the operation of the market 
for servants, can therefore serve a dual purpose: It can provide 
evidence of how capital values were established when claims on the 
labor of humans for long periods were traded in the market, while at 
the same time increasing our understanding of the economic basis of 
white servitude in colonial America.* 


II. The Indenture Bargain 

Indentured servitude was a credit system under which human labor 
was leased. It functioned through two markets linked by a recruiting 
agent. In England, in the first market, a prospective servant signed a 

' An example is Maryland, where a study of probate inventones has indiraied that as 
late as 1674-79 the ratio of servants to slaves held in estates was 3.88 (Menard 1977A, p. 
.360). It should be pointed out that in some areas neither type of bound labor was 
quantitatively important. Slaves never accounted for more than 3 percent of New 
England's total population in the colonial period, and the share of servants was 
probably of a similar magnitude (Greene 1942, chap. 3; Abbot Emerson Smith 1947, 
pp, 28-29). A generalisation that held for a number of major colonial regions is that the 
share of bound workers in a region's labor force tended to rise as one traveled south 
from New Emgland, with progressively increasing shares in the middle colonies, the 
Chesapeake, South Carolina, and the West Indies, for evidence on populaiion 
composition, see Greene and Harrington (1932), Sutherland (1936), and Wells (1975). 

* For accounts of this process in the West Indies, see Dunn (19<3); for the 
Chesapeake, see Menard (19774); for South Carolina, see Wood (1975). 

“On this change, sec Gray (1958, 1:350) and Parcs (1960, p 19); for additional 
evidence see Galenson (1979o, chap. 10). 

* The present paper extends the analysis and empirical findings of Galenson (19774). 
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contract, or “incienture,” with a merchant, promising to serve the 
latter or his assignees in a particular colony for a given period under 
stated conditions. The servant was then transported to the specified 
colonial destination, where the merchant or his representative sold his 
contract to a colonial planter or farmer in the second market. In 
return fm' the commitment of his labor, the servant received pas.sage 
to the designated colony, maintenance during the term of the con¬ 
tract, and certain f reedom dues at its conclusion. Once signed, the 
indenture was negotiable property, and at any time before its conclu¬ 
sion the servant could be sold to a new master for the balance of his 
term. When the contract expired, the servant became free, '/he con¬ 
ditions of sers’itude were regulated by colonial statutes as well as by 
agreements written into the contracts. 1 he terms of the contract were 
binding u|ron both master and .servant. 

The indenture systetn normally operated within a context of com¬ 
petitive markets both in England and in the colonies. Servants were 
one importatit available backhaul cargo for English ships engaged in 
the ttade for colonial sugar, tobacco, rice, and other agricultural 
staples.’ (iontemporat y fare quotations indicate that the charges for 
passage from England to America were uniform at a given time for all 
individuals and did not vary by specific colonial destination.* 

The planter’s demand for indentured servants was ba.sed on his 
calculatioti of the discounted value of their net future earnitigs, after 
deducting the expected costs of the servant to him. I'he present value 
of the servant to the planter therefore depended upon the expected 
value of the servant’s output in each year of the contract; the expected 
tost of maintenance, supervision, and training for the servant during 
each year of the term; the discount rate; and the value of the freedom 
dues to be paid to the servant. These, or analogous variables, are the 
same considerations w'hich enter into the derivation of a free worker’s 
net age-wealth profile (Becker 1975, p. 223). Yet one critical differ¬ 
ence is that, whereas for free workers evidence on flows is used to 
calculate the values of capital stocks, in the case of servants these 
calculations were performed by planters, who based their demand for 
stocks of bound labor for fixed terms on their calculations involving 


'See Bruce 1907, 1,622; Stniih 1947. p. 39; and Middleton 1953, pp. 145-56. 

* While £6 was the fare cited in the early colonial peritxl, after the middle of the 
scveiiieenih ceriiiiry £5 was the fare normally quoted tor passage to all colonies. For 
references to quotations of passage charges from England to a numlrer of colonies, see, 
e.g., John Smith (1624, pi. 1, p. 162), Purchas (1625, pt. 4, p. 1791); Bulltx-k (1649, p. 
47); Taunton (January4, 1670); Wilson (1682. p. 19); Lnileton (1689, p. 17); jeaffreson 
(1878, pt. 2, p. 102); King.sbury (1906, pt. I. pp. 277-78); Abbot Emerson .Smith (1947, 
p. 35); and Alexander (1972, p. 45). A qualification to the stateinenl in the text is 
possible seasonal variation in the tost of delivering servants to stime colonies, discussed 
below. 
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the relevant flows. Therefore, like the slave market, the market for 
indentured labor produced capital values, and the flows underlying 
these must be inferred. 

The in.stitutional arrangements which provided for the ownership 
of human capital for discrete periods produced potential differences 
between the patterns of human-capital values under indentured ser¬ 
vitude and those implicit in the operation of free labor markets. An 
advantage of the indenture system for the planter, relative to the 
hiring of free workers, was the greater control it gave him over 
the servant’s time and effort once the bargain was made. Because the 
servant would have been compensated for his loss of freedom, in the 
absence of uncertainty, the price of indentured human capital would 
have been higher than that of free workers.’ However, in practice, 
uncertainty makes the direction of the inequality between these prices 
unpredictable without additional information, for it cannot be deter¬ 
mined a priori whether the insurance value of the contracts was 
normally greater to the planter or the servant.** 

All servants who migrated to America incurred debts of similar 
value. As noted above, passage charges were uniform for all servants, 
and maintenance costs and freedom dues varied little across indi¬ 
viduals. As a result, every servant contract was a promise to repay 
approximately the .same sum of money. Therefore, the present dis- 
(ountecl value of every servant’s contract should have been approxi¬ 
mately the same at the time of binding. Since the productivity of 
servants varied, the conditions of their indentures had to vary. The 
higher tlier servant s marginal-value product above bis maintenance, 
the faster he could repay the loan made to him, and the shorter the 
term of the contract. The length of indenture across servants 
should, therefore, have been negatively correlated with individual 
productivity or, equivalently, with the market valuation of the current 
flow of income generated by the individual’s slock of human capital. 


' 1 his asMiiiies a solution to the problem ol inomtoriiig the scrvmu's vsork in whirh 
the prodiitiiviiy gains I'roiii the master's tonirol over the servant's time and effort were 
not otfsel by shirking. 

“ In a world in which futures loiitracts for free wage labor could be made with certain 
fullillmenl, the present value of an imlenuire lor a given number of years and of a 
series of contracts for hires tor the same years would dilfer only by a premium which 
woulfl reimburse the servant for the loss of liecdom resulting from his residence in the 
master's household and due to other legal provisions governing servitude However, in 
the absence ol these guaranteed futures coniratts, in some cases the master might also 
have been willing to pay more for an mtlenluie because ot the assuranee it gave him of 
latxir supply in [xiak scastins tir fuiure years; similarly, workers might sometimes have 
been willing to accept lower implicit wages m return lor the guarantee of employment 
the indenture represented. The relative cost of this insurance to master and servant 
roiild vary, and as a result the relation between the iniplicii wage paid to servants and 
the hire rate (dr free workers is indeterminate. 
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Strictly, if costs had been precisely the same for all servants, all 
should have been bound on conditions that would yield the same 
expected price on arrival in the colonies. This, of course, does not 
imply that no variation should have occurred in the realized auction 
prices of servants in the colonies, for random disturbances between 
the time of binding and auction—such as illnesses of servants during 
the voyage or changes in the price of colonial outputs—could have 
produced differences between actual and expected prices. What the 
analysis does imply, however, is that if the market for contracts of 
servitude was efficient, variation in servants’ auction prices should 
have been uncorrelated with all the characteristics of the servants 
which were known at the lime of making the indenture bargain.® 

Very little evidence of the prices of newly arrived servants at colo¬ 
nial auctions has survived: From the entire colonial period the only 
known records of auctions of English servants come from the ac¬ 
counts of two ships, the Trislam and Jeane, which sold 68 servants in 
Virginia in late 1636 or 1637, and ihe Abraham, which sold 56 servants 
in Barbados in January 1637 (London, Public Record Ofhce, 1636, 
1637), The listings of the amounts paid for the servants show that the 
median and modal prices of both men and women were identical— 
.500 pounds of tobacco—in both auctions. Thus, although as will be 
seen both sex and colonial destination had a significant effect on the 
length of servants’ indentures, the available evidence, though limited, 
suggests that neither may have affected the initial colonial auction 
prices for the servants. This is consistent with the hypothesis that the 
variable dimensions of the contract were adjusted for the servants’ 
characteristics so as to make the expected auction prices of all servants 
the same. 

I he relation between auction prices of servants and the simple cost 
of passage is also of interest. Fhe modal price of 500 pounds of tobacco 
observed in the auctions was 11 percent greater than the fare that two 
free passengers each paid for passage to Virginia on the Tristam and 
Jeane on the same voyage as the servants and 4 percent above the fare 
quoted elsewhere for passage from England to Maryland in 1638.'® 


" This suieineiit neglects one potential element of cost. 1 he marginal cost of delivery, 
anti therefore auction prices, would have included any costs the merchant incurred in 
rccruiiing servants in England, these could clearly vary across individual servants, 
pioduiing differences in expected auction prices, these costs could have been corre¬ 
lated with individual prcxluctivity, as in some instances skilled servants were given 
lump-sum payments at the time of binding. For some evidence of this practice, see the 
analysis of the Middlesex sample in the Appendix. 

the two free passengers each paid 450 pounds of tobacco (London, Public Record 
Office 1637). For the 1637 Che.sapeake larm price of tobacco, see Menard (1975, p. 
475). The fare quotation, of £6 sterling per person, is in Maryland Historical Society 
(1889, p. 206). 
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That in these cases the typical auction prices tended to be higher than 
the fare may have been due to the cost of recruiting servants, or to the 
existence of a premium received by merchants for bearing the risk of 
servant mortality on the ocean voyage. 

Related evidence on the nature of the labor-market equilibrium in 
the colonies which induced flows of labor from England is available 
from surviving valuations of indentured servants recorded in Mary¬ 
land for probate courts. The mean price of 28 male servants with 4 
remaining years of servitude recorded on Maryland’s lower western 
shore during 1704-57 was £8.95, with a median of £9 and a mode of 
£10; while the mean price of 19 females with 4 years remaining was 
£7.75, with a median of £8 and a mode of £10." The typical marginal 
cost to an English merchant of delivering servants to the colonies 
cannot be estimated precisely but probably fell within a range 
bounded at the lower end by £5, the usual cost of passage, and at the 
upper end by the £10 estimated by Abbot Emerson Smith as the 
tnaxitnutn expen.se of delivery.'* Ihe evidetice of the probate 
valuations—although again limited in quantity—suggests that the 
central tendency of the colonial price of .servants did fall within this 
range and, therefore, offers additional support for the hypothesis 
that the price of a servant’s indenture in the colonies was equal to the 
iTiarginal cost of delivering labor there from England. 

A number t)f other general considerations relating to the adjust¬ 
ment of the bargain deserve mention. One is the role of mortality. 
The smaller the probability of a servant’s serving a given year of his 
term, the lower his expected net earnings and, ceteris paribus, the less 
favorable the terms of the contract he would be able to sigti. 1 his is 
true whether the mortality in question is that during the Atlantic 
crossing, when the merchant bore the risk, or that after arrival in the 
colonies, when the planter had purchased the contract and assumed 
the risk.** 

A second factor with a similar effect was the possibility ot a servant 
failing to serve out his term for a reason other than death, principally 


" I am (^raieful Ui ihe .St. Mary'.s C'.ity tamiinisMon, Aiinaptills. Maryland, lor tran- 
stripiions of the probate price cjuotatioii’f, from Maryland Hall of Record.s. Most ot 
ihc quolatiun.s arc from the early decades of the jteritxi; on devaluatiuii ol Maryland 
curiencv and its rcfleclioii in proliate valuations, see Main (1972, pp. I4-1S) and 
MtCiiskcr (1978, pp. 189-204). 

.Smith 1947. p. 37. A prctise analysis would include a positive preiniuin m the 
piobale valuations, for the.sc servants had iioi'inally been "seationcd,'’ i.e., had spent a 
year in the new colonial disea.se environnienl. and consequently had a longer life 
t-xjvectancy than the new arrivals, ceieris paribus. 

Mortality rales differed among colonies, and the cflecis t>l this on the length of 
tnclemure will be discussed below. The assumption here will be that inorialiiv rales 
among servants both during passage and in the colonies were not systematically related 
to individual prexluttivity. 
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running away. All the colonies enacied legislation intended to dis¬ 
courage servants from running away; of these some were preventive 
measures and others punitive. While some colonies provided for 
corporal—and even capital—punishment for runaways, the most 
common penalty was extension of the servant’s contract by some 
multiple of the time he was absent.'■* One of the provisions of the 
contract, the servant s freedom dues, constituted a nonvested pension 
and, therefore, also acted to discourage servants from running away. 
While the form and worth of the dues varied across colonies and over 
time, they were often of substantial value and could have constituted a 
significant deter t ent to servants who considered escaping f rom their 
masters.'-'^ 

Servants were not aflowed to marry during their terms. Since, by 
K.nglish practice, the expense of raising the illegitimate children of 
servants (ell on tfie county, colonial legislation provided that the 
father should he discovered by oath of the mother and that he should 
reimburse the county for the expense of raising the child until it could 
be bound out to work. Since servants coultl not normally pay this sum, 
either the master would pay it and the servant’s term would be 
extended, or the servant would be bound over tt) the county for an 
additional tei tn af ter the conclusion of his normal term, to be sold for 
the necessary amount. I'he mother’s term was also extended to reim¬ 
burse the master for her lost working time. 

III. Empirical Results 

1 he print ipal souttes of ({uantitative evitlent e bearing on this market 
for human capital are servant contracts rectirded and held in English 

For ificifHcfs It) la\NS lo luiiawa^s. see Semines (I9vi8. pp. 1and 

Siiiitli (11H7, pp ‘i()4-70). The suonj; tei ms of ihis colonial legislanon weic an inipor- 
uiiit <Iemeiii in die infleimire sysieni's siuccss, .iiul some atiihors have aij^ued ihal 
thange.s in ihc proMsions for ihe enfoiTcrnenl of (ontratLs weie central lo iLs 

decline in ihe early inneteemh tcniiiiy: see, e.g., Gerser 1901, p 4^. Similarly, the lack 
ol success of die attefiipi to re\ive a conirati lalxir system during 1804-85 lias been 
attrihiiied in pari to du* high cosi of enforcing tonirac(s ihiough civil action; see 
Knekson 1957. pp. 40-48. 

On freedom dues, see Smith (1947. pp. 238-41) and Hcavner (1976, pp. 50-51) 
For a discussion of an analogous provision, nonvested pensions as a firm's insurance 
against cjUits, see Bee ker (1975, p. 34) f he nature of freedom dues has some implica- 
ijons ff;r other dirneii.sions of die servant coniraci. The dues were specified by colonial 
law and were equal for ail servants in a given colony 1 herefore, the discounted cost of 
the dues If) die plainer al the time he purebased a coniracL varied inversely with the 
length of the corn ran. One effen of the fixed nominal value of the freedom dues niav, 
therefore, have l>een lo reduce the* amount of variation in the length and other 
dimensions of the contracts, since the vanaiions in ihe latter weic inicnded to equalize 
the net present values of all contracts This jKiinl should not obscure another basic 
effect of the existence of freedom dues, for, ceteris paribus, they raised tile cost of 
servani.s and therefore leiidcd !o lengthen ihc term of indenture. 
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courts, rhese were made both to protect servants from kidnapping 
and to protect merchants from false charges of kidnapping, The two 
largest known surviving collections will be analyzed here; the earlier 
set, recorded in Middlesex during 1683-84, covers a total of 812 
individuals, while the later, recorded in London during 1718-59, 
covers 3,187 servants. 

The principal variable analyzed here will be the length of the 
indenture. Furthermore, this analysis will be done only for the minors 
in both samples; these comprise 22 percent of the servants with 
known ages in the earlier and 67 percent in the later sample. I he 
selection of the variable to be analyzed and of the minors follows from 
consideration of the evidence of the contracts. It is clear that 4 years 
was the normal duration of an adult’s indenture in both samples, yet 
for a combination of legal and clerical reasons the full set of condi¬ 
tions of adults’ servitude do not appear to have been recorded in 
either set of indentures. For minors, the full conditions do appear to 
have been recorded, and the duration of the contract appears to have 
been the chief variable dimension of the conirat ts. Both variations in 
freedom dues and restrictions on the servants' occupations in the 
colonies were rare, C.ash payments made to adults were not generally 
recorded in the later sample, but the contracts of the minors, on 
which they do appear to be recorded conscientiously, show that 
fevvei than 6 percent received cash payments. Some cash payments 
arc recorded for both minors and adults in the earlier sample, and 
although it is uncertain whether payments were reti>rded in all cases 
ill which they were made, less than 7 percent of the contract.s contain 
such entries. A notable feature of the payments recorded in both 
samples is that viruially all—95 percent in the earlier and 97 percent 
in the later .sample—were made to servants bound for 4 years. Insofar 
as payments were made (or promised) and recorded, analysis of the 
characteristics of servants w'ho received them tends to remlorce 
the re.sults obtained from the analysis of the length of indenture.'* 
1 he evidence of froth samples, therefore, indicates that for minors the 
greatest variation in the conditions of indenture occurred in the 
duration of the term of servitude, while for adults this was not the 
case, as 4 years was both tlie standard term for adults and normally 
the minimum term assigned.'^ For adults, cash payments were appar- 

See the Appendix. 

terms of less than 4 years do ixeur, but they .ip|)ear to base In'en lare alter the 
niidsevente«'nih (ciuury; tiuis, ihey actouni lor only (1.5 yiencnt ot all indeniuie.s of 
known lenyph in the Middlesex rcgi.straiionsol 1683-84 and t 2 penem of ihose in the 
l.ondon re);istrations of 1718-59. The reastnis for thes arc ihm known. Four years may 
have lieeii llie term required at most tunes and places for the a\eray?r adult to repay the 
tost ot pas.sage out ol his net earnings. Inn it is uiielear why sborier lenns were not 
more often given to highly skilled servants like the acrouniant James (atrss, yvhom 
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ently substituted tor reductions in the length of servitude below 4 
years. 

Karlier analysis suggested that a servant’s term of indenture would 
be inversely related to the market valuation of his stock of human 
capital. This index of the servant’s human capital can be related to a 
number of observable characteristics potentially relevant to the de- 
lerminatitm of the present value of that stock. When this is done by 
multiple-regression analysis, the estimating equation differs from the 
common hedonic method only in the use of an index for price. The 
estimated coeflicients of the independent variables age, sex, literacy, 
and occupation can be interpreted as the marginal prices paid for 
servants' characteristics in units of the index, while those of desti¬ 
nations represent compensating differentials among regions.'* 

Table 1 shows a number of the basic relationships underlying the 
market valuation of the .servants. In both samples, the length of 
indenture was negatively related to both age and skill: With other 
things eejuat, servants with skilled occupations and those able to sign 
receivetl shorter terms. Women received shorter terms than men, 
ceteris paribus, and servants bound for the West Indies received 
shorter terms than those bound for the North American mainland. 

A comparison of the estimated coefficients of the sex variable across 
samples indicates that, on average, women received considerably 
larger reductions in their terms in the 1680s than in the eighteenth 
century. This decline in the premium for females is not surprising in 
view of the getierally declining colonial sex ratios during this period, 
for while women were preferred for some kinds of household work 
and some types of farming, their increasing relative availability in 
most colonies wtiuld be expected l»> lower the size t)f their wage 
differentials."' 

Table 2 piovides a more detailed analysis tif the length of indenture 
for the later sample, allowing separate age profiles of length of inden¬ 
ture by sex and skill. It reveals that there was a tendency for women 
to receive intlentures from 5 to 15 percent shorter than those of men 
through the age of 17, while for .servants aged 18-20 there was no 

Walter Tullicicpli jieiii to the manager of hi.s plantation in Antigua in 1759 with a note 
slating tliai he "hath Ixiund himself to serve me four years agreeable to the Laws of 
Antigua, but as he is 22 years of Age, he thought it hard to serve so long and for that 
reason, I have given him a Oniftcale that he is to be absolved from the last year’s 
.Servue” ( Tullideph 1759, vol. 3). That planters prelerred to substitute salaries for 
reductions of the term behiw 4 years suggests the possibility that fixed costs of hiring 
and/or a desire to capture the returns from a servant’s general training in the colony 
may have tieen important considerations. 

On the interpretation of toefficients in hedonic price indexes, sec Rosen (1974, pp 
34-35), 

'• I he difference between samples in the sex coefficients in table I is significant at the 
.01 level. On declining colonial sex ratios, sec Wells (1975, pp. 156, 219, 244). 



TABLI-: I 

EsriMA'i'KD Regression CoEmciENrs, Middlesex and Lom>on Samples 



Middlesex, 1683-84 

London, 1718-59 


Estimated 

Standard 

K»(ima(ed 

Standard 

Inoepb-Nbeni Variable 

(joefficieni 

Error 

Coeffioeni 

Error 


Age (years):' 
lotal sample: 


Less than 15 

2.655 

.385 

2 749 

.134 

15 

2.201 

.400 

2.147 

.080 

16 

1.457 

.304 

1.304 

.068 

17 

.893 

367 

.728 

.062 

18 

.174 

.270 

.331 

.055 

19 

738 

.306 

.169 

.050 

Sox" 

-1.484 

,207 

- 195 

.073 

Literacy*" 

-.575 

.217 

-.082 

.037 

OaW 



-.0060 

.0023 

I'rade' 

-.727 

.445 



Farmer' 



- 313 

.074 

Laborer 



-.146 

.079 

Services* 



-..348 

.066 

Metal and construction" 



-.320 

.067 

Clothing and lexiiles' 



-.313 

060 

Antigua' 

-.227 

812 

- 403 

no 

Barbados 

- .553 

.274 

- 176 

154 

Jamaica 

- 398 

.462 

- 233 

.060 

Other West Indies* 

-.401 

1.094 

-.479 

.088 

Maryland 

203 

.209 

.306 

.059 

Virginia 



.127 

.073 

Other mainland' 

-.889 

673 

050 

116 

Cnnsiani 

5 227 


4.665 


R’ 

555 


539 


t 

12.87 


112 82 


n 

171 


2.049 



S<HtKcr used 4re all frtRn mordi of nunon (age if‘\5 (han 21) Middicfcx. Umdon. (*rca(rr 

l^fndori Rn<ird (Jffkr, NKholson (1965). Wamn{( (1976) Lundon. 1718.-59 lofwion. fa>qMratKm of ia>itdon 
KnoriU Office, KartiinkoM and kiimink<m ((964). Oalcruon (1977a) 

No IT —Dcprndcni wiahlf * number of veare indrnnirnl. meihod of eymiaiion uicd is tinlmarv lease squares 
ID ubies i and *2 

'for age vanabk’, indiuiird age > I. rem <Uu ■=> age 70 
‘ Male « 0, lemaJe - 1. 

' Marked = 0, aigiicd ■ 1. 

“ Dale rmrred as hiul iwo digiu t>f year of regmranoii 

' trade » () (or laborers and no recorded (Mcupaiions. trade ' I for all iNher men t (xru|Mii(a)s 
'foi all nicupaiiofial variablni, irxiaaied ixcupaiNiitfs) I. zero ebu mt reri>rded ixvupaooii "FarTner" 
itMludea husbarklman. plowman, eic 
' Includes accoununt, barber, surgeun, etc 
'' (ikludea hUcluraiih, carpeuier. cooper, mason, etc 
' Includes t'loihier, uilor. weaver, etc 

' for all deslinaiion vanaUes, for MkidfrseK »aiuj4r, /cn» ibss - Virginia, for Umdon sample zero class = 
f’eiinsdvaiiM 

“ (iK-Jude« Nevit, St (Ibnnopher, etc 
' Includes Carultna, New Vori. e(r 
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I ABLE 2 

EsilMAIF.l) Rk.RFSMON CotmCItNTS, Ix)NDON SaMPI-K, 1718-59 


I ndcpt-nilcni 
Variable 

Ajre (years); 
loul sample' 
Less than 15 
15 
1(1 
17 
IH 
19 

\VoJ7ien, age." 
Less than 15 
15 
It) 

17 

18 

19 

20 

I I.xle, age." 

15 

lb 

17 

18 

19 

20 

I. ileiass 
Date 
.\ntigtia' 

II . 11 bachis 
)amaica 

Oihei West Indies 
.Mar) land 
Other ttiainlaiKl 
Kebi iiai i 
.\lar< I) 

■Ipiil 

Ma> 

June 

July 

August 

Scptetiibet 

Os tuber 

Nuveiiiber 

llecctriber 

8ugar‘ 

Cunstatit 
« = 

F 

n 


Esilinatcd 
Cloef ficieiU 


2.97b 

2.378 

1.5(2 

.959 

.473 

.2b0 

- 1 034 
-.472 
-.909 
- .302 
.041 
.090 
.198 


- L41«) 
-.884 
-.502 
-.275 

- .224 
-.103 

- .070 

- .0093 

- .200 

- .005 
-.084 

- 363 
.194 

- 039 

- 100 
.171 

L56 

-.095 

-.431 

.196 

-.486 

.223 

-.400 

-.225 

014 

-.0162 
4 830 
.566 
52.00 
2.044 


Slandard 
F.i rur 


.144 

092 

.084 

.084 

.079 

.083 

.460 

.390 

228 

.161 

.1.59 

.135 

.156 

.392 
206 
.117 
.085 
073 
.076 
.036 
0024 
235 
268 
209 
.221 
.063 
.072 
.092 
.113 
1 19 
. 164 
.132 
.113 
098 
.096 
101 
too 

.097 

.0074 


—liMidon. (AirpomiiiHi of K«Tor<h (.ffhtr, K<iiiiiiik.ow .in<l K.«rninkow (]%4), («dicnioi> (1977(1) 

No I Fs - Jxre uWc I for vanabies ikk dcfuiwl hrrr 

* Frinalr agr inifraruonb, ir>rijcaicd vaiiabtr = I lor women 4if jpven age 

’’ 1 radc age interactions, indicated variable - I lor men of given age who molded a trade (as clef mod in table I) 

‘ For deacmatiom, zero class * Virginia 

** For rtioriihs. zero class = faniury Separate inter action terms lietwecri West Indian demnatioii arni month of 
regtstratton were iixJuded in the equation, but (heir coefhcients were genet ally small m value and are not reported 
''Sugar* average annual price of rnuscrjvacto sugar in lamdcm, m shillings per hundredweight, Lagged 1 year, for 
servants bound lor West Indies (Shendan fl974, pp "f96-97|, with linear inter(>oiaii(m for 1717-20 and 1727) 
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difference in the length of term by sex. In a suggestive parallel result, 
Robert Fogei and Stanley Engerman found that, excluding the value 
of childbearing, the net earnings of female slaves were greater than 
those of men prior to the age of 18, apparently due to the more rapid 
physical maturation of women.“ I'he sex differentials in the terms of 
young indentured servants might have resulted from the same 
source. 

The results of table 1 indicate that premia for skills were reflected 
in the length of servants’ indentures. The more detailed specification 
of the occupations presented in table 1 for the London sample indi¬ 
cates that the marginal premium paid for servants in each of four 
occupational categories—farmers, services, metal and construction 
crafts, and clothing and textile trades—was virtually the same.^' Un¬ 
skilled laborers received terms longer than servants with skilled occu¬ 
pations but shorter than those with no recorded occupations;^^ the 
latter result may indicate that some premium was paid for the labor¬ 
ers’ work experience. 

Economists have devoted considerable atieniion to the analysis of 
the relationship between productivity and age and have at cumulated 
much evidence on the association between age and wages in recent 
periods. Less is known of the nature of this relation in past times. It is, 
therefore, of .some interest to consider in more detail the implications 
of the estimated relationships between age and length of indenture 
for the age-earnings profiles of servants.^ fable 3 presents estimates 
of the relative annual net earnings of servants by age for unskilled 
and skilled men. The calculations are based on the a.ssumplion that 
the expected colonial sale price of each individual's contract was equal 
to the constant marginal cost of delivering servants to the colonies.®'* 
The relationship between age and net earnings is assumed to have 
been linear, making average net productivity during the term equal to 
net productivity at the term’s midpoint, fhe estimates of average net 


Fdgel and F.iigerinaii 1974. p, 77, see also Met/er 1975, pp. 136-37. Inieresiiiigl), 
evidence on the lioiirly e.iniiiigs of North Carolina loilon null crnployee.s in 1907 
indnates that average female earnings were above those of males through the age ol 15. 
et]ual at 16, and Ivelow male earnings thcrealtcr (Wnght 198(1, p. 6). 

None of the lout eoeffiiients is significantly different from any of the other three at 
the ,10 level. On the value of skilled servants in the colonies, sec, e g., Martin (1761, vol. 
4, lol 97, verso); Jeaffreson (1878, 1:186); and (ialcnson (1979n, pp. 314-19). 

“ 1 he coefficient of laborer for the larndon .sample in table 1 is significantly different 
tiom that of fanner at the .10 level lor a one-tailed f-iesi, lioin those ol ineial- 
1 oiisiuK lion and clothing-textiles at .05, and from that of services at ,02,5. 

It might be noted that the erratic behavior of the coelficicnis ol higher ages in table 
1 for the Middlesex .sample may have been due to the talsihrauon of the ages of some 
servaiiis. This may have re.sulletl from the legal rctjuiremenis under which the regis¬ 
trations were made; for discussion and evidence see Calenson (1979a, appendix to 
(hap. 3). 

** Possible sea.sonal variation in delivery costs has been controlled for in the ec^uation 
reported in table 2 
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productivity are derived from the following formulation of the mean 
present value of the contracts of servants of agej at the beginning of 
the term; 


£ 


NPt - u>, 

(1 + rV ’ 


where j\F = expected mean annual net productivity during the term; 
n = mean length of term for servants in each entering age group; 
w = mean annual wage payments made to servants during the 
term; r = discount rate. 

To solve for the value of ^^P for each entering cohort, the mean 
present value of the contracts was set equal to £10, an estimate of the 
marginal cost of delivering servants to the colonies. The mean age of 
each group at the time of binding was taken as the recorded age plus 
one-half year to allow for the rounding of age upon registration. The 
tnean length of term by age was derived from the coefficients of table 
2, while the mean annual wage payments were taken directly from the 
iitdenture contracts.The estimates were made with a discount rate 
of 10 percent. 

The estimates of table 3, which indicate that the net-earnings 
profile of skilled servants was steeper than that of the unskilled, are 
consistent w iih the normal positive relationship between the steepness 
of age-earnings profiles and training. The ages at which the servants 
considered here were bound, between 15 and 20, were prime ones for 
training in a wide variety of skilled crafts in preindustrial England, 
through either apprenticeship or less formal arrangements. It is, 
therefore, not surprising that the net productivity of those in skilled 
trades rose rapidly during this period of the life cycle. 

Skilled servants received a considerable premium: Even at age 15, a 


No wages were recorded for unskilled servants, I'he average wage payments 
ret eived by the skilled servants in (he lamdon sample hy age were as in the table below. 
I he present value of freedom dues at the time of binding varied across colonics and 
over (ime, according to differences and changes in legislation and with changes in the 
value of colonial curicncics and commodities. No explicit allowance has been made for 
the dues in the calculation because of the difficulty of estimating their typical value; 
inclusion ot the effect of (he lump-sum payment would lower the estimates of the net 
annual earnings of servants without changing their relative values by age. 


Age (Years) 

Mean Annual Wage (C) 

15 

.0 

Iti 

.40 

17 

.79 

18 

1.45 

19 

1.52 

20 

3.02 


I 
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TABLE 3 

Estimated Relative Net Annual Earnincis of Servants by Ace 



Unskiixed 


Skilled 

Age 

Mean Nel 
Annual 
Earnings 
(£) 

Relative 

Net Earnings 
(Age 22.9 = 1.00) 

Age 

Mean Nei 
Annual 
Earnings 
(0 

Relative 

Net Earnings 
(Age 22,9 = 1.00) 

19.1 

2.01 

.742 

18.4 

2.39 

.413 

19.7 

2.20 

.812 

19.2 

2.86 

.495 

20.4 

2.36 

.871 

20.1 

3 32 

574 

21 2 

2.52 

.930 

21.0 

4.08 

,706 

22.0 

2.60 

.959 

21.9 

4.22 

.730 

22 9 

2.71 

1.000 

22.9 

5.78 

1.000 


Noiy ~Cal<ulal«d from table 2 and n 24 See text for procedure Lhe uniluUed profile it calculated from die 
haste ajje proltle ol table 2, that ni the skilled from the base age profile combined with the skilled (‘irade~l 
coefficients 


skilled servant received a term 21 percent shorter than his unskilled 
counterpart. The existence of a sizable premium at such an early age 
could have been due in part to differences in the relative average 
work experience of the skilled and unskilled. Thus, possession of a 
skilled trade at any age implied prior work experience. The age of 
entry into the labor force for those men registered without occupa¬ 
tions cannot be determined, but it is possible that the typical age of 
entry for the unskilled into employment was that at which English 
boys normally left home to live in service, roughly 15. If this were the 
case, work experience and acquired on-the-job training might have 
accounted for a significant portion of the premium for skilled work¬ 
ers. T his would particularly be true for the younger servants, as the 
relative level of work experience of a skilled to an unskilled worker 
would be greatest at the lower ages observed here and would decline 
with age thereafter. 'Fhat the ratio of skilled to unskilled net earnings 
increased with age would appear to be strong evidence of the pres¬ 
ence of formal training for those in the skilled group. 

An interesting feature of the relative age-net-earnings profile of 
the unskilled shown in table 3 is its close resemblance to those profiles 
obtained by Fogel and Engerman for unskilled male slaves in the 
southern United States during 1790-1860 (1972, charts 3 and 4). In 
view of the considerable differences among these samples with re¬ 
spect to such variables as location and crops cultivated, the similarity 
of the shapes of the profiles might suggest the importance of physio¬ 
logical factors, particularly the rale of physical maturation, in deter¬ 
mining the change, with age, in the productivity of unskilled workers 
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under conditions of plantation agriculture in the eighteenth and 
nineteenth centuries. 

d'he relatively small premium paid for the ability tt) sign in the later 
satriple may have been due both tet the abundance of literate servants 
and to high literacy rates in the colonies. The decline in the size of the 
reduction of the term due to the ability to sign between the dates of 
the two samples may have resulted in part from a considerable in¬ 
crease in literacy amottg the servants, as only 35 percent .signed in the 
earlier sample compared with 67 percent in the later one.''*® 

In both samples, with other characteristics constant, servants bound 
(or the West Indies tended to receive shorter terms than those bound 
for mainland colttnies. That servants who immigrated to the West 
Indies received shorter terms to compensate them for their choice is 
consistent with the fact that, while both working conditions for ser¬ 
vants and economic opportunities for freedmen were known to be 
poor in the islands after the introduction of large-scale sugar cultiva¬ 
tion in the second half of the seventeenth century—with its attendant 
slave gangs and consolidation of small farms into large estaie.s—the 
mainland long continued to be considered a land of opportunity for 
poor immigrants, where freed servants could hope to own land and 
become prosperous members of soviet), A persistent theme of West 
Indian complaints appeared in a 1675 petition sent to the king of 
F.ngiand by the Council anti A.ssembly of Barbados: ‘In former tymes 
Wee were plentifully furnished with (Christian (i.e., white] servants 
f rom England .. . but now Wee can gett lew English, having noe Lands 
to give thetn at the entl of their tyme, which formerly was their main 
allttremcnt,” fhe highei mortality rates of the West Indian colonies 
decreased servants’ expected productivity and made them reluctant to 
go t<j the regiem. Httwever, in conjunction with the high piftcluctivity 
of labor in sugar production, those rates acted to raise the demand for 
new flows of replacement immigrant labor, d’hat .servants bound for 
the West Indies received terms shorter than those bound for the 
mainland, in spite of the higher mortality rates in the islands, implies 
that the marginal productivity of labor was higher in the West Indies 
than in the mainland colonies.^^ 

Both the lower estimated intercept for the later sample and the 
estimated negative time trend of the later sample indicate a secular 
decline in the length of indenture. 'I'he direction of change is consis- 


On ihc lelalion.ship between ability to sign and other aspects of literacy in this 
peiiotl, sec Schcjfield (1968, pp. 311-2.5). I'hc difference fsetween the coefficients of 
literacy in the two samples is significant at .10 for a two-tailed /-lest. On the servants' 
ability to sign, see Galenson (19796). 

London, Public Record Office I67f>. On relative mortality rates in the West Indian 
and mainland colonies, sec Wells (1975, pp. 280—82). 
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tent with a number of long-term trends, including rising reservation 
wages of servants due to rising real wages in England between the 
mid-1680s and the middle of the eighteenth century, falling real 
shipping costs, and declining colonial mortality rates which could 
have produced a secular increase in the colonial demand for labor.** 
I'he negative estimated trend further suggests the presence of a 
secular increase in real wages in the colonies during the period 
spanned by the.se two .samples. 

A consistent seasonal pattern in the length of indenture appears in 
the results of table 2, as servants bound for mainland destinations 
whose indentures were signed between June and November received 
sizable reductions in their terms relative to servants lx>und in winter 
and spring. Most of these servants were bound for Maryland or 
Virginia, and the observed pattern could have been due to the effect 
of the seasonality of tobacco production on the costs involved in 
supplying servant labor to the f^hesapeakc. 'fhe shipping patterns 
which resulted from the timing of the harvests dictated that the 
amount of backhaul space for servants was greatest in summer and 
early fall. Since servants were provided with food and lodging from 
the time they signed their contracts, the cost of delivering a servant to 
the colonies tnay have declined in peak shipping sea.sons because the 
more frequent departures of ships reduceci the average waiting time 
itt port between binding and sailing, i'he lower costs of the peak 
seasons could, therefore, have resulted in shorter indentures for 
servants bound in peak sea.sons than for those bound in slack shipping 
months.*" 

Another potential source of variation in the length of indenture was 
annual changes in the cttlonial demand for labor, fhe results pre¬ 
sented in table 2 indicate that the lagged annual average price of 
muscovado sugar in London had a significant and negative effect on 
the length of indenttires of servants IxHind for the West Indies during 
1718-,59; the estimated effect of a change in the price of sugar from 
IIS minimum to its maximum in the period, with other things equal, 

On English wages, see Ciilboy (1934, pp 219-25) and Phelps Brown and Hopkins 
(1956, pp. 302-13), On changes in colonial nioitaliq rales, see, e.g,, Menard (1977fi, 
1 >|>. 99 100) I'he organi/ation and ijuality »t iiiloi inatioii in the inaiket for coniracts 
may have improved during the jMTitxl spanned by the two samples. Ii is suggestive that 
the coelfic ictii o( variation ol the (enn of indenture among men of a given age was 
(onsiderahly lower in I he later than the earlier sample for six of ihc eight age gioiips of 
minors above the age of 12. ()n the relalion ol wage dispersion lo inforinalioii, see 
Sligler (1962). 

On I he seasonal ily of tobacco prcxluclioii and shipping, see Bullock (U>i9, p. 46): 
Msop (1666, p. ,51); and Bruce (19t)7, vol. I. pp. 622-24). On the costs ol maintaining 
servants Isetwcen binding and sailing, sec Smith (1947, pp. 36-37, .59 -65) and Scennsh 
Record Office (GD 23/6/98, nos 4, 14, !8). For factors relevant to .seasonality in the 
(oltmial demand for labor, see Mullin (1972. p. 15) and Morgan (197,5, p, 158). 
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was a reduction of about 5 months in the term of indenture. The sign 
of this effect would be the one predicted if, as appears to have been 
the case, high sugar prices normally resulted from high levels of 
demand for sugar rather than reductions in supply, for high sugar 
prices would then have tended to produce high demand for labor 
and, ceteris paribus, to shorten terms.“® 


IV. Conclusions 

I he price paid for human capital in the colonial American market 
for indentured servants varied systematically with respect to factors 
which influenced servants’ productivity, as economic theory predicts. 
All .servants incurred debts of similar value in immigrating to the 
colonies and sold claims on their future labor, in the form of inden¬ 
tures, to repay these debts. Characteristics which raised the expected 
productivity of .servants in the American colonies raised the market 
valuation of their human capital and, therefore, shortened the term 
for which the .servant was bound. Thus, servants with skilled trades 
and those able to sign served shorter terms than the unskilled and 
illiterate of similar age and sex. Women were found to have received 
shorter terms than men until the age of 18, perhaps due to their 
earlier maturation. The results also indicated that servants bound for 
the West Indies received shorter terms in compensation for their 
undesirable destinations. Servants bound during peak shipping sea¬ 
sons were found to have received reduced terms, perhaps due to the 
shorter average waiting time prior to departure in those months when 
backhaul cargo space was most abundant. Finally, the length of inden¬ 
ture of servants bound for the West Indies was found to have varied 
inversely with the price of sugar, suggesting that increases in the 
colonial demand for labor shortened the term of servitude. 

Among the issues which need further investigation are the precise 
reasons for the ob.served shapes of the age-earnings profiles of ser¬ 
vants. Additional research, including the collection of evidence on 
wage rales in the colonies by age and skill, may serve to distinguish 


“ .Similar analyses of the contracts of servants bound lor the (Chesapeake during 
1718-40 indicates that the lagged annual farm price of Maryland tobacco had no 
signihcant eflect on the length of indenture; in a regression equation which included 
the variables of table 2, the estimated ctxrfficieni of an interaction term between the 
price of tobacco, lagged I year, and Chesapeake destination was insignificant. Due to a 
greater tontinuing reliance on production of a single staple in the West Indies in the 
eighteentli century, the price of sugar may serve as a better index of the West Indian 
demand lor labor than does the price of tobacco for the Chesapeake. On diversification 
of agricultural production in the Chesapeake, see Clemens (1974, pp. 100-148) and 
Stiverson (1977, pp. 65-103). 
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and isolate the effects of such contributory factors as physical matu¬ 
ration and investment in human capital in prtxlucing the age- 
earnings relationships which lay behind the market valuation of in¬ 
dentured human capital. What the present research has indicated is 
that the application of economic analysis to quantitative evidence 
generated by the system of indentured servitude can provide infor¬ 
mation on the way in which the market once explicitly evaluated 
stocks of human capital and, in so doing, can yield new insights into 
the operation of labor markets in early America. 


Appendix 

For the Middlesex sample of 1683-84, a regression equation was estimated 
with the amount of the cash payment (in shillings, sterling) made to a servant 
as the dependent variable, with the same independent variables used in the 
analysis of the length of indenture. The sample used was that of all .servants, 
minors and adults, who.se contracts contained all the necessary information. 
The results are as shown in table Al. 

The hypothesis that all the coefficients are simultaneously equal to ?.ero can 
be rejected at the .01 level, but the proportion of the variance explained is 
small, as might be expected in view of the rarity of the payments. Only one 
coefficient is significant at the .01 level; Possession of a skilled trade, which 
table 1 shows to have reduced the term of an indenture, significantly raised 
the expected cash payment made to a servant. 

As rioted in the text, cash payments appear to have been recorded only for 
minors in the London sample of 1718-59. The form of the payments re¬ 
corded differs (rom that of the earlier sample: Whereas the cash payments to 
servants in the Middlesex contracts of 1683-84 appear to have been simple 
lump-sum payments made at the time of binding, those contracted for in 
London in the eighteenth-century sample were generally salaries to be paid 

FABLE AI 

Esiimailu Re(;iu-;ssion CotmeaKNTs. Miiwi.fsfx Sami-i.e, 1683-84 


Independenl 

Variable 

Kstiinated 

CyOefficient 


Standard 

Error 

Age (years) 

.on 


.038 

Sex* 

- 48.'> 


.456 

Liierary" 

-.443 


.363 

1 rade"’ 

1.466 


399 

West Indies'* 

088 


.3.54 

Unnstani 

.325 



«’ 

.026 



F 

4.03 



n 

760 



^)ua( E —London Greater London Record Office (MKldlr»e* Section) 


■«= 0, feniahr - 1 
'’Marked " 0, ugned • 1. 

‘^I.^aborer or no occupacion « 0: aU (Khrr ocnipadcHit * ) 
West Indian dcstinattoni » ), mainland • 0 
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I ABLE A2 


Es7iMa in> 

RtOKtssioN CowFicatN 1 s. London .Sampi.k, 

1718-59 

Inciepcndenl 

Estimated 

Standard 

Variable 

(loelficieiii 

Error 

Age (yeans) 

.144 

,0.54 

.Sex 

..S06 

.329 

Litcriu V 

.42« 

.167 

Trade 

1.397 

.190 

West Indies 

1.291 

.165 

( iorKstant 

-.3.141 


IV 

.094 


F 

42.20 


H 

2.046 


S<n F - Itintiot). C'<»riK»r. 

■iHtiti ot 1 RcioutA Ollur 


\()if —Ml Adridblfs clcriitetl as id giW^ AI 



.tmuially in local currency. A regression was estimated with the amount crl llie 
annual salary (in pounds IcKal curreiuy) as the dependent variable, with the 
same indefiendcnt variables used altove. (The dependent variable is the 
unadjusted value of the colonial currency. Although most of the colonies' 
currencies were devalued relative to sterling, the differences in turrency 
tallies across (he principal American colonies in most ot this period were 
small. .See .McCnsker 1978.) 1 he sample used was lhal of all minors (age less 
than ‘il) whose contracts contained all the necessary infoi ination. I he results 
were as shown in table A2 

1 he relationship is again statistically significant, and the pro|)ortion of the 
variance explained is again low. Age, literacy, possession of a skilled irade, and 
West Indian destinations arc all significant at .01 and positively related to the 
scivani's salatt; in table 2 it is shown that all are signihcaiiily and negatively 
associated with the length ol indenture. 

1 he results of the analvsis ol .servants' cash [taymeiits and salaries in both 
samples are c onsisteiit with the hypothesis lhal ihc.se were positively related to 
the sersanis' expected earnings in the colonies. I hese results iherefore rein¬ 
force the analvsis piesenied ol the length cif indenture. 
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The Welfare Cost of Capital Income Taxation in 
a Growing Economy 


Christophe Chamley 

Yalf Unn'entty 


The wflfarr cost ot capital income taxation is analyzed in a general 
equilibrium fVarnework, where the private sector is represented by a 
competitive household endowed with perfect foresight and an in¬ 
finite life. 'I'hc value of the welfare cost depends essentially on the 
elasticity of substitution between capital and labor in the production 
function. Numerical estimates are presented for different values of 
the parameters of the model. The welfare gain obtained by the 
abolition of the capital income tax is smaller when the private sector 
is not endowed with perfect foresight (it is reduced by about 40 
pet tent when expectations are myopic). The allocation efficient y 
ct)si t)f the corporate tax dwarfs the intertemporal welfare tttsl. 


A central issue in the current debate on tax reftirni is the efficiency 
co.st of a tax on capital income. Such a tax introduces a wedge between 
the prices of consumption at different dates and distorts the inter¬ 
temporal allocation of resources. 

Previous studies on the efficiency cost of the capital income tax have 
relied on simplifying assumptions which rule out important conse- 
cjuences of capital taxation. In general, the.se studies are of two types. 
The first (Krzyzaniak 1967; Sato 1967; Feldstein 1974a, 19746; 
Friedlander and Vandendorpe 1978) analyzes the capital income tax 
in the context of a neoclassical growth model with endogenous capital 
stock and factor prices but does not allow an optimal response of 
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household to changes of taxes. Typically the levels of consumption 
and labor supply do not depend on future prices and are not consis¬ 
tent with utility maximization. This method leads to results about the 
long-run incidence of taxation on aggregate variables; the transition 
to the steady state can be examined by numerical simulations. The 
second type of study (Levhari and Sheshinski 1972; Feldstein 1978) 
allows for maximizing behavior but assumes that the factor prices are 
exogenous and independent of the capital accumulation. The welfare 
cost of the capital income tax is measured by an application of the 
Harberger-Hicks-Hotelling formula (Green and Sheshinski 1979) to 
the intertemporal framework. 

The welfare cost of the capital income tax is analyzed here in a 
stylized intertemporal general equilibrium model (described in the 
next section). Issues about intra- or iniergenerational equity are ig¬ 
nored. Therefore, it is assumed that the private sector can be repre¬ 
sented by a household with an infinite life.* 

This household determines the level of consumption by the 
maximization of its intertemporal utility function. Future factor 
prices (wage and interest rates) depend on the accumulation of capital 
through a neoclassical technology and are known with perfect 
foresight. The household behaves competitively: Future (endoge¬ 
nous) prices are taken as given. 

The welfare cost of the capital income tax is analyzed in the second 
section. Following common practice, we consider the welfare cost 
induced by a lax with lump-sum redistribution. Initially, the economy 
is assumed to be on the balanced growth path where capital income is 
taxed at a fixed rate (and tax revenues are refunded). At lime zero, 
the tax is abolished (together with the refunds). Thereafter, the 
economy moves on a dynamic path toward a new steady state. The 
welfare cost of the capital income tax is equal to the welfare gain 
obtained by the abolition of the tax, namely, by the difference be¬ 
tween the level of utility on the new dynamic path (after the tax 
reform) and the level of utility on the initial balanced growth path 
(with the tax in effect).** As usual, the welfare cost of the tax is 


' It would he sut'licieiil 10 as.-iume that individuals' uliliUes depend on iheir coiisump- 
lion in iheir own finite lifelirne and on the welf are of iheir immediate descendants (for 
a development of this argumein, see Barro (1974)). If individuals do not leave a 
lieque.st, the problem of inteigenerational equity arrses, and the concept ol excess 
burden used heie does not apply; Even in the case where no revenue has to lie raised, 
the no-tax solution is not the lii st liest; the level of capital has no optimal profierty 
(Diamond 1965). In order to ledistiibule income among gcneialions. a benevolent 
vx;ial planner would impose a lax or a subsidy on capital income (and possibly on other 
goods). For an analysis of taxation in a general equilibrium model where individuals 
maximize a lifetime utility function with no bequest, sec Hall (1969), Diamond (1970). 
■Summers (1979), and Chamley (1980ft). 

* We could also consider the imposition of a lax on the initial balanced growth path 
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measured by a wealth equivalent and is of a second order with respect 
to the tax rate. A second approximation of this excess burden is given, 
which depends on the parameters of the utility and production func¬ 
tions and on the growth rate. An extension of the Levhari and 
Sheshinski (1972) result is obtained as a special case. Since the 
general excess-burden formula is exact only for infinitesimal values of 
the tax rate, the error of this second-order approximation is analyzed 
in a numerical example. 

The assumption of perfect foresight is relaxed in Section III. Be¬ 
cause the dynamic path after the abolition of the tax is no longer 
optimal, the welfare gain induced by the tax reform is smaller in this 
case. The case of myopic expectations is an important example of the 
more general class of expectations which are considered. In the fol¬ 
lowing sections we revert to the assumption of perfect foresight. 

The case where the tax rate on capital income is not identical for all 
sectors of production (an example is found in the corporation tax) is 
considered in Section IV. The intertemporal welfare cost of the tax is 
compared with the inefficiency cost due to the misallocation of capital 
between the different sectors of production. 

In Section V, the assumption of a fixed labor supply is relaxed. 
Since the capital income tax lowers the long-run wage rate, its excess 
burden depends on the (compensated) elasticity of the labor supply. 

1 his analysis of the capital income lax relies on a stylized model.® 
However, numerous numerical examples show that some of the re¬ 
sults obtained are fairly robust. In the conclusion, these results are 
summarized and related to other studies using more disaggregated 
models. 

I. The Model 

There is one good in the economy. This good can be consumed or 
used as capital in the production process. Total output per efficiency 
unit of labor, net of capital depreciation, is given by the neoclassical 
production function, y = /{k), where k is the level of the capital stock 
per efficiency unit of labor. 

The private sector is represented by a household, growing at the 
rate n, which takes prices as given and maximizes under its budget 
constraint the utility function, 

U = I (1) 

Jo 

with no uxation. However, the analysis of the abolition (instead of the imjxrsition) of 
the tax IS technically more .simple and suits better the problems of tax reform. It has 
been verihed that for small tax rates, the two methods give the same results. 

® The same method could have been used to analyze the excess burden of the labor 
income tax or the income tax (Chamley 1980a). 
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where the following notation is used; p = pure rate of time prefer¬ 
ence, = rate of growth of labor augmenting technological change, 
and Ct = consumption per unit of efficient labor. 

The function u will be assumed to be of the form u(c) = c^~’’.* The 
labor supply per capita is fixed and normalized to one at time zero. 
The representative household is endowed with pierfect foresight and 
behaves competitively, taking the endogenous future prices (wage 
and interest rates) as given. 

Because of the first-order condition in the maximization of the 
utility function, the dynamic path of the economy satisfies the 
equation* 


cr = -^ (r, - p*). (2) 

fj 

where r, is the net rate of return available to the household, and p* = 

p + <r • p,. 

The capital accumulation is defined by 

=/(*<) - (n + p)A, - c,. (3) 

where Ao represents the initial capital stock. 

The dynamic behavior of the economy is defined by equations (2) 
and (3) and by the initial values Aq andco at time zero. The initial value 
of the capital stock ko is given. There is a unique value of Co, such that 
its associated dynamic path satisfies the budget constraint of the 
household (which is equivalent here to the transversality conditions). 
For this value of Co, the dynamic path converges to the steady slate 
defined by* 

P* = r* = (4) 

c* = /(A"*) — (n -I- p)A* (5) 

(an asterisk will denote a steady-state value). 

* This assumption is necessary for the existence of a competitive balanced growth 
path when the intertemporal welfaie function is additive and the rate of lalior aug¬ 
menting technological change is different from zero. (For an introduction to the 
literature on optimal growth, see Koopinans 11967].) In a discrelc-tiiiic formulation one 
<ould also use a stationary utility function. The existence and stability of optimal 
balanced growth paths in this context is studied by Iwai (1972). 

* The consumption levels at time I and t + &( satisfy the relation, 

u'(e*'r,) i -f rSl 

lip to the first order, this expression is equivalent to 

u'(r"'C|) + u''(<^*f,)(c,pA< T ^ ^ ^ 

u'ie^'c,) 

A straightforward manipulation gives (I/c,)(Ac,/A/) = (l/<r)(r - p - (zp). When A/ 
lends to 0. we obtain cq. (2). 

* The second-order conditions are derived from the concavity of the utility and the 
production functions. For an exhaustive treatment, see Arrow and Kurz (1970). 
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The optimal path also defines a consumption function, giving the 
level of consumption per unit of labor as a function of the capital- 
labor ratio at each instant (see fig. 1 ), 

c = c(A). ( 6 ) 

In the same way, at a given instant, the level of utility is determined by 
the integral ( 1 ) on the optimal path F for an initial value of the capital 
stock which is equal to k: U = J(k). 

We now review a few properties of the optimal path which will be 
useful in the subsequent sections. 

'Fhe slope of the consumption function c'(^*) at the stationary point 
k* is obtained by taking the limit of the ratio between the relations ( 2 ) 
and (3) when k tends to A*; c'(k*) is equal to the positive root of the 
equation, 


— Xx — 7 = 0, (7) 

where k = p* — n — fi and y = -[c(k*)/"{k*)]/cr = (l/cTe)(c(A*)(r* + 
S)w*/{k*lf{k*) + Sk*]}); € is the elasticity of substitution between 
capital and labor in the gross production function when the capital 
labor ratio is equal to Using a first-order approximation of the 
capital accumulation (3) around k*, the difference between k and its 
steady-state value A* decreases asymptotically at a constant rate a (to a 
first-order approximation):* 

(A, - A*) = ~a{k, - A*), ( 8 ) 

where a is the coefficient of adjustment of the economy toward the 
steady state and is equal to the difference c'(A*) - —a is also equal to 
the negative root of equation (7). 

The same asymptotic rule applies to every endogenous variablez, in 
the economy which depends only on the capital-labor ratio (as, e.g., 
the gross factor prices):*' 

(z< - z*) = -a(z, - z-*), ( 8 a) 

where the coefficient a is the same as in equation ( 8 ). 


' The parameter y can also be expressed as a function of quanuties which arc easily 
measurable: 


^ _ (I - a)(r* + fi) 


r* + 6 


{b -r n + fi) 


where a is the share of gross capital income in the production function and 6 is the 
depreciation rate oi the capital slock. 

• This regressive rule may be a very gexx! approximation even if the difference k - k* 
is large (see Chamley 1979). 

• If z, = g(k,), to the first order: (z, - i*) = g'{k*)k, - —g'{k*)a(k, - *•) =• -a(z, -z*). 
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II. The Excess Burden of the Capital Income Tax 

Consider now a tax on capital income at the constant rate d, with 
lump-sum redistribution of its revenues. The analysis of the model 
described in the previous section applies with a net interest rate r now 
given by r = (1 — 9)f(k). In particular, in the long run, the net rate of 
return is constant and still equal to p*. In the long run, the capita! 
income tax increases the gross rate of return and lowers the capital 
stock and aggregate consumption. The levels of consumption and 
capital per unit of labor in the steady state with taxation, c and k, 
respectively, are given by 


p* = (1 - e)r(k) 

(4a) 

c = f{k) ~ (n + p)k. 

(5a) 


This steady state is represented by point A in figure 1. 

We assume that initially the economy is in the long-run equilibrium 
with taxation (fig. 1, point A). The deadweight loss of the tax is 
measured by the welfare gain induced by the suppression of the tax.'® 

At time zero, the capital income tax is abolished” (together with the 
lump-sum refund). The net rale of return suddenly increases, and 
consumption decreases immediately from c to r# (the optimal value 
which depends only on the initial capital stock = A, as described in 
Sec. I). After time zero, because of the increased savings,” the capita! 
stock increases. The economy moves toward the new steady state £ on 
the segment BE of the path I' defined by the dynamic equations (2) 
and (3) (see fig. 1). 

The welfare gain of the lax reform At/ is given by the difference 
between the utility on the path BE and the utility in the steady state 
A ■A’’ 


AU =J(k) - 


P* 


»(c) 

- n - p 


(9) 


When the lax revenues are nm equal to zero, this is e(|uivaleni to the gain obtained 
by a shill from an inleresi lax li> lump-sum laxatiun. 

" II is esscnlial ihroughnul ihis study lhat the lax changes ate unaniicipaied; for an 
analysis of ihe effects of aniicipated lax changes, sec Hall (1971) 

" The observed elasticity of gross savings ij. with respect to the net rate of return at 
time r.eri) (under the assumplion of perfect foresight about future interest rates), t an be 
determined from the parameters of the model; tj = o«(r»/[(r* + 6){n -r n -i- 6)(1 - a)]}, 
where a and 6 are debited in n. 7 anda is given in table 2. For the values used lielow, the 
Cobb-Douglas case (tr = t = 1). t| = .95, 

” When the tax rate 6 is small, the billowing experiment is symmetrical and givcsthe 
same result: Assume that the economy is in the steady slate with no tax (point £). At 
time zero, the tax is instituted (with lump-sum refunds). The economy moves then on a 
path toward the point A. Call J, the uiibiy level on this path and J, the utility level at the 
steady stale E. T he excess burden of the interest tax is equal to Jt - J, 
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Fii. 1.—The consumption function and the dynamic path 


A Taylor expansion of (9) around k* gives 


~ J(A*) +J'(k*)(k - k*) + (k - 


u(c*) 


u'(c*) 


dc 


p* - n - fi. p* - n - p dk 


^(A - k*) 


u"(c*) 


d}c 


2(p* - n - p) dk^ 


(k - k*f. 


From (5a) dcldk = f(k*) — n-p=p*—n-p. Furthermore, the 
marginal value of capital J\k*) is equal to the marginal utility of 
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consumption u'(c*). Therefore, 


AU ~ 



u''(c*) (Pc 
p* ~n - fi ^ 





( 10 ) 


This welfare cost is of a second order with respect to the tax rate. It 
is convenient to divide it by the marginal utility of consumption at the 
stationary point E, u'{c*), in order to obtain its wealth equivalent 
AM:^* 


with 


AM 




( 11 ) 


( 12 ) 


the parameters a and y have been defined in the previous section. 

The relation (11) is to be interpreted as follows: The welfare cost of 
a capital income tax at the (small) rate 0 is equivalent to a permanent 
reduction of consumption on the balanced growth path by a fraction 
L0^. The variable L depends on the technology, the utility function, 
and the growth rate. We now consider some of its properties. 

The relation (12) can be rewritten. 


L 


— Lp 


(r* - n - /x)q ? 
7 


(13) 


where 


-n-p.)- 


( 14 ) 


From (14), and the definition of a and y (eq. [7j), it is straightfor¬ 
ward to derive the following properties; 

L < Lpi-^ > 0;-^ < 0; 


de 


do- 


lim L = Lp; lim L = 0. 

e —•« €—*0 

The excess burden of the tax is an increasing function of the 
elasticity of substitution « between capital and labor in the production 
function. When e is equal to zero, the capital-labor ratio is fixed, and 
since the labor supply is fixed, there is no distortion. The welfare cost 
is nil. When e is infinite, the gross factor prices are fixed,/. = Lp. This 


" The relations (II) and (12) are derived in the Appendix, Sec. II. 
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TABLE 1 

Normalized Excess Burden or the Capital Income Tax 


e 
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2.28 

3.31 

4.30 
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8.76 

9.58 

133.33 
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2.23 

3.22 

4.17 

5.07 

5.94 

6.78 

7.59 

8.37 

9.13 

100.0 

,Nt>rc! —(/ivrn rtf (12), in of ^nnujl conaumjAion. 
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Anni'ai. Rate of Conver<;ence toward Balanced G 
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.5 
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9,14 

8.40 
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13.31 

10.70 
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8.08 

7,30 

6,69 
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5.80 

5.46 
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9.14 

7.80 

6.88 

6,21 

5.68 

5.26 

4.91 

4.62 


Noie —<1 IS n)c<«>ured iii per<«riugc 


case is analogous to the partial equilibrium situation with exogenous 
factor prices and provides an upper hound for the general formula.'^ 

Since 1/cr is an index of the intertemporal substitutability of con¬ 
sumption in the utility function, it is not surprising that the excess 
burden is a decreasing function of a. 

Tables 1 and 2 present estimates of the variable L and of the annual 
rate of convergence of the economy toward the steady stale a for 
different values of e and tr.** The other parameters of the model are 
chosen to characterize the U.S. economy; fj. — .02, n = .0, the gross 
capital income share is equal to .33, the rate of capital depreciation is 
equal to 5 percent, and p* is equal to the long-run value of the net rate 
of return and is taken to be equal to 4 percent. There remain only two 


“This formula has been derived by Levhari and Sheshinski (1972) for a stationary 
economy. Another proof, u.sing the well-known Harberger-Hicks-Hotelling formula, is 
given in the Appendix, Sec. I. 

The.se values of the rate of convergence can be compared with those di.scus.sed by 
Sato (1966) and give additional information for a choice of e and o. 

“See, e.g., Jorgenson and Christensen 1973. 
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unknown parameters, p and <r. Since there is no general agreement 
about their values,'* we present estimates for different values of a. 
Once a value is chosen lor a, the pure rate of time preference is 
implicitly determined by the relation, p = p*^ — crp. 

As can be seen from table 1, for realistic values of e the excess 
burden is not very sensitive to the elasticity of the marginal utility o-.'* 

The last column in table I corresponds to the partial equilibrium 
case. It is clear that for realistic values of c, the excess burden is much 
smaller. The variable L is represented as a function of c for a fixed 
value of a, equal to one, on figure 2 (curve /3 = 0). For the relevant 
range of e, the excess burden of the interest tax increases almost 
linearly with e. 

The value of the normalized excess burden L depends also on the 
discount rate /* and on the growth rate n + p. No general rule can be 
derived about the effects of these parameters. When the elasticity c is 
large, L increases with n + p. and decreases with r* (as it is clear from 
the limit case c = <», described by eq. [14]). However, when the 
elasticity of substitution is smaller than two, and the other parameters 
of the model have the same values as above, it is found that L is an 
increasing function of both the discount rate and the growth rate.^® 


The Excess Burden for Large Tax Hates 

I’he measure of the excess burden given by expressions (11) and (12) 
is exact only for infinitesimal values of the tax rates. In order to 
estimate the bias involved for large values of 0, we have to use a 
different method. 

At time zero, the economy is on its balanced growth path under 
taxation (point /I on fig. 1), and the initial values of the capital stock 
and the consumption level are given by (4a) and (5a). The dynamic 
path after the tax cut, characterized by (2) and (3), is transformed into 


Available c.'itiinales of <r seem to point to a value higher th.in one. Wet>ei (1975) 
leports values between 1.3 and l.H. Wright’s (1969) e.stimates are somewhat higher— 
aioutid 4. 

In the limit case where the discount rate »• and the growth rale aic equal, the 
variable /. liccomes 

/ = . * 

2 f*/'!**) 2(1 - q)' In + m + S/ 

which is independent of the utility function and linear in the elasticity e (ceteris 
paribus). 

When < = o = I, the following values aie obunned: Take n + p = 2 (ici cent: when 
r* increases from 2 percent to 6 percent, L incrca.ses from 3.00 percent to 6.96 percent. 
When r* is equal to 4 percent and n + p increases from 2 percent to 4 percent. L 
increases from 5.46 percent (table 1) to 7,26 percent. 
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L 

(In %) 



Fh.. 2. —7'he norm;<li/cd excess burden; f - elasticity of substitution between capital 
and laboi,^ = coiTiftensated elasticity of substitution Isetween leisure and consumption. 


a discrete formulation and simulated numerically using a gradient 
algorithm of optimal control. The consumption equivalent of the 
utility after the tax cut c, is defined by the stationary level providing 
the same utility as the dynamic path after the tax cut: u(c,)l{p* — n — 
fx) = J(k). Table 3 reports the ratio ((c, - c)lc](\lB^) as a function of 6 
for cr = € = 1; the other parameters of the model have the same values 
as in the first two tables. 

The first number in the table (for 0 = 0) is obtained by formula (12). 
When 6 is small, it gives a good approximation of the normalized 
excess burden. However, we can see that for large values of the tax 
rate 8, the approximation formula (12) underestimates the welfare 
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TAB1.E 3 

Nokmai.ized Excess Buhden kok Lakoe Tax Rates (%) 






e 
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.05 
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.5 

f, - f I 
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5.46 

5.82 

6.25 

7.27 

8.58 

10.33 

12.77 


Noll - tX = « ns J 


gain of the tax cut (for 0 = 50 percent, by a factor slightly smaller than 

In order to illustrate the previous discussion, let us consider two 
numerical examples of a shift from capital income taxation to lump¬ 
sum taxation. 

Assume that the aggregate consumption in the steady state under 
taxation is normalized to one. When the tax rate 0 is equal to 50 
percent, total revenues are equal to 12.34 percent. A shift to a lump¬ 
sum tax increases the level of welfare by a consumption equivalent of 
3.19 percent and the level of consumption in the long run by 8.38 
percent. The welfare cost of the capital income tax is equal to 26 
percent of the amount of tax revenues. The marginal welfare cost is 
obtained by multiplying the average cost by a factor of two (this factor 
should be somewhat greater than two for large rates, since the nor¬ 
malized excess burden increases with the tax rate). 

When 0 is equal to 30 percent, tax revenues are equal to 6.73 
percent. A shift to lump-sum taxation would increase consumption in 
the long run by 3.3 percent. The welfare cost of the tax is equal to 
0.77 percent of the consumption level or 11 percent of the revenues. 

The method followed in this .section takes into account both the 
short-run and the long-run incidence of the suppression of the capital 
income tax. We can observe that the welfare gain of tax reform 
(measured in consumption equivalent) is much smaller than the per¬ 
centage increase of consumption in the long run. Also, the excess 
burden of the capital income tax, although small in terms of aggre¬ 
gate consumption, is not negligible in terms of the revenues 
generated. 

III. The Case of the Nonperfect Foresight 

In the previous section, the welfare gain of a reform from capital 
income taxation to lump-sum taxation was determined under the 

On the other side, when 9 is large, the model considered hei e implies a sudden 
increase of saving which could be too large to be borne by a real economy. Therefore, it 
is likely that the numbers in table 3 may be overestimated. 
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assumption that the economy follows an optimal path toward the new 
steady state. However, when there is no complete set of future mar¬ 
kets to convey information on future prices, individuals may not have 
perfect foresight. In this case, the dynamic path after the tax reform is 
no longer optimal, and the welfare gain obtained is smaller.*^ 

The assumption of perfect foresight is relaxed in this section. For 
its saving decision, the competitive household relies now on point 
expectations about the future wage and interest rates. By assumption, 
these expectations satisfy the following property; At each instant t, 
anticipated future prices depend only on the value of the capital stock 
at time l (e.g., through the factor prices at time t) and on some 
constant parameters (as the long-run values of the factor prices).'"'^ It 
follows that the consumption level can be expressed as a function of k. 

We also assume that expectations are consistent with the steady 
state and do not affect its stability. The rate of convergence of the 
economy «, depends on the type of expectations considered. For 
example, in the case of myopic expectations (where the factor prices 
observed ai a given time are expected to be the same in the future), 
the rate of convergence is greater than in the case of perfect foresight. 

As in the previous section the initial situation of the economy is the 
steady state with taxation. After the suppression of the tax at time 
zero, the economy converges to the steady state £ on the path r'(B'E}, 
which in general is different from the path f. The welfare gain is 
measured by the difference, 

-, (15) 

./ jr* — n — fi 

where J(k) represents the integral (1) on the path B'E. 

The second-order approximation of the utility level after the tax 
cut,Jik), depends only on the slope of the dynamic path at the steady 
state,*^ c',(k*), or on the rate of adjustment toward the steady state a, 
(since a; = C/[A*] — [r* — n — pi]): 

]ik)=F(a,,k); (16) 

this utility is smaller than the utility on the optimal path, 

F(a,, k) « F{a, k) = J(k). (17) 


“ For the class of expectations considered below, the dynamic path still converges to 
the steady state. 

“ This class includes (lerfcct foresight and myopic, stationary, and regressive expec¬ 
tations. This section uses some intuitive results derived in Chamley (1979). 

“ A proof is given in the Appendix, Sec. III. 
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The welfare gain induced by the tax cut is equivalent to a perma¬ 
nent increase of the consumption level by a fraction where L, is 
defined by 


= [Q<«,)/Q(«)l ■ />. (18) 

with 


y(«,) 


at I ‘h _ ^ 
A -H la, [a • c'{k*) \ 


(19) 


L is equal to the perfect-foresight value defined by (12) in the previous 
section. 

Using the relations (18) and (19) and the properties of a and c'(k*) 
(described in Sec. I), it is .straightforward to show that the ratio L,/A is 
always smaller than one. When the private sector does not anticipate 
future prices with perfect foresight, the welfare gain induced by the 
suppression of the capital income tax is reduced. 

The ratio between the values of the excess burden with nonoptimal 
and optimal adjustments L,IL is represented in figure 3 as a function 
of the ratio between the rates of convergence (w/Za). Fhe different 
curves correspond to different values of the discount rate r*. The 
growth rate n -I- pt is equal to 2 percent, and the product o-e is equal to 
1 . 

Also the cases r* = 3 percent and r* = 6 percent provide a very 
good approximation (less than 1 percent error) of the cases r* = 4 
percent, oe = i, and i* = 4 percent, ere = 2.5, respectively. 

It is interesting to observe that, in general, a shift from capital 
income taxation to lump-sum taxation always induces a welfare gain 
(L, is positive unless a, has an unrealistically large value, greater than 
[12.1] • Cl). This gain is close to its maximum for a wide range of values 
of the ratio {a,fa). When this ratio is between 4 and 'i, L, is equal to at 
least 86 percent of L (for r* = 4 percent, ae = 1). 

As an example, let us consider the case of myopic expectations. The 
consumption function takes then the following form; 

Csik) = (1 + y , l/<*) - + M)A]. 


Using (18) and (19), alter elementary manipulations, the ratio be¬ 
tween the normalized excess burden under myopic expectations and 
its value under perfect foresight, Ls/L, is given by 


l.,/L = 



xVA’'^ + 4y\ 
X* 2y r 


( 20 ) 


4 



Ratio botween the excess burden under 
non perfect and perfect foresight 
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■The excess burden under nonperfect foresight 
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with k = r* — n — fi, and y is the parameter described in equation (7). 
A good approximation of (20) is given by 

where a is the optimal rate of convergence reported in table 2. 

Under myopic expectations, the welfare gain of the tax reform is 
always equal to at least 50 percent of the perfect-foresight value. This 
case is represented by the point Af on figure 3 (for 6 = cr = 1, the rate 
of convergence toward the steady stale under myopic expectations a.,, 
is about six times larger than the value under perfect foresighta). The 
ratio f-.s-//. depends on the long-run value of the rate of return r*, the 
growth rate n + ft, and the parameters < and cr. When r* — 0.04 and n 
+ p, = 0.02, it can be determined for different values of € and cr, using 
table 2 and the relation (21). 

We see in figure 3 that, for the relevant range of values of the 
various parameters, the ratio between the welfare gains of tax reform 
under myopic expectations and under perfect foresight is between .55 
and .65 and is not very sensitive to the parameters of the model.** 

Figure 3 also allows us to consider a more general class of expecta¬ 
tions. For example, when the private sector underestimates the future 
rate of convergence of the economy, the realized rate of convergence 
fl; is in the interval (a, as). For this special class of expectations, the 
welfare gain of a tax cut is bounded below by the value found in the 
myopic case. 

Finally, it may be interesting to observe that when the private sector 
applies a modest amount of rationality, the dynamic path of the 
economy is close to the optimal path. Around the steady state, the 
variation through lime of the factor prices is given by the expression, 
(r, I*) = -a,{z, - z*). 

Assume now that the long-run values of the fac tor prices are known 
and that the only unceitainty left is about their rate of convergence to 
the new steady state. Furthermore, individual expectations are of the 
regressive form; the expected values of a parameter x, at some future 
date t, x'f, are determined by the following rule; 

(xf - x*) = -a(xf - X*), (22) 

xo = Xo- 

Individuals revise at each instant the estimated value a of the future 
rate of convergence, using the observed value d (given by a = z/(z* - 

When the difference A tietwecn the rate of return r* and the growth rate n + n 
lends to zero, the ratio LsJL tends to 1/2, Also, at the same time, the erroi implied by 
myopic expectations, which is measured by the ratio aja, lends to inhnily. 
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z), where z is an arbitrarily chosen endogenous variable) and an 
adaptive rule, for example, d = i/(d — a), where v > 0 . 

Under these assumptions, and for an initial value of the capital 
stock not too different from the steady-state value, both the estimated 
value a and the actual value d of the rate of convergence tend to the 
optimal value The dynamic path istongent to the perfect foresight 
path at the steady-state point. The value of the welfare gain of tax 
reform is close to its (maximum) perfect-foresight value. 


IV. The Welfare Cost of the Corporation Tax 

Most studies on the corporation tax assume that the corporate and the 
noncorporate sectors produce two different goods and that the ratio 
between their respective outputs depends on their relative prices.’*’ 
For the sake of simplicity, we assume here that both sectors produce 
the same good and that the aggregate production function can be 
written in the form,*")! = g(A,, ^ 2 ), where all quantities are divided by 
the total effective labor supply, and k, and represent the corporate 
and the noncorporate capital stock, respectively. The function ^(^ 1 ,^ 2 ) 
is assumed to be homogeneous in its arguments. 

Initially the economy is in the steady state where only the corporate 
capital income is taxed at the rate 0. In each sector, the net rate of 
return is equal to the optimal stationary value p*, and the values of the 
capital stocks, A, and Aj- determined by 

p* = (1 - e)-^{k„k^) = (23) 

The elimination of the corporate tax at time zero has two effects. 
First, capital is reallocated between the two sectors to equalize their 


Numerical simulations have shovvn that for almost iiny initial value of the capital 
stock, the convergence to the perfect-foiesight path occurs alter only a few periods. 
This piocedure may be used as an algorithm to determine the |>erfect-foresight solu¬ 
tion. 

” See Harbergcr (1962, 1976), Shoven and Whalley (1972), Boadway and Tred- 
deiiick (1975), and Shoven (1976) for a static analysis. Friedlander and Vandendorpe 
(1978) extend Harbcrger's .study to a dynamic framework. However, they do not 
address the problem of the incidence on the intertemporal welfare and do not consider 
a saving function derived from the optimization oi an intertemporal utility lunction. 
Fullerton, Shoven, and Whalley (1979) arc currently working on a more elalroratc 
model. 

“ The usual assumption that the corporate and the noncorporate sectors produce 
two separate goods may al.so be a crude description ol reality; .see, e.g,, Ebrill and 
Hartman 1977. 
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rates of return. The aggregate production function now takes the 
form, 


y = / (*). 


(24) 


where 


f(k) = m'MgOiuki). 

k, -^k^^k 


After time zero, the private sector is endowed with perfect 
foresight, and the economy moves on the dynamic path studied in 
Section I, which is characterized hy the equations, c = {c/cr)lf’{k) - 
p*], k =f{k) — (71 -t- fi)k -c, and Aq =^k=k^ + k^. As in the relation (9), 
the welfare gain of the tax cut is equal to Af/ = J(k) — [u{c)/(r* ~ n - 
p)]. This difference can be decompo.sed into 


^U 


y w - 


M(f) 


n — fjL 


u(c) - u(c) 
r* — n — fi 


(25) 


where c = J(k) - {n + fi)k is the stationary consumption available 
when the level of capital A is allocated efficiently between the corpo¬ 
rate and the noncorporate sectors. 

The first teiin of the RHS (right-hand side) of (25) has been 
analyzed in Section II and measures the intertemporal welfare cost of 
the corporate tax, originating in the reduction of capita! accumula¬ 
tion. It is equivalent to a permanent reduction of the consumption 
level by a fraction L( where L, , is defined by^® 

- (t)' ■ 


Since the tax revenues are equal to this relation implies that for 

small values of the tax rates, the intertemporal wellare cost of capital 
income taxation depends only on the total amount of tax revenues 
(e.g., a 10 percent uniform tax on the whole capital stock and a 20 
percent tax on half the capital stock have up to the second order the 
same intertemporal welfare co.st).“ 

Although the distribution of the tax burden across the different 
sectors does not affect the interteinporal welfare cost, it creates a 
distortion in the allcxiation of the capital stock between sectors. This 


A proof is given in the Appendix, Sec. IV. 

“ The extension to muny sectors is straightforward. In particular, this analysis could 
he applied to a model with font types of capital: corporate and noncorporate capital, 
housing, and human capital. 
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production cost is measured by the second term in the RHS of (25) 
and is equal to^‘ 


u(c) - u(c) 
(r* — n ~ fi) 


u'(c-) 

(r* — n — fi) 




(27) 


The welfare cost induced by the production inefficiency is equivalent 
to a permanent reduction of the consumption level by a fraction 
where 


i = [fik) ~ g(kt,h)]/c. 


(28) 


This is precisely the excess burden studied by Harberger and others. 

The total welfare cost of the corporate tax is equal to the sum of the 
intertemporal cost and of the cost due to production inefficiency. It is 
equisalent to a permanent reduction of the consumption level by a 
fraction whei e 


/ .(■ — -f- Lc2 ~ 


= (-^1 
\ k 


[/(^) - g(kuki)\ 


(29) 


k,, k.j, and k are given by (21). and 

c =J(k) - (n + n)L (30) 

In order to have an order of magnitude of the respective quantities, 
assume that the ratio k,/k is equal to i, which is close to the value 
observed in the U.S. economy, and that only the capital income 
originating in the corporate sector is taxed at the rate of 50 percent. 
Using the results of Section 11, the intertemporal welfare cost is alxiut 
0.25 percent of the level of aggregate consumption (or around 7.8 
fiercent of the tax revenues).'** The value is somewhat Ittwer than the 
cost of intersectoral misallocaiion, which in the same model is equal 
to 1.1 percent ol consumption (this value is well within the range of 
current e.stimates of 0.5-1.5 percent; see .Shoven 1976). The interin¬ 
dustry misallocation caused by the corporate tax seems to be much 
greater than the intertemporal distortion. 


V. The Case of ao Elastic Labor Supply 

The capital income tax increases the gross rate of return and, by the 
factor price frontier, lowers the wage rate. 'When the labor supply is 


'*■ To a second order, ii is equivalent to evaluate this difference at k or k*. 

■’* These estiniate.s are obtained with a simplihed version of the model in Section II: 
There is no depreciation, and the production function takes the form> = k^k^ (with a 50 
percent corporate tax rate, A,/*, = 0.5). The capital income share, net of depreciation, 
2/3, is the same as for the previous numerical results and is equal to 0.18. We have also 
used table 3 in the correction for large tax rates. 
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not fixed, this tax creates, in addition to the intertemporal distortion, 
a distortion in the choice between consumption and leisure at a given 
instant of time. 

For example, assume that the elasticity of substitution between 
capital and labor in the production function is equal to zero; the 
capital-labor ratio is constant. When the labor supply is fixed, the tax 
has no incidence on the capital stock and is equivalent to a lump-sum 
tax with no excess burden. However, in the long run the gross rate of 
return is higher and, by the price possibility frontier, the wage rate is 
lower. When the labor supply depends on the net factor prices, the 
incidence of the tax is to decrease the long-run labor supply, the 
capital stock, and the consumption level.®' 

In general, it can be expected that the excess burden of the capital 
income tax is increased when the labor supply is variable. In this 
section, the framework developed in Section H is extended in order to 
measure this additional effect. 

The utility function of the private sector depends on the amounts 
of consumption and leisure; 

U = \ (31) 

Jo 


where fjc**' is the consumption per capita and /, is the labor supply per 
capita (measured in natural units, and not in effective units). 

For the sake of simplicity, we consider the following form for the 
utility function u:“u(c,0 = (1 — )3) logc -I- /3 log (T - 1), where^ and 
T are exogenous parameters. The dynamic path of the economy is 
now characterized by the relations. 


and 


it = r([r,(l - d) - p*], 
ki =J(k,,l,) - {n+ti)k, - c„ 


__ 

(1 -PHT-I.) 


= ui„ 


(32) 

(33) 

(34) 


where r, and w, are the gross interest and wage rates, respectively, and 
k, represents now the aggregate capital stock divided by the efficiency 
index e*^'. 

Equations (32) and (33) are the same as in the fixed labor supply 
case, and, at each timet, the marginal rate of substitution between the 


” A sufficient condition for these properties to be venfied is the homotheocity of the 
utility function. This homotheticity is a necessary condition for the existence of an 
optimal balanced growth path. 

" When ft is diu'erent from 0, and the elasticity of substitution in u between con¬ 
sumption and leisure is different from I, there is no optimal balanced growth path. 
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consumption of leisure and of produced goods is equal to the wage 
rate (eq. [34]). 

At a given instant, the labor supply depends not only on the wage 
rate at the same moment but also on the future factor prices (wage 
and interest rates). 

The initial state of the economy (where the tax has been in effect 
for an infinite amount of time) is defined by the stationary equivalents 
of (32)-(34): 


1 ~e’ 

(35) 

c = A (f)l. 

(36) 

- ^ ^ = w(f), 

(i-mr-i) 

(37) 


whered (r) is defined using the production function, A (r) = r[(kll)(r)] 
+ w{,r) - n - /L,and the capital-labor ratio A/(is expressed as a function 
of the gross rate of return r. 

At time zero, the tax is suppressed. Thereafter, the dynamic be¬ 
havior of the economy is described by (32), (33), and (34), where 0 is 
replaced by 0; the new steady state is characterized by 

r* = p*, (38) 

c*=A(r*)l*, (39) 

and 


/3f* 

(I -/3)(r-/*) 


w(r*). 


(40) 


The increase of welfare induced by the tax cut can be approximated 
to the second order by the same method as in Section II; its wealth 
equivalent is of the form, AM = Lg lc*l{r* — n - p.)]0*, which depends 
on the parameter (3. The formula for Lg is rather complicated,'** so we 
present in tables 4 and 5 some numerical estimates analogous to those 
in Section II. 

Also, L 0 is represented as a function of e in figure 2, for the values /3 
= 0, .6, I; /3 measures the long-run compensated elasticity of the labor 
supply with respect to the wage rate. When it is equal to zero, the 
labor supply is fixed; this is the case studied in Section II. 

In table 4 and figure 2, we can see that the excess burden for a 
given value of 6, is increasing with )3 and is bounded by the value of 


" A computer program for its numerical evaluation is available. 
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N'ormauzeo Welfare Cost of Capital Income Tax with a Variable Labor Supple (%) 
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Lid*, obtained for /3 = 1,** It is remarkable that the difference Lp — L© 
does not appear to be significantly affected by the value of (for c 
smaller than 2) and that this difference is relatively small with respect 
toLg when the production function is of the Cobb-Douglas typ>e (e = 
1 ). 

When € is infinite, we have the partial equilibrium case again. It is 
straightforward to show that the relation (14) is still valid. 

The welfare cost of the capital income tax at the rate 6 is measured 
by a fraction of the full consumption of produced goods and of 
leisure (valued at the wage rate), which is equal to Lpd*, where Lp = 
i[r*/(r* - n - p,)]*. 

VI. Conclusion 

The general equilibrium models used in this study are highly stylized. 
However, the numerical examples considered indicate that the 
character of its results is fairly general. 

The value of the excess burden of the capital income tax depends 
mainly on the elasticity of substitution between capital and labor e. 
When c is smaller than two, the excess burden is almost proportional 
to e. No general rule could be derived about the effects of the dis¬ 
count rate. When the other parameters of the model are chosen to 
characterize the U.S. economy, the excess burden of the capital in¬ 
come tax increases with the discount rate and the growth rate. 

Interestingly enough, when e is smaller than two, an increase of the 
intertemporal substitutability of consumption (measured here by the 
inverse of the marginal utility of consumption) increases the value of 
the excess burden only by a negligible amount. In this case, although 
the deadweight loss of the tax is relatively small with respect to the 
level of aggregate consumption, or to its value in the ca.se of fixed 
factor prices (c = =»), it is not negligible with respect to the amount of tax 
revenues; A capital income tax of 50 percent implies a welfare loss 
equivalent to 26 percent of the tax revenues; when the tax rate is 
equal to 30 percent, the deadweight loss is equal to 11 percent of the 
tax revenues. These values should be multiplied by a number between 
1 and 4/3 if the labor supply is elastic. 

These numbers are indicative of the welfare gain induced by a shift 
from capital income taxation to lump-sum taxation, when individuals 

“ The compensated elasticity of the labor supply is bounded by 1. 

Numerical experiments have .shown that the difference L, - L, is not .sensitive to 
variations of 6, p, or p., foi realistic values of *. When e is equal to zero, the excess 
burden of the capital income lax arises because of its incidence on the wage rale. In this 
special case, the interest tax has the same excess burden as the wage tax which generates 
an equal amount of revenues (Chamicy 1980a). 
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have perfect foresight about the behavior of the economy after the tax 
is suppressed. Since the determination of the perfect-foresight path is 
not an easy task for most economists, even in a simple model, this 
assumption may not be realistic. When the tax reform does not lead to 
an optimal path, its benefits are reduced. For example, under myopic 
expectations, the welfare gain induced by the abolition of the capital 
income tax is only equal to about 60 percent of its value under perfect 
foresight. However, myopic expectations grossly underestimate the 
future behavior of the economy. When individuals anticipate some¬ 
what this future behavior, the welfare gain of tax reform is close to its 
maximum value, even if the degree of foresight is not very accurate.^ 
(A method to approximate the perfect foresight path has also been 
suggested in Sec. III.) 

Finally, under the present U.S. tax system, the rates of the capital 
income tax vary by a large amount from one sector of production to 
another. When the overall elasticity of substitution between capital 
and labor is not too large (less than two), the welfare gains obtained by 
an equalization of these rates dwarf the gains obtained by a reduction 
of the global tax on capital income.^* 


Appendix 


I. The Excess Burden in Partial Equilibrium 

An income equivalent of ihe excess burden is given by the well-known 
Harberger-Hicks-Hoielling formula. In a coniinuous-time formulaiion, this 
becomes 


with ihe following notation; 

m = n + ft 

' p, = (price of r, at lime 0). 

.Ap, = 



compensated derivative of c, with respect top,.. 


(Al) 


The assumption of myopic expectations is u.sed in Hudson and Jorgenson (1976) 
and in Fullerton et al. (1979). Both studies rely on multisectoral models where one type 
of capital is allocated between the different sectors of production. Because there is only 
one state variable (the aggregate capital stock), regressive expectations are still 
asymptotically correct, and the discussion in Sec. HI applies; in particular, the mecha¬ 
nism suggested there could be used to improve the accuracy of foresight, 

“This result is also found in Fullerton et al. (1979). 
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By application of Slutsky’s equation. 
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Cl 

M dct \ 

M 

, c, dM j 


The first expression in parentheses is simply equal U) the variation of the 
consumption at time/, after an uncompemaled interest tax has been applied 
(at time zero). 

Since the utility function is homogeneous, the income elasticity of con¬ 
sumption is equal to one, and the excess burden is given by 

AM = e"'A/,,Ac,dt + e"V,Ap,<f/) • 

From (2) and the budget constraint, the uncomptensated demand function, r,, 
IS expressed as follows: r, = vMe^, with v = r* — n — /jl — ^r*[i — (l/cr)],anda 
= —{t *l<T)0. Therefore, when 6 is small. 


Ac, = -Mr* 





We replace Ac, in the above expression for AM, and since the wealtli M is 
equal lo the present value of the consumption stream on the balanced growth 
path, we have 


with 


A,Vf = -Lp 


9\ 


(A2) 




For a given value of the interest tax rate, the welfare tost in partial equilib- 
iiurn is an increasing function of the intertemporal substitutability of con¬ 
sumption in the utility function, which is measured by 1/cr. It is a decreasing 
function of the di.stount rate and an increasing function of the growth rate. 


11. The Excess Burden in General Equilibrium 

We determine here a .second-order equivalent of the welfare gam AU, ob¬ 
tained by the suppression of the interest tax: AU = J{^) - [u{cJ/Aj (k = p* — n 
— p). The terms J{k) and u(f) are considered separately. 

Fhe term J(k) is equal to 

J(h = f e^*'u(f,)d(, 

Jo 

where r, is uken on the dynamic path BE (fig. 1). This integral can be 
decom|X).sed into two terms: 

^ «• 

J = u(ct)^t + I e'*^^u(Ci)dt, 

J 
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where Al is a small interval of time. By taking an inhnitcsimal value for At, we 
verify that the function J satisfies the relation, 0 = u(c) — \J + J'(h)k-, or, using 
the capital accumulation equation (eq.l3J),0 = ulc(k)] — k.J(k) +J'(k)[J(k) — {n 
+ ^L^ - c(k)]. 

Differentiating this expression twice al the point A*, we obtain 

uV'* + i"u' 

J‘(.k*) = «'(c*), /'{*♦) = - y - " -. 

where u' = (fiu/de){r*),u" = = (dcl(lk)(k*),c'' = lrfV(A*))/rfA''‘, anti 

a = c'(h*) — k. 

Therefore, a second-order approximation of y at A is given by 

J(k) = — + u'(k - k*) + -5-(U"f'* -I- - **)». (A3) 

jy I ^ ' 2(K + 2<i) ^ 

Using the properties ol the optimal consumption function described in Sec¬ 
tion 1 of the text, wc can rearrange the term of the second order; 

+ f'u' _ u" / fj" “l 

2(X -H 2a) ~ 2(X + 2a) T ~^l 

= + r'a) _ u"r'(f' + era ) _ u"r' 

2(X + 2a) 2(X 2a) 2 ' 

Hence, 


J(k) = -!^+u'(k-k*)^-!^{k-k*f. (A4) 

A Z 

The value of cotisumpiion in the steady state under taxation.c, is determined 
bye = / (k) ~ (ri + p,)A. 1 he setond-orderet|uiva!ent of ii(f)/X tan lx* written as 


= i 4- u'(k - k*) + -^(u'J" + u"k^)(k - k*)\ (A5) 

A A ^A 


= — r n'{k -k*) + -^(k + -—jik - k*f. 
k 2 V X / 


(Afi) 


I aking the difference Itetween }(k) and u{f)lk (in |A4) and (Afil), we have 

MJ = - — - **)*, 

if A 


where a is the positive rtwit of*’ + X* — y = 0 (described in text Sec. 1). 

Since f'(k) = p*/(l - 0) and (k - k*}J'' ~ p*6. the welfare change ran be 
rewritten as 


Af/ 


11 ’ c | p*\’/ <rXn \* 
2ar X \ X / i f/" / 


S’ = u' 


r* 



»*/.. 


where 
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(the partial equilibrium value of L expressed by the relation [14] in the text), 
and 


-cnk*) 

^ cr 

The term kaly is equal to the positive root of the equation (-y/X*)** + x — 1 = 
0. This root is contained in the interval (0,1). The derivation of its properties 
when e varies is straightforward. The sign of dLldo is obtained by the same 
method. 


III. ihe C.a'ie nj NonpfTfeet Foresight 

When the private sector is not endowed with perfect foresight, c'(k*) is no 
longer the positive root of the equation x* - Xx - y = 0. '['he welfare gain of 
the interest lax cut is obtained by the same method as before. However, taking 
the diflercnce between (AS) and (A5), we find 


AU 




1 


\X + 2a, 



X -+■ 2a, j 


where r, and a, repie.sem the consumption function and the coefficient of 
adjustment of the economy toward the balanced growth path, under nonper- 
lect foresight. 

Use the relation a, = c', — \ and the definition of the perfect-foresight 
values for a, and c',. respectively a and r', to obtain 

AU = it'(r*)/“(fe*)| (^(Q,). 

where 


Q(«y) = 


a, 

IjiL- 

X + 2a, 

[ar' 


1 he tonsumption equivalent of the welfare gain can then be expressed as L, 
= L ■ where L is the value ol L, under perfect foresight. 


IV. The (Corporation Tax 

We examine heie the long-run incidence of the corporation tax on the capital 
st<x.k. The capital sttKks in the corporate and in the noncorporate sector, k, 
and ki, respectively, arc determined in the long run by the relations p* = (1 - 
Differentiate this expression around® = Oto obtain the 
variation of the aggregate capital stock, dk: 

dk = dh, + dk^ = (A7) 

with A = ghg'ii - g'lig'ii- Assume now that the lax is applied uniformly to lx>th 
sectors, and call Dk the variation of the capital stock: 

nk = I ^ g” ~ 

Taking the ratio between (A7) and (AS), 

dk _ g22 ~ g'u 

Dk g^, 4- gi', - 2^1, 


(A8) 



CAPITA!, INCOME TAXATION 


495 


Since the production function g{k„ kj) is homogeneous, it can be rewritten 
under the form g(.k,, k^) = ff[f (*,, A,)], where F is homogeneous of degree 
one. 

Use the properties of F, and the equality of F[ and Fj (at d = 0), to write 


g^» - g:^ = m'{F'^ - f:,) = h'f;, ( 

gn +g22 - 2g1,^H'Fi,\\ -^2- 


k, 



Taking the ratio between these relations, dkIDk = ki/k. This result could be 
generalized easily. The long-run incidence of a sectoral capital income tax on 
the total capital stock is proportional to the share of the taxed capital with 
respect to the total capital stock. 
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Some standard topics in the theory of international trade are recon¬ 
sidered in this paper by distinguishing fx^iwcen national and aggre¬ 
gate iruome when fixed supplies of'foreign inputs are present within 
the home toumry. Under (onditions that would ensure a national 
welfare gain if foreign ownership were absent, international trans¬ 
fer, eeonomit growth, or tariff ijolity niiglil cause a national welfare 
loss in the presence of foreign ownership. The tec hniqiies developed 
could be ap[)lied to other domestic distinctions (such as those based 
on rate, sex, age, oi ethnicity) and to the theory of cnstonis unions in 
a three-country wot Id. 


I. Introduction 

This paper reornsiders a number of standard topics in the theory 
of international trade by taking explicit account of the distinction 
between national and aggregate income when fixed supplies of 
lorcign-owncd inputs are present within the domestic economy. Kx- 
lending the work of Bhagwati and Brecher (1980),’ the following 


I hank.s arc due to the National Sc ience Foundation gram no. StX;79-n7541 for 
partial financial sup|K)rt of ihe research underlying ihis paper. I'he coiniiienis and 
suggestions of Alan Deardcirft, Jacob Frenkel. Alasdair Smith, and anonymous referees 
arc gratefully acknowledged. 

' I heir work, in turn, extends the analysis of Bhagwati and I ironi (1980). who 
coiicemratc on a special case mentioned in n 3 below. 

\ Inurnal nf /'oliUi-til t.romtmf, I'lMI, viii H9. tK* 31 
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analysis takes a new look at welfare-theoretic aspiects of'international 
transfer, economic expansion, and tariff policy, while it emphasizes 
significant departures frtim conventional wisdom that arise in the 
presence of foreign ownership. As these selected departures suggest, 
many standard results are open to serious question when part of the 
domestic product accrues to factor inputs from abroad. 

Originally, the motivation for the present two-group analysis (based 
on the national-foreign distinction) came from a recent concern in 
Latin America, where policymakers have been worried about the im¬ 
pact of trade liberalization on national welfare, given the domestic 
presence of foreign-owned multinational corporations. After further 
reflection, however, it is clear that the treatment below has much 
greater applicabiliiy to a broad range of analytically similar cases. For 
example, it is possible to treat in much the same way a wide variety of 
aliernafivc domestic distinctions, including those based on race, .sex, 
age, or ethnicity. The following techniques and results, moreover, are 
directly relevant lor the fully analogous two-group issue relating to 
the distribution of gains (or losses) between trading partners in a 
customs union (sutb as the Fliiropean Economic Community) with 
factor mobility. While these other policy problems are of considerable 
importance and interest as well, only the national-foreign distinction 
is pursued explicitly here for the sake of brevity. 

Section II reviews the basic model of an open economy, in which 
foreign-owned and national supplies tsf two homogeneous factors are 
combined to produce two commodities. As Section 111 then shows, a 
transfer-receiving country might suffer a loss in national welfare, 
even under the usual conditions which would ensure a welfare gain if 
foreign ownership were absent. As established next by .Section IV, a 
country experiencing economic expansion (due to factor-endowment 
grow th or technological advance) might encounter a deterioration in 
national welfare, even under well-known conditions which would 
preclude this possibility of “immiserizing growth” in the absence of 
foreign ownership. Afterward, Section V explains why free trade 
might be inferior to both no trade and subsidized trade, as far as 
national welfare is concerned.^ Section VI summarizes the paper’s 
main results, based on the possibility of aggregate and national wel¬ 
fare moving in opposite directions. 

Needless to say, this possibility would not arise if foreign-owned 
factors were taxed to the nationally optimal extent. Indeed, with these 
factors in perfectly inelastic supply, the optimal lax on each foreign 

’ t his result is obtained al.so by Bhagwali and I'iruni (1980), for a special case 
identified in n. 3 below. In addition, since Bhagwati and Brecher (1980) compare 
free-trade equilibnum with autarky, the present paper will emphasize instead the 
coiripanson of free versus subsidized trade. 
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input clearly would be 100 percent, thereby removing the after-tax 
distinction between aggregate and national welfare. Assuming that 
this type of optimal taxation of factors is politically infeasible, how¬ 
ever, the present analysis cautions nationally oriented policymakers 
against the usual, automatic adoption of the standard welfare conclu¬ 
sions which reflect an aggregate point of view. More sftecifically, this 
paper shows precisely how the traditional (aggregate) propositions 
must be modified for a truly national perspective, when political 
constraints eliminate optimal taxation of inputs from abroad. 

II. The Basic Model 

Following the analysis of Bhagwati and Brecher (1980), the present 
section summarizes the basic two-commodity, two-factor model of an 
open economy (large or small), which plays host to given quantities of 
inputs from abroad. I'he aggregate factor endowments of the country 
are fixed « R'‘ umis of capital and L" units of labor, while the given 
antounts K” and L” are the national endowments of capital and la¬ 
bor, respectively. (Thus, the fixed supplies ol_foreign-owr^d capital 
and labor within the home country areii® - K" and /-“ - respec¬ 
tively.) It is assumed that K” > K’' > 0 andi.® > L” > 0, excluding the 
possibility that either factor within the home economy is owned 
wholly by nationals or completely by foreigners.^ Commodity two is 
always labor intensive relative to capital-intensive commodity one, 
and the well-behaved technology exhibits constant returns to scale. 

In figure 1, the home country is depicted in free-trade equilibrium. 
Aggregate production is at point Q® on production-possibility frontier 
7’57'? (corresponding to K" and L"), aggregate income is represented 
by budget line (y'D". and aggregate consumption occurs at point D" 
on indifference curve /J/?. (For simplicity of exposition, it is assumed 
that all inctJtne earned by factors from abroad is consumed locally, to 
avoid having to show repatriation of such income within the diagram.) 
By the reasoning of Bhagwati and Brecher (1980), national consump¬ 
tion takes place at point D" on indifference curve /Jf", with national 
income given by budget line (7*0" (parallel to Q“D®), as if nationals 
produced separately at p<>int (7" on production-possibility frontier 
(drawn for K” and 7.").* To emphasize that the main results of 
this paper qualitatively do not require any differences in consumer 
preferences between nationals and foreigners within the home coun¬ 
try, assume throughout the text that the same set of indifference 

* For the special case in which K" > K" = 0and /-“ = L" > 0. see Bhagwati and 'I'ironi 
(1980). 

' The discussion could be extended readily to allow for the possibiJily of complete 
si>cdali/alion, following the analysis ol Bhagwati and Brecher (19801 
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Fk.. 1. — Diirt-rential trade-volume phenomenon 


curves with unitary income elasticities of demand represents both 
national and aggregate tastes in consumption, although this sim¬ 
plification of the exposition could be dropped (as in footnotes to this 
paper) without detracting trom the essence of the analysis.'' 

The model may be summarized conveniently as follows: 


= 1 . 2 ,; = a,n; 

( 1 ) 

K-* = X^i -I- pX‘ 2 , ; = a, n; 

( 2 ) 

W> = U’{C{,Ci),j = a,7K 

( 3 ) 

(.'^1 + />C'4 = Y*,j = a, n\ 

( 4 ) 


where p denotes the relative price of the second commodity in terms 
of the first; denotes output of commodity i on frontier T{T\', each F^, 
is a conventional function of p, given K‘, U and the (uniform) technol¬ 
ogy for commodity i; Y" and K" denote the real value of aggregate and 
national income, respectively, in terms of the first commodity; Cf and 
Cp denote aggregate and national consumption, respectively, of com¬ 
modity i (i = I, 2); W” and W" denote aggregate and national welfare, 

^ .\n. 6-8, 10, and 15 below extend the discussion to let tastes differ between 
nationals and foreigners within the home country. The-se extensions bring out the 
essentially “three-country” flavor of the analysis, in which nationals, domestically lo¬ 
cated foreigners, and the rest of the world can be treated as three distinct components 
of the international economy. 
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Flo. 2.—Dilfert'nual trade-paucrn phcnomt-non 


respectively; and each V> i.s a concave function of C\ and C> 2 , with 
pisitive partial derivatives denoted by L’‘,= dU‘ldC\(i = 1, 2). 

I^tcr in the paper wiieii a change in relative prices is induced by 
various parametric shifts, either of the following phenomena might 
lead to a fall in national despite a rise in aggregate welfare, depending 
on the strength ot other induced effects. The differential trade- 
volume phenomenon is shown in figure 1, where the aggregate (ac¬ 
tual) volume of trade (defined by line segment Q"D") is less than the 
national (hypothetical) volume t)f trade (defined by line segment 
(2"/?"), implying (ceteris paribus) that a terms-of-trade deterioration 
worsens national by more than aggregate welfare. Figure 2 (lalx^led 
similarly) illustrates the Bhagwati and Brecher (1980) differential 
trade-pattern phenomenon, which arises when the aggregate and 
national patterns of trade differ (in direction), so that an aggregate 
terms-of-trade improvement (tending to raise IF") means a national 
terms-of-trade deterioration (tending to lower W"). The national 
relative to the ^gregaie endowment of factors is labor abundant in 
^ure I (with A'"//," > K^/L") but capital abundant in figure 2 (with 
< K''/L’'), as sugge.sted by the relative shapes of frontiers T^V} 
and TJT^, in accordance with the reasoning of Rybczynsltt (1955). 
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To understand the possibility of a fall in national welfare despite a 
rise in aggregate welfare, it might be tempting to go no further than 
the following simple observation. Whenever the national and aggre¬ 
gate endowments exhibit different capital/labor ratios, the domestic 
distribution of income might deteriorate for nationals, as a change in 
relative commodity prices alters the wage/rental ratio for reasons 
expounded by Stolper and Samuelson (1941). It is important to rec¬ 
ognize, however, that generally this income-redistribution ef fect will 
not be strong enough to produce the differential responses in na¬ 
tional and aggregate welfare if the relative factor-endowment discrep¬ 
ancy is too small to create either the differential trade-volume or the 
differential trade-pattern phenomenon. Even when either of these 
phenomena ari,se.s, moreover, a fall in national welfare despite a ri.se 
in aggregate welfare can occur if and only if certain specihc conditions 
(derived below) are satisfied. 


III. International Transfer 

According to a standard result in the literature (see Mundell 1960), a 
transfer-receiving country cannot suffer a loss in aggregate welfare 
despite any possible deterioration in the aggregate terms of trade, as 
long as international corntriodity-market equilibrium is stable. In 
other words, the iransfei-induced change in IV’“ cannot be negative, 
assuming that an excess demand for or supply of the second good in 
world markets can be cleared by a rise or fall in p, respectively. As 
the following argument demonstrates, however, a (large) transfer¬ 
receiving country might suffer a deterioration in national welfare, 
even under the assumption (maintained throughout the present 
paper) that commodity markets are stable. This demonstration of a 
transfer-induced fall in IV", moreover, does not even require a rise in 
the relative price of home importables. 

If it is assumed that the transfer is given only to nationals, equations 
(2) are modified as follows; 

= XJ + pXj, + T,y = «. n, (5) 

where r is the real value of the transf er in terms of the first commod¬ 
ity. If any part of the transfer were given to foreigners within the 
home country, the chances for a decline in IV" would simply be 
enhanced, thereby strengthening the argument below. 

To examine the welfare implications of the transfer, differentiate 
equations (1), (3), (4), and (5) totally with respect to t —assuming 
(without Joss of generality) that initially U\ = 1, while noting that 
ljyu\ ~ p — —{dF\ldp)/(dFydp) from the first-order conditions for 
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maximizing utility and profit. In this way, it is a straightforward 
exercise to derive 

ftW^/dr = 1 + (E^dpldr),] = a, n, (6) 

where Consistent with figures 1 and 2, which depict the 

home country exporting the second good, > 0 by assumption 
throughout the present paper. As illustrated above, however, /s" can 
be either positive (in fig. 1) or negative (in fig. 2). 

As equations (6) confirm, dW^ldr is the familiar sum of the follow¬ 
ing two components; the primary gain (= I) from the transfer- 
induced increase in aggregate income, at the initial (pretransfer) set 
of relative prices; plus the secondary effect (= E"dp/d^) from the 
possible increase or decrease in the real exchange value of the initial 
volume of home exports, in the event of a transfer-induced change (if 
any) in relative prices. The expression for dW^ldr is analogous. If 
foreign inputs were entirely absent from the home country, the dis¬ 
tinction between national and aggregate variables would disapptear, 
thereby implying that A'" = /•:*• and (hence) that dW"/dr = dW“/dT. 
Given the actual presence of factor inputs from abroad, however, 
dW'^ldr generally differs from dW°/dT, except in the special case where 
either E” = f.® (despite the foreign presence) or dpldr = 0. 

To determine precise conditions ft)r the direction oi change in 
welfare, consider the standard transfer-induced terms-of-trade re¬ 
sponse, analyzed previously by Samuelson (1952, 1954) and sub¬ 
sequently by Mundell (1960). Thus, by well-known reasoning, 

dpldr = (1 — m — m*)l(e + e* — \)E'‘, (7) 

where c (> 0) and m denote the relative-price elasticity of import 
demand and the marginal propensity to consume the importable, 
respectively, for the home country; e* (> 0) and m* denote the 
corresponding variables for the rest of the world; and t = 0 in the 
initial (pretransfer) equilibrium.® Given the assumption above that 
world commodity-market equilibrium is stable, c -F c* > 1 throughout 
the present paper. 

If equation (7) is substituted into equations (6), simple manipulation 
confirms that 

dW/dr = (€ + €*)/(f + c* - 1) > 0, (8) 


* If preferences in consumption were allowed to differ between naiionak and foreign- 
c'l s within the home country, it would be necessary to rewrite (7) as follows, to reflect 
dte present assumption that the enure transfer gi«s exclusively to nationals: 

(ip/dr = (1 - m» - OT»)/(e + e* - 1 )/• *, (7') 

where m" denotes the national marginal propensity to consume the home importable. 
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but shows that 

dW'Idr g 0 as (<■ + e* — \)E" g (»» + m* — !)£'", (9) 

where e (> 0) and e* (> 0) denote the compensated (constant-utility) 
relative-price elasticity of import demand for the home country and 
the rest of the world, respectively, while e = t + m and p* = e* m*, 
according to a standard decomposition.^ Although dW^/dr > 0 unam¬ 
biguously, it is evidently possible to have dW^/dr < 0 nevertheless.* 
To highlight the important role of the differential trade-pattern 
and differential trade-volume phenomena, it is helpful to revert to 
equations (6), which imply that a fall in national despite the rise in 
aggregate welfare can occur only if (£" — E‘‘)dpldT < 0. This necessary 
condition for a fall in IV holds if either dpldr < 0 in presence of the 
differential trade-volume phenomenon of figure 1 (where T" > E" > 
0) or dpldr > 0 in conjunction with the differential trade-pattern 
phenomenon of figure 2 (where t'" < 0 < (Correspondingly, if 
home exportables were relatively intensive in their use of capital 
(rather than labor), a transfer-induced deterioration in national 

' Alternatively, if c<| (7') from n. 6 above were substituted into eqc|. (6), .simple 
manipulation could show that 

(IW'‘I<It = [(« + «*) + (I — y)(m' — + r* — I) (S’) 

and 

dW'/dT = [(« + «*) + (I - y){nt' + m* - l)]/(r + e* - 1). (9') 

where m' denotes the marginal propensity to toasume the home importable for foreign¬ 
ers within the home country; y = (CJ - X7)/(LJ - X?) = and use is made of the 

(art thatm = ym’ + (1 - y)nr'. Eq. (S') indicates that dfV/rfr can be deromposed into two 
comparative-static components. As could be shown readily, the first component I(e +■ 
e*)/(c + c* - 1)1 is the transfer-induced change in W" that would ixcur initially if the 
transfer were given tein]Kirarily to nationals and domestically Itxaied foreigners in the 
respective amounts yr and (1 - y)T, whereas the second roiiiponent 1(1- y){m' — 

+ r* — 1)1 is the subsequent change in IV that would occur as the portion (1 — y)T was 
pas.sed f rom domesucalfy located foreigners to nationals (the ultimate recipients of the 
entire transfer). Eq. (9') could be interpreted analogously, since —dW'Idr equals the 
worldwide sum of transfer-induced changes in welfare for everyone excluding homc- 
touniiy nationals, as could be shown readily. 

’ Under the present assumption that in' = wi" = m, eq. (8') of n. 7 above is equivalent 
to eq. (8), while eq. (9') leads direcUy to condition (9), AllernaUvcIy, if it were the case 
that m' ^ Bi", there would arise the new possibility of havingdIV“/</T < 0 in eq. (8'). Also 
if It were supposed that m' = 1 - m*. it would lie the c ase that dWIdr > 0 unambigu¬ 
ously in eq. (9'). This last result can be understcKxl intuitively as follows: If foreign 
tastes are uniform throughout the world, the reasoning behind eq. (8) shows equally 
well that the transfer must lower worldwide foreign welfare; that is, —dW*ldr < 0, 
recalling n. 7 above. Incidentally, in view of the fact that worldwide foreign welfare- 
otherwise can rise (when dW'Idr < 0) if m' m*, international aid might be especially 
attracuve for a donor country with investments in the aid-receiving economy. 

• Although (£" — E'‘)dpldr < 0 also if 0 < £" <£“ when dpldr > 0, dW'Idr > 0 in this 
case, as implied by eejq, (6), The reader may also see alternatively that, from condition 
(9), dW'Idr < 0 if and only if (t + <*)£" + (£“ — £")(e + r* - I) < 0. Therefore, national 
welfare may decline despite the increase in aggregate welfare if £" < 0 < £“ (i.e., the 
differential trade-pattern phenomenon holds) or if £' > £“ > 0 (i.e., the differential 
trade-volume phenomenon holds). 
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(though not in aggregate) welfare would still be possible, provided 
that either the aggregate terms of trade improve in the case of labor- 
abundant nationals or an aggregate terms-of-irade decline occurs in 
the presence of capital-abundant nationals. 

Consequently, the basic results of this section can be summarized 
generally in the following terms. When the home exportable uses 
intensively the factor that is relatively abundant in the national (as 
compared with the aggregate) endowment, the national and aggre¬ 
gate patterns of trade are the same, in which case a fall in national 
welfare might ticcur through a differential trade-volume phenome¬ 
non if the (national and aggregate) terms of trade worsen unambigu¬ 
ously. Alternatively, when the home exportable uses intensively the 
factor that is relatively .scarce in the national (as compared with the 
aggregate) endowment, the aggregate and national patterns of trade 
could differ, in which case the differential trade-pattern phenomenon 
might give rise to a deterioration in national welfare if the national 
terms of trade worsen through an aggregate terms-of-trade im¬ 
provement. These general results, moreover, hold equally well for 
changes in p induced by economic expan.sion and tariff policy, as will 
be clear from the analysis below. 

IV. Economic Expansion 

As Bhagwati (I958^t) has demonstrated, a once-for-all increase in a 
factor endowment or in a technological level might deteriorate the 
aggregate terms of trade enough to worsen aggregate welfare ol the 
home country, but this irnmiserizing growth can occur only if either 
the rest of the world has an inelastic offer curve or growth would 
decrease the production of home importables at the initial product- 
price ratio. In other words, if the offer-curve elasticity for the rest of 
the world is not less than unitary and economic expansion is not 
“ultrabiased ” against the production of home importables, the 
growth-induced change in IT" cannot be negative. Even under these 
circumstances (assumed throughout the present section) which pre¬ 
dude a fall in aggregate welfare, however, the following analysis 
demonstrates that a (large) country might suffer a loss in national 
welfare. This demonstration of a growth-induced decline in IT", 
moreover, does not even require a rise in the relative price of home 
importables. 

T<i allow for factor-endowment expansion or technological ad¬ 
vance, equations (1) may be rewritten as follows: 

X\=F\(p,e).t=\,2,}=a,n. (10) 

where 0 is a general shift parameter, ji rise in which indicates either a 
factor-endowment increase (for or U) or a disemlKKlied 
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technological improvement for an industry (one or two). It is assumed 
that any addition to the aggregate supply of capital or labor is owned 
fully by nationals. If any part of such addition were foreign owned, 
the likelihood of a decline in W" would simply be enhanced, thereby 
strengthening the argument below. However, the ability of domesti¬ 
cally located producers to take advantage of disembodied technologi¬ 
cal progress should be independent of the source of ownership of the 
inputs used, as assumed here. 

Differentiating equations (2), (3), (4), and (10) totally with respect to 
0, while recalling that U\ = I initially and that ~ P — —{dF\ldp)l 

{dF‘Jdp), we readily obtain the following result: 

dW‘lde = + {EHpIde), i =a,n, (11) 

where Y‘g = dYHdO > 0. Thus, each dW^ldO is the sum of a primary 
growth effect (Y^g) plus a secondary relative-price effect (FJdp/dB), 
which are analogous to the welfare-related effects of the transfer 
mentioned above in Section 111. Although national and aggregate 
welfare again would remain equal if foreign inputs were entirely 
absent from the home country, the actual presence of foreign 
ownership gives rise to the possibility of having dWIdO < 0 when 
d\V"/d0 > 0, except in the special case where dpidd = 0. 

Turning to the standard growth-induced terins-of-trade response. 
analy7cd previously by Bhagwati (1958ft) and subsequently by Kemp 
(1969, p. 110), we see that it is a well-known fact that 

dpide = (/3 - m)Y%l(f + e* - ]]£«, (12) 

where /3 = (dX1/dd)/Y^. When this result is substituted into equations 
(11), straightforward manipulation confirms that 

dW^/dS = (e + /3 + e* - l)Y'i/(e + e* ~ 1) > 0 (13) 

but shows that 

dW”/de $ 0 as/•;«(? +e* - I)Kg g h'-fm - /3)K8, (14) 

where )3 S 0, which recalls the assumption that growth would not 
reduce production of home importables at the initial commodity- 
price ratio; and f* S I, which recalls the assumption that the rest of 
the world’s offer curve is not inelastic. Thus, despite the fact that 
dW"ld6 > 0 unambiguously under these circumstances, it is still possi¬ 
ble to have dW'/dB < 0 nevertheless.'® 


Along lines suggested by nn. 6-8 for the case of international transfer, the analysis 
of economic expansion could be extended readily to dLstinguish between m” and m'. It is 
worth noting, however, that it wt>uld sull be possible to have dW^fdO < 0 even if it were 
the case that = I — m*. 
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Equations (11) imply that a fall in national despite the rise in 
aggregate welfare can occur only if {£" — E'’)dpldd <Yi~ Eg. As could 
be shown readily, this necessary condition for a fall in W” can result 
from each of the following alternative events, for example: an in¬ 
crease in either the national endowment of capital or the technologi¬ 
cal level of industry one, with dpIdS > 0 in the presence of the 
differential trade-pattern phenomenon (fig. 2); and an increase in 
either the national endowment of both factors or the technological 
level of both industries, if dpidff < 0 with the differential trade- 
volume phenomenon (fig. 1).“ Correspondingly, if home exportables 
were relatively capital intensive, it would be possible to have an 
expansion-induced deterioration in national (though not in aggre¬ 
gate) welfare under a variety of circumstances, including the follow¬ 
ing: an increase either in the national stock of labor or in the level of 
technology for the production of importables, when the national 
endowment is labor abundant; or an increase either in the national 
endowment of both factors or in the level of technology for both 
sectors, when nationals are capital abundant. 


V. TarifT Policy 

According to a standard result in the literature (see Bhagwaii 1968), 
free trade is ranked superior tt) both no trade and subsidized trade 
(assuming that both offer curves are well behaved),’* from the view¬ 
point of aggregate welfare. In other words, the home country cannot 
increase IE" above the free-trade level either by using an import (or 
export) lax to eliminate trade or by imposing an export (or import) 
subsidy to encourage trade. From the national-welfare f>oinl of view, 
however, the ranking above may be reversed. Since Bhagwati and 
Brecher (1980) already demonstrated the possibility of such a reversal 
for free trade versus autarky, the following analysis concentrates on 
free versus subsidized trade. 

Fo allow for tariff policy, equations (2) may be modified as follows: 

= XI + pxi + [(C? - X?) a/(l - a)], i = a, n\ (15) 

where a denotes the ad valorem tariff, which is an import lax (if a > 0) 
or an import subsidy (if a < 0); the domestic relative price of the 

" Although F| = EJ with a national faaor-endowmcnl increase, 11 is possible that >'J > 
Ej for a technological advance. Fhus, with the latter (but not the former) type of 
etonomic expansion, a fall in national welfare despite the ri^ in aggregate welfare 
might occur even without the differential trade-volume and differential trade-pattern 
phenomena—if both industries experience the technological advance and dpld9 < 0. 

** I.e., the offer curve is assumed to represent imports as a monotonic decreasing 
function of their relative price in world markets. For the significance of this assumption 
m tariff analysis, see Bhagwati and Kemp (1969). 
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second good is still denoted by/>, so that the relative price of this good 
in world markets is now equal to /»(! + a); and (C° — X^)a/(l + a) 
equals the real value (in terms of the first good) of tax revenues or 
subsidy payments, evaluated at domestic prices.In writing equations 
(15), it is assumed (for the sake of simplicity) that all tax revenues or 
subsidy payments, respectively, are returned to or collected from 
national consumers in lump-sum fashion. If foreigners within the 
home country were to receive or finance any part of these revenues or 
payments, respectively, comparison of the Iree-trade and autarkic 
equilibria (which generate no lax revenues) clearly would be unaf¬ 
fected, while the chances of having free trade inferior to subsidized 
trade simply would be enhancetl (thereby strengthening the analysis 
below). 

To show that free trade might be inferior to subsidized trade from 
I he national point of view, it is sufficient to establish the possibility of 
having < 0 in free-trade equilibrium. Consequently, through¬ 

out the following discussion, let a = 0 in the initial (pretariff) 
equilibrium. 

Differentiating equations (I), (3), (4), and (15) totally with respect to 
a and again recalling that = 1 initially and that UyU\ = p = 
— (dF\/dp)l((lFydp), we may verify readily that 

dW‘lda = pE" + (E^dplda), j = a,n; (16) 

note that/t/T" = f.',‘ - X? when (balanced) trade is initially free (with 
a = 0). By well-known reastining (see Kemp 1969, p, 96), 

dpida = p(l ~ m — c*)/(c -I- c* - 1); (17) 

note that 1 — m equals the home country’s marginal propensity to 
consume the exportable and recall that a = 0 initially. When this 
result is substituted into equations (16), simple manipulation confirms 
that'^ 


dW'Idoi = epE"l{i' -(- c* — 1) S 0 (16) 

but shows that 

dW/da g 0 as (c -I- c* - 1 )E" g (m -)- c* - 1 )/'.■”. (19) 

Thus, despite the fact that dW'‘lda = 0, it is evidently possible to have 
d\V”fda < 0 nevertheless.''’ 


Although the corrcspttnding value MworUi prices would be (f^l - X1)a, consumers 
respond directly to dimeHir (tarifl-inclusive) prices instead. 

.Note that dW'/da = 0 only in the small-tountry case where e* = and even then 
the change in W“ does not equal zero for any discrete change in q, by well-known 
reasoning. 

'* Along lines suggested by nn. 6-8, the analysis of tariff policy could be extended 
readily to let m' ^ m", without eliminating the po.ssibility of having dW"/da < 0 even if 
= 1 - m*. 
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As implied by the equations (16), it is possible to have dW'Ida < 0 
(even though dW'Ida cannot be negative) if either a differential 
trade-volume phenomenon arises (fig. 1) when < 0 (the “nor¬ 

mal” price response) or a differential trade-pattern phenomenon 
occurs (fig. 2) when dpida > 0. (The “perverse” price response [dpida 
> 0] can occur only in the large-country case, under conditions 
discussed by Metzler [1949].) (Correspondingly, if home exportables 
were relatively capital intensive, it would be possible to have dW'‘lda 
< 0 (even though dW^lda cannot be negative) if either nationals are 
labor abundant when dpida > 0 or nationals are capital abundant 
when dpida < 0. 

Thus a trade subsidy might raise national (but not aggregate) wel¬ 
fare above the free-trade level. This analysis of a small subsidy (tax) 
on trade, moreover, complements the discussion of Bhagwati and 
Brecher (1980), who concentrate on prohibitive taxes on trade and 
thus are able t<i avoid the issue of tariff revenues. 

The analysis of this section has an important implication for the 
traditiotial tnethod of estimating the cost (benefit) of tariff protection 
or trade liberali/ation, Sitice the conventional method (as outlitied by 
Johnson (1960j) igtiores the .source of ownership of domestically 
located inputs, the concept mea.sured (in present notation) is clearly 
dlV'Ida rather than dW^lda. Thus, the traditional estimate of the 
impact of protection or liberalization is an aggregate measure, w hich 
overstates or understates the national cost (benefit) if (E" - E'’)dplda 
S 0, respectively, as suggested by equations (16). This misstatement 
ari.ses because the conventional estimate simply sums the three stan¬ 
dard components (namely, the external terms-of-tratle effect and the 
costs of disttjrtion in lx)th production and consumption), while it fails 
to exclude the foreign-factor jMrrtion of the tariff-induced change in 
aggregate welfare.'" 

VI. Summary 

As demonstrated by this paper, welfare aspects of international trade 
theory need to lx? reconsidered, when national and aggregate income 

'•’t he lorcign-facliir porlion ot the <hange in aggregate welfare is repre.sented by 
the expression (H" - t','''^filda. which must tie excluded fromrfU’"/rfa to givcdlV’/do, as 
suggested by et|<| (Iti), .Also by rcfrealiiig the procedure of Ithagwali. Kainaswanii. ai«t 
Siinivasan (Itt69), it is pos!>ible to write (in present noutiim) that dW ^lda ~ {E°dp*l 
da) + ((/»* - p)dXilda\ + fip - p*)d(:pda\. where the relaiive price of die stxond gtxxl 
m wot Id markers is denoted/t*, which e<|U.ils/i(l +• a). Vhe n>m\H>ncniiE°dp*lda. IJ>* — 
p)dX\lda, and (p - p*)d(^/da ate the effects due to the tcrnw-ol-trade change, the 
production dLslortion, and die consumption dislorlion. resjiectively. When evaluated in 
free-trade etpnlibrinm (where a = 0 and p = p*). the latter two (distoruon-related) 
components disappear, leaving only the first (terms-of-tradc) component. This re¬ 
maining (first) tomponent, moreover, is e({uivalent to the right-hand side of ecjq. ( 16 ) 
for; = <1, since (in free-trade e<|uilibrium) = (dp*lda) - p. 
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differ in ihe presence of foreign ownership. Examples of this need 
are provided by the analysis of international transfer, economic ex¬ 
pansion, and tariff policy. For a country receiving a transfer from 
abroad, national (but not aggregate) welfare might deteriorate even 
when international commodity-market equilibrium is stable, regard¬ 
less of the direction of change in the world product-price ratio. In the 
case of economic expansion from factor-supply growth or technologi¬ 
cal advance, national (but not aggregate) welfare might worsen even 
when the rest of the world does not have an inelastic offer curve 
and domestic expansion is not ultrabiased against production of 
home importables, no matter what the direction of change in the 
world commodity-price ratio. As for tariff policy, free trade might 
be ranked inferior to both no trade and subsidized trade (in either 
direction), from the viewpoint of national (but not aggregate) welfare. 
■Moreover, the conventional empirical estimates of the cost of protec¬ 
tion a la Johnson’s (1960) methodology are generally seen to be in 
need of correction if the economy has foreign-owned factors of pro¬ 
duction. In fact, many economies typically do have substantial labor 
inflows under gaslarbeiter or other programs dehned by immigration- 
quota policies, and, of course, equally there are substantial flows of 
capital among nation states." 
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Accounting for Price Changes: 
American Steel Rails, 1879-1910 


Robert C. Allen 

Vnn'er\ity of British ('olumhin 


A framcudtk is developed for decomposing product price changes 
into changes in input prices, technical efficiency, and deviations of 
price from unit cost. I’his framework facilitates the measurement 
of productivity growth in noncompetitive industries. The history of 
American steel rail prices between 1879 and 1910 is analysed, and it 
is concluded (in contrast with much recent work) that productivity 
growth remained rapid until the twentieth century and that the steel 
industry was sufficiently collusive so that the rail producers received 
the benefits of that productivity growth as excess profits. 


I 

I he 1890s mark a pronounced change in the position of the Ameri¬ 
can steel industry in the world market. Before the 1890s American 
prices substantially exceeded British prices, and the American indus¬ 
try achieved a large size only because of high tariffs. During the 1890s 
American prices dropped to British levels or below, and America 
emerged as a major exporter of iron and steel. This change in price is 
apparent in the case of rails, which was one of the leading steel 
products of the time. Between 1881 and 1890 the average price of 
steel rails at Pennsylvania mills was $37.01 while the average British 
price was $23.62. During the period 1906-13 the American price had 
fallen to $28.00 while the British price had risen to $29.46 (Allen 
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gestation period of this papier. I also thank Michael B. Perey. Peter Chinloy, (). F. J. 
Boonekamp, Frank Lewis, Peter Lindcrt, E. R. Berndt, and David Donaldson for their 
comments on earlier versions of the paper. 
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1979). To explain the emergence of the United States as a steel 
exporter, it is necessary to explain the fall in American steel prices.' It 
is the aim of this paper to account for the decline in the price of 
Bessemer steel rails. 

There is no shortage of possible explanatory factors. During the 
late nineteenth century low-cost iron ore fields—particularly the 
Mesabi in Minnesota—were exploited, and improvements in lake 
shipping services were effected. Both developments substantially low¬ 
ered the cost of iron ore in the Midwest and hence the cost of 
Bessemer pig iron, which was the principal input in steel rails. 
Moreover, many observers at the time believed the rate of productiv¬ 
ity gmwth to have been very high in the American steel industry. This 
development would also be expected to have lowered the price of 
American steel. On the other hand, there were several attempts on 
the part of producers U) collude and thus raise the price of steel rails. 
With the formation of U.S. Steel in 1901 the difficulty of enforcing 
collusive agreements was reduced. In the rase of steel rails there is 
prima facie evidence that producers subsequently exercised monoj> 
oly power. Prior to 1901, the price of steel rails had changed fre¬ 
quently. In that year, however, the price of .steel rails was .set at $28 
per ton, and it remained unchanged until the First W'orld War. Such 
stability was unprecedented in the industry and indicates some power 
to control the price but does not in itself prove that price was set above 
cost (lemin 1964, pp. 192, 284-85). 

The procedures that economic historians have used to analyze price 
changes have not been fully .satisfactory, 'ihe technique most firmly 
rooted in theory is the applic ation of an identity derived by jcjrgenson 
(1966, pp. 2-4). The identity asserts that the rate of change of the 
price of the product ec|uals an index of the rates of change of input 
prices minus a residual which is usually interpreted as the rate of 
technical progress. Mc(.loskey (1973, p. 99) useci a relative levels 
version of this identity to argue that—since the ratio of the price of 
rails to the price of pig iron (whic h was regarded as a proxy for the 
index of input prices) was trendless after the 1880s—there was no 
productivity growth after that date.* 1 he difficulty with this proce¬ 
dure is that the identification of the residual with technical progress is 
correct only if the industry is in competitive long-run equilibrium. 
This assumptiem was barely mentioned in Jorgensons discussion of 
the identity,* and, indeed, it is not immediately obvious from the 


' Ic i,s also nec rssary to explain the rise in British pnees. t his aspec I of the problem is 
lonsidcrccl in Allen (1970). 

' l emin (1964. pp. 218-19) develops a similar argumciu. 

' T he discussion is conhned to jorgeiison (1966. p. 4. n. I). 
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.he Hssumpuon has Ix-en invoked. However, a 
^Le which decompows pnce change, ,mo efficency and .npn. 
Trice effects only il Ow .ndusuy is in compeuuve Jong-run equiJibnum 
is not H ,satisfactory f ramework for analyzing American steel prices in 
the fate nineteenth ancf curly twentieth centuries. 

Reflection suggests that changes in market power iiiHuente the 
difference between input and output prices. This realization has also 
been exploited by economic historians analyzing the evolution of steel 
prices. I'hus McCdoskey (1973, pp. 24-28) examined the ratio of rail 
to pig iron prices in order to investigate the effectiveness of various 
attempts at collusion by rail producers in Britain, femin (1964, pp. 
186-92) undertook similar investigations for the United States, 
(’.learly, c hanges in monopoly power and technical progre.s.s cannot 
Ixjth be inferred from the ratio of output to input prices. At best, only 
their combined effect can be identified. 

What is rec|uired to account for the fall in the price of American 
steel is an accounting scheme that identifies the separate effects of 
input price changes, effic iency changes, and changes in the deviation 
of price from unit cost. Such an accounting scheme is developed in 
Section II for a single product firm (an analogous scheme for a 
multiproduct firm is developed in an appendix).'' In Section III the 
sc heme is used to account for the fall in the price of steel rails between 
1879 and 1910. The data discussed there indicate that declines in 
input pi ices and increases in efficiency both exerted powerful dckwn- 
waid pressures. Indc'ed, the late of technical progress was at its 
highest in the 1890s. The greater effectiveness of collusion after the 
U.S. Steel merger, however, meant that these cost declines were not 
entirely reflected in lower prices. Indeed, profit margins were raised 
enough between 1889 and 1902 so that rail producers realized most 
all of the gains of technic al progie.ss as higher profits. In Section IV 
these conclusions are compared with McCiloskey’s and Temin’s 
findings. Jorgenson’s price accounting identity is examined in detail 
in order to make explicit the necessity of assuming competitive long- 
run equilibrium if one tries to infer the rate of productivity change 
from diff erences in the rates of change of input and product prices. It 
is also shown that substantial errors result if this assumption is er¬ 
roneously maintained. 


1 hc'dutfior has extended die puceaccoumingframewdilc disc u.s.sed here to theca.se 
of a multiple output firm. The multiple output case Is developed in Allen (1978, pp. 
7-1 1) and in an appendix to this paper which is available from the author upon 
ret)Uest. 
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1 o develop the price accounting scheme for a single product firm, it is 
assumed that the firm minimizes costs subject to the billowing pro¬ 
duction function; 


(m=^A{t)-f{X(l)l (1) 

in which Q(/) is output at time t, X(l) = [X,(0. X-iH) .is the 

vector of inputs used at time t, and A(t) is an index of Hicks neutral 
technical progress. 1 he assumption ot Hicks neutrality can easily be 
relaxed (Diewert 1980, pp. 489-95). The function/ is assumed to be 
linearly homogeneous. If / were known, the relative increase in effi¬ 
ciency between andt, could lx? computed fromO(/,),0(<2),X(<,). and 

Aih) ^ .2) 

A(l,) /lX(/,)]//[X(4)J • ^ ' 

The term is the relative increase in output that actually 

(Kcurred. The term/[X(ij)]//LX(/,)J = A(t,) ■ /lX(ig))/A(e,) ■ flX(t,)] is 
the relative increase in output that would have tKcurred in the ab¬ 
sence of technical change but given the change in input quantities that 
actually occurred, fhus/4((2)//4(t,) equals the proportitmal increase in 
output that is not attributable to the increase in inputs. 

If the firm minimizes cost, then there exists a total cost function 
which is factorable into a unit cost function r* multiplied by the 
production rate. Under the assumption that technical progress is 
Hic ks neutral, the unit cost function can lie written as a lunction of 
the input prices divided by A(l): 

• (>(/) = ciiv(0] ■ Q(0/d«). (3) 

Here ir(/) = IIV,(I), . .., lf'»(0l is the vector of A input prices. It is 

presumed that the supply to the firm of ,V(0 is perfectly elastic at U'{0. 
rhe right-hand equality implies that the relative change in unit costs 
can lie decompo.sed into an elficiency component and an input price 
c ornponent: 

(^ 2)1 _ A(tf) cjlf (^j)] 

A(tt) rlH’(/,)]- 

Ihe term d(< 2 )/.'I(/,) is relative efficiency in the two periods and 
'lh’(<*)l/clH’(/,}] is the relative increase in unit cost that would have 
obtained had there been no prcxiuciivity growth but given the actual 
< hange in input prices. Kcpiation (4) provides a sec ond method for 
measuring productivity growth, ife were known, thenr[H (<i)l 
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could be computed by direct substitution. Dividing that result by the 
observed change in unit costs would yield/I(Ij)//!(I,). From the per- 
sjjective of equation (4), the increase in efficiency equals the relative 
decline in unit costs not attributable to declining input prices. 

The price of the product will always be related to unit costs by the 
fbihwing identity; 

P(t) = M(l) ■ c*[I,lT(0], (5) 

where M{t) is the implicitly defined markup of price over cost. For a 
competitive industry in long-run equilibrium, Af(/) = 1. One would 
expect to observe Ai(t) usually not equal to one. The relative change in 
price betu’cen /, and /j can be written as 

P(l2) = M(/,) c*[4.1F(<,)] 

F(/,) M(t,) cn<„lV(!,)]• ' ^ 

Equation (6) could be used to decompose a change in price into a tost 
change and a markup change. Equations (2) and (4) provide a basis 
for decomposing the tost change into input price and efficiency 
(omponents. 

If r and / were known, equations (2), (4), and (6) could be used to 
decompose price changes into markup, efficiency, and input price 
changes. In an empirical study, of course, the functional forms off 
and / would have to be specified and the parameters of the function 
estimated. Prodigious task! Recent work in index number theory (e.g., 
Diewert 1976, 1980) has shown how such elaborate undertakings can 
be circumvented, (,'orresponding to each of the func tional forms that 
would likely be used to specify c or/"are one or more index number 
formulae. The indexed input quantities (oi- input prices) would indi¬ 
cate exactly/[,V(< 2 )J//[X(/i)] orr[lV(l 2 )]/clW(I,)J on the assumption that/ 
ore had the functional form that correspHuided ter the index number 
formula used. For example./[XOj)]//[X (I,)] would ecqual a l.aspeyres 
or Paasche index of the input quantities if f were Leontief. If/ W'ere 
Cobb-Douglas,/[X(^ 2 )J//lX(fi)] would equal a geometric index of input 
quantities. If/were a cjuadratic function,/[X(l 2 )]//[X (I, )J would ec]ual a 
Fisher ideal index, and if / were transiog, then/[X(< 2 )J//[X(/,)J would 
equal a Tcirnqvist index of the input Cjuantities.* Analogous results 

* J hvse ec|uivalen< <■>1 are dLs<usiwd in Diewei l (1980, pp. 446-5V) I'hc linearly 
homogeneous iran.slog function, whi<h is used extensively in this paper, is 

In f[X(l)] = a„ + X X Z 

i-l m-l n-l 

subject to the restrictions. = 1, Pm, = P,m, and = 0 for m = 1,2 . * 

Diewert (1976) has shown that 
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hold Fore and input price indices. It should be noted that in the index 
number approach to the measurement of/IX(t 2 )]//IX(<,)] or c{W{li)\l 
it is not necessary to know the values of the parameters ot 
/ or f. Instead, the index number formulae require the values of the 
factor shares, estimates of which are frequently easier tr> obtain than 
estimates of the parameters of / or c. 

In an empirical study, one should choose the formula for indexing 
input prices or quantities that corresponds to the true structure of r 
and /. In the absence of any prior information on the true structure, 
the choice of an index number formula must be made cjn the basis of 
w'hich possible specification of/ ore would best approximate the true 
structure, whatever it is. Since Leonlief and Cobb-Douglas functions 
can be regarded as first-order approximations to / or r, while qua¬ 
dratic and translog funedons can be regarded as second-order apprttxi- 
mations, one would expect the quadratic and translog functions to fit 
the data more closely. Hence, there are grounds for preferring the 
fitrnqvist and the Fisher ideal indices to the Laspeyres, Paasche, or 
geomett ic. Since the Fisher ideal and I'drnqvisl indices are exact for 
functions that can Ite regarded as second-order approximations to 
any linearly homogeneous production, co.st, nr expenditure function, 
Diewert (1‘.176) has termed the 'Fdrnqvist and Fisher ideal indices 
‘‘.superlative." In the remainder of this paper, translog functions and 
IVirncivist indites will lx; used. 

'I'here are two ways to pnxeed. depending on whether f)ne mea¬ 
sures d(/ 2 )/d{/,) with equation (2) or (4). If one uses equation (2), one 
presumes/ is translog, so/[X(<j)J//lX(t,)] equals a rdrntpist index of 
inputs. Hence, 


jIM = ^ rt r OSl.yXAh) 

MI J 

(amsi.stency re(|uire,s that clir(/j)l/cllF’(/,)] l)e measured implicitly by 
substituting df/jVdft,) from equation (7) into eciuation (.t): 


cllF(<,),IV'(t,)l = 


c*[<z.n'(<»)] 

c*U,.ll'(/,)l 




(8) 


/1X(I,)| ^ M [ -X-KG) l"*^**^**^ ^ 

/lX(t,)) }.! I J 


whkh is a lornqvixt index of X(<). Here, <,(/) is ihe share in costs of X',(f). 
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Rearranging equation (8) indicates that the change in unit costs can be 
expressed as 




t\W(t2).W{t,)V 




i»i 


(9) 


Substituting equations (7) and (9) into equation (6) provides one 
scheme for decomposing product price changes into input, efficiency, 
and markup components: 


P(t,) _ M(Q A(t,) 
P(/,) A((,) 




( 10 ) 


IF one uses equation (4) to measure productivity growth, a slightly 
different accounting scheme emerges. In this case, one presumes r is 
translog. so ([iV{t^)]U equals a Tdrnqvist index of input prices: 


rllV(/2)l ^ fy IV, 


( 11 ) 


Ktiuation (4) t an then be used to implicitly define an elficiency index: 

jp n = n (12) 


Analogously to etpiation (9), one can decompose unit cost changes as: 


^n<2.iv«2)i 



IV,(4) 
IV,«,) 


1 ", 


/A ((z,/i). 


(13) 


Substituting equation (I 3) into equation (6) provides the second price 
accounting scheme: 


^(^ 2 ) _ Af (( 2 ) 
P(t,) M(t,) 



(14) 


E(|uations (10) and (14) are each internally consistent but differ in 
the presumptions they make about the way the data were generated. 
In equation (10) the presumption is that the data were generated by 
minimizing costs subject to a translog production function. In equa¬ 
tion (14) the presumption is that costs were minimized subject tfi the 
production function whit h generates a translog cost function. Since a 
translog production function does not yield a translog cost f unction 
(and conversely), different accounting schemes follow from the dif¬ 
ferent assumptions. However, since translog productitm and cost 
f unctions both provide second-order approximations to the technol¬ 
ogy, one would expect the two accounting schemes to give similar 
results. That, indeed, is true in the case of steel rails. 
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Table 1 presents details of the costs of producing steel rails in the 
United States in 1879, 1889, 1902, and 1910. The figures for the 
latter two years are derived from the U,S. Commissioner of Corpo¬ 
rations (1913) investigation into costs, prices, and profits in the steel 
industry. The commissioner obtained and published deuils of the 
cost of producing most steel products made in the United States 
between 1902 and 1910. The figures for 1902 are averages for firms 
producing virtually all of the heavy steel rails made in America in that 
year. The 1910 figures are averages for all U.S. Steel’s Bessemer rail 
mills. These mills produced 60.2 percent of American production in 
that year. The cost figures for 1901 and 1903-9 that the bureau 
reports are quite similar to the 1902 and 1910 figures, so conclusions 
founded on those years are indicative of conditions throughout the 
first decade following the U.S. Steel merger.* 

1 he cost figures for 1879 and 1889 are less certain than those for 
1902 and 1910. The 1889 figures are based on the U.S. ('ommissioner 
of Labor’s (1891) survey of steelmaking costs in the United States. 
Great Britain, and the European continent. Although the cost esti¬ 
mates were based on the cost accounts of steel firms, the coverage was 
not as broad as the U.S. Bureau of <-orporations survey. Moreover, 
the U.S. (Commissioner of Labor (1891, p. 163) Report expressed 
concern that some of the leading American rail mills had refused to 
supply cost information. The 1879 figures are also from an interna¬ 
tional comparison of production costs. This survey was conducted by 
Alexander L. Holley, who was the leading American steelworks en¬ 
gineer. The survey was published privately by the Bessemer Associa¬ 
tion, which owned the American rights to the Bessemer patents, for 
the use of its licensees. Holley visited several European plants in 1879 
and reported their costs. For comparison, he provided “rcpresenta 
tive costs” in the United .States. It is the latter costs which are used in 
tabic 1. Ciiven Holley’s knowledge, the audience he was writing for, 
and the confidential nature of the communication, one presumes the 
American cost figures are indeed representative.^ 

‘.See U.S. UDinmissiomT of t'.orpoialion.s (1913. pp. 209-10, 46l> for 1910 rail 
priKliKlion figures arnl pp. 143-49, 209-16. ,346-68 for information on coses in the 
Ollier years, 

' It should be noted that none of these sources reports the tfuanuiy and price of all 
inputs consumed. That practice was followed in tlie ca.se ol metallic inputs, but m the 
(ascs of fuel, labor, miscellanetius materials, and capital, only the expense jier ton was 
ret orded. It was necessary to estimate input prices and iinplk illy input quantiues from 
other sources. I'hose .sources arc discussed in the notes to taWe 1. It must also be 
Mres.sed that all calculations reported in this jiaper assume that the ctrsts in table 1 are 
minimum long-run average total fo.sis. 
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Between the 1880s and early 1900s the American steel industry 
became internationally competitive because the price of steel in the 
United States fell sharply. Table 1 and tables 2 and 3, which report 
the index numbers discu.ssed in the previous section, make dear the 
decline in price was as.sociated with a .sharp decline in steelmaking 
costs. According to table 2, sieelmaking costs in 1889 equaled 77 
percent of costs in 1879. In 1902 costs were 81 percent f)f their 1889 
level. The decline in costs subsequently slowed, for costs in 1910 
equalled 97 percent of their 1902 level. 

Kquations (9) and (13) provide schemes for decomposing the cost 
changes into efficiency and input price components. The results from 
using both equations are exhibited in table 2. and, as the reader can 
verify, the results are nearly identical. Between 1879 and 1889 falling 
input prices were the main cause of failing costs. I hereafter input 
prices rose slowly. Between 1889 and 1902 costs continued to fall 
bc'cause productivity growth was rapid. After 1902. the rate of pro¬ 
ductivity growth slackened and input prices rose, so the decrease in 
costs between 1902 and 1910 was meager. 

Kciuation (7) indicates that produc tivity increased at the rate of 0.6 
percent per year between 1879 and 1889. Between 1889 and 1902 
the tate increased to 1.6 percent and then dec lined to 0.8 percent in 
the period 1902-10. (Kcj. [12J gives essentially the same results.) 
Throughout the period 1879-1910, the aveiage product of labor 
continuously rose, and the average product of capital generally fell. 
.Since the ratio of the wage rate to the rental price of capital consis¬ 
tently increased, the change in the average products of lalior and 
capital might have been clue to factor substitution. Moreover, recent 
econometric work suggests that technical change was labor augment¬ 
ing in this period (Uain and Paterson 1979). Technical change and 
factor substitution leinfbrced each other in determining the lalxu- 
capital mix. 

Productivity growth was highest between 1889 and 1902. What 
distinguished those years from the others was the marked rise in the 
average pnvducts of fuel and metallic inputs. Fuel consumption de¬ 
clined as the mixer (invented in 1889) vvas adopted by .American 
steelworks. In earlier years, it had bc'en impossible to use the molten 
pig iton from blast furnaces direc tly in Bessemer converteis. The pig 
iron was cast and then remelted. By 1902 most Bessemer steel was 
made from molten iron cemveyed directly from the blast furnace. I he 
saving in fuel was impressive*. 'The productivity of metallic inputs also 
increased due to developments in other areas of the steel mill, in this 
case due to the growth of open hearth steelmaking capacity. Bessemer 
shops and rolling mills generated a large volume of scrap in the form 
of poor rolls and sheared ends. I'hese could not be recycled in 
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1889-1902 .9573 = (1.1771) (.7962) (1.0215) .9573 = (1.1771) • (.7994) (1.0174) 

1902-10 1.000 = (1.0261) (-9305) (1.0475) 1.000 = (1.0261) (.9242) (1.0545) 

SoiBLF.—Table I Pu,)IP{l,) is ihe ratici of the pme in the lenninaJ )ear of ihe period to the price in the initial year. See table 2 for the definitions of other variables. 
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Bessemer converters but could be remelied in open hearth furnaces. 
In 1889 only 11 percent of American steel was open hearth, but by 
1902 38 percent was open hearth (Tcmin 1964, pp. 272-73). The 
development of this process allowed the recycling of scrap. Tables 1 
and 2 follow the nineteenth-century accounting tradition of charging 
Bessemer rail mills for their net consumption of metallic inputs. The 
recycling of Bessemer scrap in open hearth plants thus shows up as a 
rise in the productivity of metallic inputs in Bessemer rail production. 

Table 1 indicates that the fall in production costs was not entirely 
passed on to consumers in lower prices. An apparent consequence of 
the U.S. Steel merger was to raise markups to the 30 percent range 
for at least the first decade after the merger. In 1879 and 1889 
markups had been much lower. 

Table 3 uses equations (10) and (14) to decompose price changes 
into markup, efficiency, and input price components. The price of 
rails in 1889 equalled 72.8 percent of the price in 1879. Some of the 
decline in price was due to a decline in the markup; however, most of 
the decline can be ascribed to falling costs. As indicated previously, 
declining input prices were the predominant cause of declining costs. 
The fall in input prices was also decisive for the decline in price. 

Between 1889 and 1902, costs declined 19 percent but rail prices 
declined only 4 percent. The discrepancy arose because markups rose 
18 percent. This result indicates a major consequence of the U.S. Steel 
merger. Steel prices in the early twentieth century were set at levels 
prevailing about 1890, and the steel producers absorbed all of the 
decline in costs (due to technical progress in the case of rails) as excess 
profits. 

Table 3 indicates that there was little change in price, costs, or the 
determinants of costs between 1902 and 1910. The steel industry W'as 
sufficiently collusive so that a markup of 30 percent over unit costs 
could be maintained for a decade. 

IV 

These conclusions regarding productivity growth and the market 
power exercised after the U.S. Steel merger are at variance with 
recent conclusions advanced by McCloskey (1973, p. 99) and Tcmin 
(1964, pp. 186-93, 218-20). Both noted that the ratio of the price of 
steel rails to the price of pig iron was trendless from the 1880s to the 
First World War. (This constancy is illustrated by the data in table 1.) 
Both inferred from these price data that productivity growth ceased 
in the 1880.S, and Temin also inferred that market control after the 
U.S, Steel merger was not effective in raising the price of rails. 

To choose between McCloskey’s and Temin’s conclusions and those 



524 JOURNAL OF POLITICAL ECONOMY 

advanced here, it is necessary to scrutinize the theoretical underpin¬ 
nings of the calculations employed. McCloskey’s procedure consists of 
the (approximate) application of an identity deduced by Jorgenson 
(1966, pp. 2-4). Although the assumption is not obviously apparent in 
the derivation, it is the case that this procedure only correctly mea¬ 
sures productivity growth (and, hence, accounts for price changes) for 
industries in competitive long-run equilibrium. This will be shown 
subsequently. It is worth considering the matter thoroughly, both to 
resolve the issue at hand and because Jorgenson’s identity has been 
widely ii.sed by economic historians to measure productivity growth 
and account for price changes.* 

Jorgenson begins with the accounting identity that a firm’s revenues 
necessarily equal the total income earned by the firm’s inputs. In the 
case of a single product firm, 

Pd^'^wrx,, ( 1 .^) 

/*» 1 


where IV,* are the accounting valuations of the inputs X,. Totally 
differentiating equation (l.'r) with respect to time, dividing the result 
by equation (15), and regrouping terms gives: 

O V. 




(I6I 


Here the shares are .i* = IVfXj/SIVfXi, 

Jorgenson defined productivity to equal the ratio of output to total 
input; E = Q/X. I'otally differentiating this expression with respect to 
lime implies that the rate of change of efficiency etjuals the rate of 
change of output less the rate of change of total input: EIE = ()/Q — 
XIX. Defining XIX to equal 2.'*(X,/X,) implies 



Jorgenson contended, moreover, that Solow (19.57) deduced the 
left-hand equality from a production model. 

I( correct, equation (17) has an important implication for prodiic- 


* Noriti (1968, p. 9.64). SheptuTcl and Wallon (197',i!. pp. 60-63), Hailc-y (1973, j), 
376), and MfClo.skey (1973, p, 86) die Jorgenson (1966). Sandberg (1974, p. 100) dies 
ihc ihe.515 version o( MeCdoskey (1973) for a prool of ihe eijuivalentc of the G. 1. Jones 
index of real cost and ihe right-hand side of eq. (16)of this paper and goes on to assert 
the eqiiivalenre of that index and Solow’s (1957) index. Most of these economic 
historians recognize that the identity is not a sound basis for measuring productivity 
change when long-run conipctiiive e<|uilibnuni docs not prevail, and a numlrer ol ad 
h(x adjustments are developed to deal with the problem, l.indert and Trace (1971) do 
develop pnxluctivity indices that exclude excess profits. 
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tivity measurement, namely, that productivity growth can be inferred 
from the growth of input prices less the growth in output price without 
any explicit account being taken of markups. This result justified McClos- 
key’s procedure and appears inconsistent with equations (10) and (14) 
in this paper. 

Kquation (17) is an identity and hence always true. I’he difficulty 
with using it as the basis of productivity measurement is that ElE 
equals the rate of productivity groxvth in the Solow (1957) sense only 
if the firm (or industry) who.se productivity growth is being measured 
is in long-run, competitive equilibrium. This condition was not made 
explicit in jorgenson’s derivation. To see the importance of the com¬ 
petitive equilibrium assumptirm, recall Solow's derivation. Solow 
(1957) assumes a production function of the form of equation (1) and 
identifies the rate of productivity growth with AIA. Totally differ¬ 
entiating equation (1) with respect to time, dividing the result by 
equation (1), and rearranging allows/4/.4 to be expre.ssed as 



where the time symbol t is siippre.ssed for simplicity. If the firm faces 
exogenous input prices and minimizes costs, then the necessary con¬ 
ditions for cost minimization can be rewritten as = (d// 

dX,)/f(X). In that case equation (18) can be rewritten as 


A = _ V A 


(19) 


where s, = 

Kt|uation (19) and the lef t-hand equality of ecpiation (17) are almost 
identical. They would be identical if if — .v,. Since equation (17) is an 
accounting identity, one must make some assumptions about the 
institutional and accounting framework in order to determine if s,* = 
s,. Reasonable, simple assumptions are that inputs are available to the 
firm in infinitely elastic supply at prices IV,. any difference between 
revenues and the opportimity cost of inputs accrues as excess profit 
(or loss) to capital, and all inputs besides capital are valued in the 
firm’s accounts at their supply prices IVj. Given these presumptions 
IVf = tv, for all inputs besidt's capital. For capital (which will be 
indexed as the ifeth input), Wf % IV,^ if revenues exceed, equal, or fall 
short of the opportunity cost of the inputs. 

If the firm is in competitive, long-run equilibrium, revenues equal 
the opportunity cost of all the inputs and IVjf = VV^. In this case .q* = s,, 
and equation (17) correctly measures the rate of productivity growth 
A/a. Moreover, in this ca.se, productivity growth can be inferred from 



526 


JOURNAL OF POLITICAL ECONOMY 


input and product price movements without taking markups into 
account. This result is consistent with the framework of Section II of 
this paper, for, if revenues always equal opportunity costs, M(t,) = 
Afpj) = 1 and the markup terms disappear from equation (14). Effi¬ 
ciency change can then be inferred from changes in product and 
input prices. 

Suppose, on the other hand, that the markup exceeds one. In that 
case revenues exceed opportunity costs, ITjf > W^, and XlTfX, > 
XlEi-Y,. Hence, > r* andjr,* < for all inputs besides capital. Unless 
all inputs (including capital) grow at the same rate ^ 

XsfiXJXi) and the rate of productivity growth cannot be inferred from 
etjuation (17). The same conclusion follows if markups are less than 
one. Therefore, if markups do not equal one, equation (17) cannot be 
used to measure the rale of shift of the production function. In 
particular, one cannot infer the rate of productivity growth from 
differences in the rates of growth of input and product prices. 

Substantial errors result if equation (17) is used when markups dt) 
not ecjual one. 1 he errors are apparent if equation (17) is applied to 
the data in table I. That application requires that some discrete 
approximation be adopted. The most useful approximation is to 
apjiroximate the time-varying shares by the average of the shares 
pievailing at times/, and = 1/2 lt*(/i) + -'i*(/ 2 )]. and integrate the 
left-hand ctjuaiity in etjuation (17) between (, and tj. Exponentiating 
the integral yields: 


= FI f f 


( 20 ) 


If one treats equation (19) analogously, the result is etjuation (7). 
Again the only difference between the putative productivity measures 
is in the shares. Numerically, however, the differences are not incon¬ 
sequential. For the periods 1879-89, 1889-1902, and 1902-10 A(/ 2 )/ 
A(t,) equals 1.0676. 1.2.660, and 1.0747, respectively. For the same 
periods/’■(/j)//"'!/,) equals 1.0408, 1.1904, and 1.1199. The term/^ltj)/ 
E(ti) understates the average annual rate of productivity growth by 40 
percent in 1879-89 and 24 percent in 1889-1902 and overstates the 
rate by .69 percent in 1902-10. Assuming long-run, competitive equi¬ 
librium when it does not obtain leads to .serious errors in productivity 
measurement. 


V 

In this paper it has been shown that American rail prices fell in the 
late nineteenth century because production costs declined sharply. 
7’wo unexpected findings include the discovery that the rate of pro- 
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ductivity growth remained high into the twentieth century and the 
discovery that rail producers managed to maintain steel prices at a 
level 30 percent above average total cost for a decade after the U.S. 
Steel merger, le min’s (1964, pp. 189-93) recent work on the merger 
concluded that the markets for rolled products remained competitive 
after the merger. That conclusion is certainly not true for rails. 
Similar evidence lor other products suggests that markups were 
generally high.* A rea.ssessment of the significance of the U.S. Steel 
merger in this regard seems in order. 

In order to establish these conclusions, it was necessary to develop a 
more elaborate price accounting scheme than those previously avail¬ 
able. As part of that effort, considerable attention was devoted to the 
circumstances under which one could infer the rate of productivity 
change from the difference in the rates of input price and product 
price changes. A correct inference can be made only if the industry is 
in competitive long-run equilibrium. If that condition does ntn obtain, 
serious errors are likely to result. I'hese difficulties can be avoided if 
productivity growth is measured with equation (7) or (12). 
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Some Evidence on Cross-Sector 
Effects of the Minimum Wage 


George E. Tauchen 

Duke Unmerfity 


I'his paper tests Mincer’s mlnimum-vvagc model by estimating 
redueed-t'orm wage and employment equations for both the covered 
and uncovered sectors in nine regions oi' the United States. As 
theory predicts, in regions with comparatively small covered-sector 
dctnanci elasticities, the northern and mid western regions, the 
uncove red-sector wage increases after a minimum-wage hike; and in 
regions with comparatively large demand elastintics, the southern 
and western regions, the uncovcrcd-seelor wage derreases. Becau.se 
of data limitations the uncovered-sector employment effect could 
not be estimated shat ply, and so its relationship to the covered-sec tor 
demand elasticity is weak. 


1. Introduction 

Economic theory predicts that a minimum-wage hike decreases em¬ 
ployment of low-skill workers in the covered sector. Theory also 
suggests that a minimum-wage hike affects the ec|uilibriiim wage and 
the level of employment in the uncovered sector though, as pointed 
out by Mincer (1976), the directions of the unrovered-.scctor effects 
are ambiguous. On tlie one hand, if the absolute labor demand 
elasticity is relatively small so that firms eliminate only a few covered 
jobs after the wage hike, then workers find the minimum wage attrac¬ 
tive enough to justify the risk of unemployment while searching for 
jobs in the covered sector. In this case labor leaves the uncovered 
sector, and the wage thereby increases. On the other hand, if the 

I am grateful to Christopher A. Sims. Helen Tauchen, and an anonymous referee for 
many helpful comments and suggestions. 
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demand elasticity is relatively large so that the return to search is low, 
then displaced workers enter the uncovered sector, and the wage 
decreases. 

The objective of this paper is to test Mincer’s characterization of the 
elfects of a minimum-wage hike on the uncovered sector. In the 
empirical work the covered sector is an aggregation of four low-wage 
nondurable manufacturing industries; the uncovered sector is ag¬ 
riculture prior to coverage under the Fair Labor Standards Act. 
I'irne-series data arc used to estimate the cove red-sec tor labor de¬ 
mand elasticity and the uncovered-sector wage and employment ef¬ 
fects of a minimum-wage hike for nine regions of the United States. 
As theory predicts, in regions with comparatively small demand elas¬ 
ticities, the northern and midwestern regions, the uncovered-sector 
wage increases after a minimum-wage hike; and in regions with com¬ 
paratively large demand elasticities, the southern and western re¬ 
gions, the uncovered-sector wage correspondingly decreases. The 
uncdvercd-sector employment effect is not as sharply estimated as the 
wage effect, and so its relationship to the demand elasticity is weak. 
This restilt is explained by limitations of the data. 

There is a potentially serious problem with using regional time- 
seiies data to test the minimum-wage model. The problem arises 
bec ause it is impossible to get long time scries of data on the level of 
employment of persons earning exactly the minimum wage. Al¬ 
though average hourly earnings in the four low-wage industries are 
among the lowest in the economy, they are still well above the 
minimum wage, indicatitig that the data series contaiti contributions 
from high-wage labor. Thus the dependent variables in the re- 
giessions are total employment and the average wage, both aggre¬ 
gated over the wage distribution. It is conceivable, then, that the 
observed regional variation in the covered-sector demand elasticity is 
a statistical artifact leflecting regional differences in the mix of low- 
ancl high-wage labor. In other words, the true demand elasticity for 
minimum-wage grade labor could be the same in all regiotis, while the 
estimated demand elasticity is larger in the South simply because 
there are more low-wage workers there. 

I he evidence presented in sub.sequem .sections suggests that the 
spurious aggregation effect is small. Briefly, the argument is as fol- 
low's. Think of low- and high-wage labor as imperfect substitutes in a 
production function with constant returns to scale. Now let the 
minimum wage increase by 1 percent. There are both scale (output) 
and substitution effects. The scale effect causes employment of both 
factors, and hence total employment, to decline equiproportionately. 
The substitution effect, on the other hand, causes low-wage employ¬ 
ment to decline and high-wage employment to increase. Intuitively, 
and as is shown in Appendix A, the observed proportionate increase 
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in the average wage {l^WIW) is no less than the share (i,) of low-wage 
labor in total labor costs; if the substitution effect is zero, then AW/W 
equalss,. The regional estimates of \WIW from the regressions turn 
out to be very close to regional estimates of s, derived from indepen¬ 
dent wage distribution data. This can happen only if the major impact 
of a minimum-wage hike is the scale effect and not the substitution 
effect. Since the scale effect dominates, the observed proportionate 
decline in total employment must be close to the proportionate de¬ 
cline in low-wage employment; that is, the aggregation bias is small 
in the covered sector. 

The importance of the scale effect together with the cost-share data 
can explain why there are regional differences in the demand elastic¬ 
ity. I'he size of the scale effect depends upon (he increase in the 
marginal cost of production caused by a hike in the minimum wage. 
Since is 0.14 in the North and 0.32 in the South, the proportionate 
increase in marginal cost is larger in the South, and so output and 
total employment contract proportionately more there. 'I'his is con¬ 
sistent with the idea that one purpose of the national minimum wage 
is to reduce the cost advantage of firms located in low-wage regions of 
the country. It is also consistent with Silberman and Durden’s (1976, 
p. 325) finding that Southern congressmen are less likely than other 
congressmen to vote in favor of a minimum-wage hike. 

The remainder of this paper is organized as follows. Section II 
describes a simple two-sector labor market model which is used to 
guide the empirical work. Sections 111 and IV report the empirical 
findings for the covered and uncovered sectors. Section V uses the 
theoretical model to interpret the results. Section VI contains the 
concluding remarks. 


II. Two-Sector Model 

The following two-sector model of the low-wage labor market is used 
to illustrate the restrictions being tested and to guide the empirical 
work. I’he model consists of two labor demand schedules (one for 
each sector), an equilibrium condition connecting the minimum wage 
and the competitively determined uncovered-sector wage, and an 
aggregate labor supply relationship. Formally, the model is: 

labor demand (covered): N' = D'^iWtWz.X''), < 0, (la) 

labor demand (uncovered): A/“ = D“(IT“/IV 2 , X“), D'i < 0, (lb) 

equilibrium condition: W ~ ^ ^ ® ^ 

labor supply: + N" + U = S{W'‘/P), S'> 0, (Id) 



53 ^ 


JOVRNAI. OF POLITICAL ECONOMY 


where N'' N“ = empJoyment in the covered (c) and uncovered (w) 
sector- fv’= minimum wage: IV'‘ = wage in the uncovered sector; W, 
= control wage. W, > W; X^, = vectors of exogenous shift van- 

ables; U = unemployment; S = turnover parameter; and/* general 
price level. The labor demand equation (la) expresses employment in 
^Qvered sector, as a function of the relative minimum wage, 
W/W.J,, where is an exogenously determined measure of high-skill 
wages, and as a function of exogenous shift variables, X<'. Similarly, 
equation (lb) expresses employment in the uncovered sector as a 
function of the uncovered-sector wage relative to the high-skill wage, 
IV“/IV 2 , and as a function of exogenous variables, X“. Equation (Ic) is 
from Mincer (1976). In equilibrium the uncovered-sector wage must 
equal the expected return from search in the covered sector (equal to 
the probability of getting a job times minimum wage). The parameter 
5 in (Ic) is the proportion of covered-sector employees who each 
period must enter an employment lottery with the unemployed. Fi¬ 
nally, equation (Id) relates the total labor force to the real wage in the 
uncovered sector. 

The four endogenous variables in the system (1) are (V“, W'‘, and 
U\ the exogenous variables are W, W2, P, X‘’, and X“. Note first that 
(la) is the reduced-form expression for Now write the reduced 
form for the other three endogenous variables as 


N- =N'‘(W, 1T2,P,X",X“), 

(2a) 

IV = IT“(IF, W2,P,X'',X‘‘), 

(2b) 

U==U(W, IV2,T,X'-,X“). 

(2c) 

rheory places restrictions on the reduced-form equations, 
cally. Mincer shows 

Specifi- 

dW« . 

'' 

(3a) 

aw “ ' >*• 

(3b) 

where c'' is the absolute elasticity of labor deiriand in the covered 
sector (the absolute elasticity of [laj with respect to the minimum 
wage). The unemployment effect is positive. 

>0 

dW 

(4) 


and the sign of the labor force effect agrees with the sign of the 
uncovered-sector wage effect. 


dN< dN" at/ ^ ^ 


dW 


dW 


(5) 
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Some of these results have been previously investigated empirically, 
rhere are. of course, many studies of the disemployment effect (la) 
and of the unemployment effect (4). I he Mincer study measures the 
labor force effect (5) for teenagers and young adults. After finding a 
negative labor force effect. Mincer infers e’’ > 8 and uses (3) to 
conclude that < 0, that is. that, as a group, low-wage workers’ 

prospects decline after a minimum-wage hike. Gardner’s (1972) ear¬ 
lier work with aggregate U.S. agricultural data is consistent with 
Mincer’s findings, though Gardner’s estimates lack statistical signifi¬ 
cance at conventional levels. The purpose of this study is to use 
regional variation in the demand elasticity to test the characteriza¬ 
tion (3a) and (3b). If there is small regional variation in the parameter 
6, then regions with relatively small values of?'’ should show dlV“/dW 
> 0, dN'^ldW < 0, and vice versa for regions with relatively large values 
of e‘'. 

Ill, The Covered Sector 

I he first step in the empirical work is to estimate the demand equa¬ 
tion (la) to get regional estimates of the covered-sector demand 
elasticity, <'*■.’ The data are annual observations on employment 
(1949-74) and average hourly earnings (1958-74) in an aggregation 
of four low-wage nondurable manufacturing industries: tobacco (SIC 
21), textiles (SIC 22), apparel (SIC 23), and leather (SIC 31).* As 
tnentioned in the Introcluction, there is a problem because the em¬ 
ployment variable includes workers for whom the legal minimum 
wage is economically ineffective. The effects of this skill-group aggre¬ 
gation can be characterized. Appendix A shows that regardless of the 
substitution pattern within aggregate labor, the observed (absolute) 
diseniploymeiu elasticity underestimates the sought-after demand 


' riif stales within path region arc; New England. Vt., Mas.s., N.H.. Conn., R.I.. 
Maine: Middle Atlaniir; N.Y., Pa., N.J.; East North Central: Ohio, Wis., Ill., Ind., 
Mich.; West North Central: Mo., Iowa. N.Dak., .S.Dak., Nchr., Kans.. Minn.; South 
Atlantic: Va., Del., Ga,, Md,, W.Va., N.C., S.C.. Fla.; F.ast South Central- Ky.. Fcnn.. 
Mi.ss., Ala.; W'esi South Central: Ark., lit., l ex., Okla.; Mountain: Ncv., Wyo., Mont., 
N.Mex., Utah, Idaho. Colo., ,\riz.; Pacific: Calif., Oreg., Wash. 

* Two-digit manufacturing industries were defined as low-wage industries if U.S. 
average hourly earnings in 1960 were no more than $1.70; the legal minimum was 
upped from $1.00 to fl.I.S m September 1961. Only the four indicated nondurable 
manufacturing industries met the criterion. No other two-digit nondurable manufac¬ 
turing industry had an average wage lielow $2.00 in 1960; the lowest average wage in 
196<i of two-digit durable industries was $1.88 in furniture (SIC 2.5). Annual observa¬ 
tions on state employment and average Itourly earnings were collected from U.S. 
Bureau of Labor Sutistu's (1977), In the rase of missing values, the data from nearby 
larger states were spliced into the shorter series. This allowe<1 development «)f usable 
stale employment series from 1949 forward .md wage senes Irom 1958 forward. The 
series were aggregated over the lour induslnes and over contiguous states to nine 
legions. 
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TABLE I 

(iOVKHt-U'SKl TOR KmPLOYMLNT AND Waoii: E(^uatu>ns: 
SiGNIFH ANCI-. TkSTS* 



DKPfcNOKNI Variabi.e: 
nc (1949-74) 

SlC.NIFICANC.E I'rST 

Z(3,I7) 

DtPF.NOENT Variable: 
(19.38-74) 
Sic;NiFic:ANt;E Lest 

F(S,8) 

w 

f-I to +11 ( 

11,, 

-1 I«4I] 

w 

1-1 lo +1] 

»'2 

[-1 to + I] 

United Stales 


4I>||*4C 

•K 

**** 

.New England 


**** 


**** 

Middle .Atlantic 





East North Ontral 


#*>!>•» 


**** 

West North Ontral 

>l< 

i|(*>l>4i 


**** 

South Atlantic 



** 

**** 

East South Orittai 




**** 

West South Cential 

4'<»<4>4> 




Mountain 



N.A.o 

N.A. 

Pacific 






So It -DojiriKh-tjt vjrubJfs. «• « Jo^ of lotaJ rmpioyinenf. lour kjiv-wage noiuliudble manu/aciunn^ indunffw**. 
bv icj?K>n ds a' * logo! aveidgc hourly earnings, four Inw-^age nondurable indnufaaunng nKlustncs. by 

leguin as in<lt(.<ued liulepcndctii variables, u' * log of nummum wage, u'l * log ol aveiagc hourly eartimgs. ail 
indnuraoiurnig. excluding average hourly edniings in the foiii low*wage industnrs, : 3 log of the KRB index of 
iiKlusirial (ircxluciion, all nkdiuifd<.tunng. Lags are in brackets, negative lags are leads (I'uiure lenns). 

* lest (hdi uulKdied giotip o( dmnbuiesl lag toetlKieius is zero 

^ No svage data aie available ior the Mountain region. 

* 

•• 10*7 

••• 

«... |.^ 


elasticity. In addition, the observed elasticity ol the covered-sector 
wage with respect to the minimum wage overestimates the share of 
minimum-wage grade lahc^r in total labor costs. Below, evidence will 
be presented that c'' is only slightly underestimated. 

I'o focus attenticjn on economics the less important technical details 
of the regressions arc put in Appendix B. The results in tables 1 and 2 
are from distributed lag regressions of the logs of covered-sector 
employment and average hourly earnings, n*" and w'", on the log of the 
minimum wage, w, the control wage, W 2 , anti the cyclical index, z. I’he 
control wage is the log of average hourly earnings, all manufacturing, 
net of wages in the four low-wage industries; the cyclical variable is 
the log of the FRB itidex of industrial prcjduction.^ Each regression 

’ 71', 77'2, and z are the logarithms <>t the annual average of monthly values. The level 
H'j was computed as tV, = {EW — ^jF.,fVi)/(E - XjF.j), where F is production worker 
employment and ff is production worker average hourly earnings. Unsubscripted 
values refer to all manufacturing; the sub.script; indexes the four low-wage industries. 
The basic data were taken frt,m U.S. Bureau of l-abor Statistics (1976) and from issues 
of F.mploymfrU and Farmngs (U.S. Bureau of Labor Statistics 1976-77). The industrtal 
production index was taken from U.S. Department of Catmmerce (1976) and issues of 
the Survey nf Currenl Business (U.S. Department of Commerce 1976-77). 


TABLE 2 

Covered-Sector Employment and Wage Equations: Sums of Distributed Lac Coephcients 
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included trend and a consiani."* The regressions do not control for 
state minimum wages because in this sample state minimums are 
either at or below the federal minimum wage.® 

TheT-tests in table 1 indicate that the distributed lag coefficients on 
the minimum wage are significant as a group in nearly all of the 
equations. Now consider the long-run elasticities reported in table 2. 
As theory predicts, the sign of the minimum-wage effect is negative in 
all of the employment equations and positive in all but one wage 
equation. Moreover, the magnitude of the effect displays important 
regional variation. The disemployment effect appears to be negligible 
in the Northeast and upper Midwest, and it seems to be rather 
important through the South, the Southwest, and the Pacific coast. 
Also, the elasticities of employment and the average wage with respect 
to the minimum wage are consistent with one another, in the sense 
that, wherever the effect on the wage is small, so is the effect on 
employment. 

To my knowledge, there are no other comparable published esti¬ 
mates of the disemployment effect by region or state. 7'he estimated 
elasticity of —0.29 f^or the United States, however, is comparable with 
the finditigs of many other researchers. It is just outside Gramlich's 
(1976, p. 431) bounds of —0.05 to —0.25 for published estimates.® 
Comparing these long-run effects to regional data on the wage 
distribution in the covered sector suggests that the major impact of 
the minimum wage is the .scale or output effect. l o see this, consider 
the 1964 wage distribution displayed in table 3. d'hese data were 

^ I'hroughoul this paper legional dependent variables are regressed on national 
variables. This is due 10 data limitations, but a strong case ran be made for this practice 
even if regional dau were available. I'ruly exogenous variables are driven by broad 
economy-wide forces and should not contain endogenous local fluctuations. For the 
i)uarlerly agricultural regressions of Section IV it was possible lo roinpule regional 
agricultural prices but only on an annual basis. Quarterly inixlels with these prices 
repealed four times gave much lower than the reported exjuauons. Kvidenlly, the 
time variation is more important than the regional variauon. 

' In ihc Northeast, stale minimum wages tend to be jusi at the federal wage, and 
outside the Northeast they ate usually weU below the fcxieral wage. Movements in slate 
wage floors tend to parallel changes in the federal wage, and the main efftx t of state 
minimums is to extend c overage to industries only partly covered by federal legislation: 
retail trade, laundries, services, etc., industries excluded from the sample. In lt)70 the 
federal minimum was SI .60, while the simple averages of state minimums by region 
were; New England, $1.60; Middle Atlantic. $1.60; F.ast North Cieniral, $.98; West 
North (Central, $.65; South Atlantic, $.77; East South Central, $.18; West ^uth Cen¬ 
tral. $.84; Mountain. $1.07; and Pacihe, $1.50. New York was the only slate with a 
minimum wage above $1.60; its minimum was raised from $1.60 to $1.85 effective July 
I, 1970. Washington, D.C., and Alaska had 1970 minimums of $1.80 and $2.00, 
respectively, but were excluded from the sample. Source: U.S. Employment Standards 
Administrauon (I972A, p. 52, table 5), 

• Many of these studies use the ratio of teenage to adult employment as the depen¬ 
dent variable, and so they estimate only a pure substitution effect (Welch 1974). I'his 
study estimates a combined scale and substitution effect. 



MINIMUM WAGE 


537 


TABLE 3 

Wage Destribuiion for the Agcregaiion of Four Nondurablf, 
Manufacturing Inuusiriea, March 1964 (%) 


Straight-Time 

Wage ($) Less Than 

United 

States 

Northeast 

South 

North 

Central 

West 

1.25 

1 

1 

I 

1 

1 

1.30 

16 

II 

22 

18 

14 

1,35 

23 

16 

30 

23 

19 

1.40 

29 

21 

39 

31 

22 

1.45 

36 

28 

47 

37 

28 

1.50 

41 

32 

53 

42 

31 

2 50 

91 

85 

98 

94 

83 

3 50 

98 

96 

100 

100 

97 

1 otal 

100 

100 

100 

100 

100 

Average hourly 

earnings ($) 

1.76 

1.90 

1..59 

1 72 

1.92 

Under $1,40 share.s 
of total; 

Employment (*,) 

,29 

.21 

39 

.31 

.22 

Labor costs (t,) 

22 

.14 

.32 

.24 

15 


NfiiT —>ln September 196^ (he tusic iiiiMitnuni wage was laised from $) 15 lo $) 1^5. in February 1967 ii was 
^a^^etl to $1 1 he dtstnbuttotts weie (ootputrti Irom taMe 4 of U S Wage aivl Hour atid Publu (Amiraiis 

Dmiiona (1965). Ac(ou(i((Mg uleiuities and iiuerpitlaiions were used «> get <ontplete data for the North Ontrai and 
West regions, and so the disinbuiions f<»r these iw(» regions aie less reliable than the «Mhrr disinbuiions 


collecied in a special survey of firms by the Department of Labor. 
Define low-wage workers to be those within 10 percent of the 
minimum wage of $1.25 {10 percent wa.s about the average percent¬ 
age increase in the minimum wage in the early 1960s), t hen the share 
of low-wage labor in total labor costs is 0.14 in the Northeast and 0.32 
in the South. 1 hese cost shares indicate that, if firms held employ¬ 
ment constant, then a 10 percent minimum-wage hike would drive up 
labor costs by 1.4 and 3.2 percent in these I wo regions. Furthermore, 
if firms respond to the wage hike by contracting low- and high-wage 
employment equiproportionately, then the average wages in the 
North and South would increase by 1,4 and 3.2 percent. According to 
the regressions in table 2. the average wages actually increase by 1.5 
percent in the North and 3.2 percent in the South. ( These figures are 
10 times the unweighted mean elasticity for the two northern regions 
and three southern regions.) A similar agreement between the cost 
share and the wage elasticity exists in the North Central region and in 
the West. The data appear to say that covered-sector firms respond to 
a minimum-wage hike by taking steps to reduce total labor input and 
that both groups of workers suffer equiproporiionate disemploy¬ 
ment. Furthermore, since the substitution effect appears to be small 
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relative to the scale effect, the estimated disemployment elasticities 
are close to the true demand elasticities for low-wage labor. 

IV. The Uncovered Sector 

The purpose of this section is to e.stimate the minimum-wage effect on 
uncovered (agricultural) employment and wages. The reduced-form 
equations (2a) and (2b) give the specifications. Specifically, the logs of 
agricultural employment and wages are regressed on the minimum 
wage, the control wage, the general price level, and the shift variables 
appearing in (la) and (lb). Data are available quarterly, and the opera¬ 
tional definitions are as follows. Employment and wages, w“ and id“, 
are, respectively, the logs of hired farm employment and the hourly 
wage exclusive of room and board. Agricultural economists consider 
this wage to be more reliable than the U.S. Department of Agri¬ 
culture’s various composite measures. In the list of independent 
variables, the minimum wage variable iscie, the legal minimum multi¬ 
plied (before logging) by the Moore index of the proportion of 
nonagricultural employees covered by the legislation. Intuitively, the 
impact of a 10 percent minimum-wage hike on the uncovered ag¬ 
ricultural labor market is much different if 20 versus 90 percent of 
the nonagricultural work force is covered. The control wage rcj and 
the cyclical index z are the same variables as used in Section III. The 
general price level p is the log of the CPI. Finally, the uncovered- 
sector shift variables are the logs of the USDA's index of prices 
received by farmers, p“, and the index of prices paid by farmers for 
production items q. The index q excludes wages.’ 

The basic sample period (before lags and leads) is 1947:1-1966;! V. 

1 runcation at 1966 restricts the estimation to a period in which 
agriculture was effectively uncovered by federal legislation. On Feb¬ 
ruary I, 1967, coverage was extended to employees of large farms. 
I he Department of Labor estimated that by 1971 about 48 percent of 
the agricultural work force was covered by federal legislation.'* Also, 
after extension of federal coverage to agriculture, several states ex¬ 
panded their coverage to agriculture. Prior to 1967, state agricultural 
minimums were considered unimportant.'* 

' The agruultural employment and wage series were obtained from various issues of 
Farm Labor (U.S Department of Agriculture 1951-75). The input and output prices 
indexes were obtained from issues of Agrtculture Prices, Annual Summary (U.S. Depart¬ 
ment of Agriculture 1961-72). 

* For details sec U..S. F.mployment Standards Administration (1972a, pp. 5-9). 

* In 1966 only three states in the sample had a minimum wage applicable to agricul¬ 
tural workers; California, |1..50; Michigan, $1.15; and Wisconsin, $I.(M). The Califor¬ 
nia and Wisconsin minimums applied only to women and minors. Source: U.S. Wage 
and Hour and Public Contracts Divisions (1966, p. 30, table 12). Michigan and Wiscon- 
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Two conclusions are apparent from the regressions. First, the 
significance tests in table 4 confirm the well-known fact that aggregate 
cyclical forces play a strong role in the determination of agricultural 
labor supply. According to the U.S. long-run elasticities in table 5, a 
10 percent increase in industrial production is associated with an 11.3 
percent decrease in hired farm employment and a 2.9 percent in¬ 
crease in the hired farm wage. This is important because it indicates 
that there are indeed sizable labor flows between agriculture and the 
covered sector. 

Second, and most important, changes in the minimum wage do 
affect agricultural employment and wages. TheF-tests in uble 4 show 
that as a group the distributed lag coefficients on the minimum wage 
are significant at 10 percent in five of 10 employment equations and 
in eight of 10 wage equations. Note that in table 5 the long-run U.S. 
employment elasticity of —.18 is insignificant, but the wage elasticity 
of —.08 is significant. The latter measurement indicates that for the 
entire United States a minimum-wage hike lends to depress the 
uncovered-sector wage. This finding is consistent with both Mincer’s 
and Gardner's results. 

Examination of the regional results in table 5 suggests that there are 
important regional differences in the welfare effects of the minimum 
wage. Notice that the effect on the uncovered-sector wage is sig¬ 
nificantly positive in the New England, Middle Atlantic, and West 
North Central regions; it is significantly negative in the South Atlantic 
and Pacific regions. I hus, as indicated by (3a), the wage in the un¬ 
covered sector can indeed rise or fall after a minimum-wage hike in 
the covered sector. Viewed in isolation, however, the uncovered- 
sector results are insufficient to verify the model. Indeed, if the 
hypothesized economic mechanisms are really at work, then the ab¬ 
solute disemployment elasticities from Section III should be largest in 
the regions where the uncovered-sector wage falls. 

V. Sectoral Comparisons 

Consider the inequality (3a). If the demand elasticity c*' is less than the 
turnover parameter S, then after a minimum-wage hike the flow of 
labor out of the uncovered sector increases the wage there; if e'" is 
gieater than 5, then the flow of labor into the uncovered sector 
decreases the wage. Therefore, if there is small regional variation in 


sin are in the East North tientral region, and their agricultural minimum wages possibly 
could explain the results for that region. Cionsider. though, the fact that California 
accounts for 88 percent of agricultural employment in the Pacific region, and it turns 
out that after a hike in the U.S. minimum wage the agricultural wage drc/mes in the 
Pacific region. 
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UNCovERti) Sector Employment and Wage Eqlations: Sims 
OT Distributed Lag CoEmciENTS (Long-Run Effects) 
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4 West North Ceetrot 

5 South Atlorttic 


Fi(.. I.— I he relationship between the uncovered-sect or wage and employment 
cHetls and the tovered-sector dLsemployinent elasticity. 


the parameter S, then plotting regional observations on the effect of 
the minimum wage on the wage in the uncovered sector against the 
demand elasticity e'" should give a downward sloping relationship 
trossing the horizontal axis at 6. By (3b), plotting the effect of the 
minimum wage on employment in the uncovered sector against e’’ 
should give an upward sloping relationship also trossing the hori¬ 
zontal axis at 5. Panels A and B of figure 1 are the corresponding 
empirical diagrams, The dependent variable in panel A is the ob¬ 
served long-run elasticity of the uncovered-sector wage with respect 
to rhe minimum wage (from table 5); the dependent variable in panel 
B is the long-run uncovered-sector employment elasticity (also from 
table 5). The independent variable in each panel is the absolute value 
of the long-run covered-.sector di.semployment elasticity (from table 
2 ). 

As predicted by theory, panel A indicates an inverse relationship 
between the effect of the minimum wage on the uncovered-sector 
wage and the demand elasticity. Fhe fitted line’s horizontal intercept 
of 0.32 is an estimate of the critical demand elasticity (6) that deter¬ 
mines the sign of the wage effect in the uncovered sector. The esti¬ 
mate 6 = 0.32 says that in the steady state at least 32 percent of 
covered-sector employees compete with the unemployed for em¬ 
ployment. In contrast, panel B suggests no relationship between the 



MINIMUM WAGE 


543 

effect of the minimum wage on uncovered employment and the 
demand elasticity. The data points are essentially random deviations 
about the horizontal axis. Note that the F-tests in table 4 indicate that 
the minimum wage affects uncovered employment. The sign and size 
of the long-run effect, however, are measured very imprecisely. 

The skill-group aggregation in the uncovered sector data can ac¬ 
count for the apparent conflict between theory and panel B of figure 
1. When the low-skill wage changes in the uncovered sector, there are 
scale and substitution effects. As noted before and as proved in 
Appetidbc A, the larger is the substitution effect relative to the scale 
effect, the smaller is the observed (absolute) employment elasticity 
relative to the true proportionate change in low-wage labor. Evi¬ 
dently, the elasticity of substitution between low- and high-skill labor 
in agriculture is large enough to make this skill-group aggregation 
suppress the uncovered employment effect to an undetectable level. 
Notice that this is the reverse of the situation in the covered sector. As 
was noted in Section III, in the ctivered sector the scale effect domi¬ 
nates the substitution effect; in contrast, in the uncovered sector the 
.substitution effect dominates the .scale effect. 

VL Summary and Conclusion 

rhis paper examines the covariation over regions in the effects of the 
minimum wage on wages and employment in the covered and un¬ 
covered .sectors. Its major empirical finding is that the pattern of 
results is consistent with theory. Specifically, after a minimum-wage 
hike the uncovered-sector wage increases in the five regions with 
relatively small covered-sector demand elasticities: New England, 
Middle Atlantic, East North Central, West North Central, and East 
South Central; it decreases in the four regions with the relatively large 
covered-sector demand elasticities; South Atlantic, West South Cen¬ 
tral, Mountain, and Pacific. In addition. Mincer’s and Gardner’s ear¬ 
lier work suggesting that the uncovered-sector wage decreases in the 
entire United Slates is consistent with these findings, for their results 
are dominated by the four large regions (in terms of low-wage em¬ 
ployment) in which the uncovered-sector wage falls. This paper s 
results have some bearing on the issue of welfare effects of the 
minimum wage. It seems fair to conclude that the minimum wage 
increases the well-being of all low-wage workers in the New England, 
Middle Atlantic, East North Central, West North Central, and East 
South Central regions. In the remaining regions, however, the 
minimum wage increases the well-being of only those workers having 
covered-sector employment. 
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The purpose of this Appendix is to characterize the effects of using total 
employment and the average wage as dependent variables in the regressions. 
Suppo.se low-skill class 1 labor {TV,) and high-skill class 2 labor (A^) are 
imperfect substitutes in a production function with constant returns to scale. 
Let the class 1 wage (If’,) increase. Then 

Ni = (scale effect — SjcrullLi, 

/Vj = (scale effect + SiO’i2)14'i, 

where “ denotes proportionate change, s, and are the factors' shares in 
expenditures, and cru > 0 is the absolute Hicks-Alien partial elasticity of 
substitution. Let A, and Aj denote the factors' shares in Iota! employment; then 
the proportionate change in total employment is 

N = A,A/, + AjA?! 

= (scale effect -F (Aj - .^ 2 )o■l 2 ]^^',, 

which is less negative than N,. I’hc change in the average wage is 
W = (A.IL, -F AjfLj) 

= i.ff, -F (c, - k,){N, - iV2) 

= Ul - (h - *i)0'i2]I4'|. 

which exceeds C|ff| because s, < A, (cla.s.s I labor is the low-wage factor), 
(dearly the characterization applies ecjually well to cither sector. ( Lhc above 
arguments were found by an anonymous referee.) 


Appendix B 

1 his Ap{x;ndix discusses the technical econometric issues associated with the 
empirical work, 

Sprnficatum 

rhere are three important points concerning the covered-sector equations, 
first, adjustment costs (hiring and separation costs) induce a dependence of 
factcir demand on lagged and expected future factor prices (.Sargent 1978). 
rhis motivates using distributed lag methcxls and including a lead value of 
the minimum wage, for its movements are known well in advance. Second, 
minor distortions due to temporal aggregation (the unavoidable use ol annual 
instead of monthly data) are apparently the cause of a significant lead 
cfx;fficient on the control wage. This was verified by using Geweke's (1978) 
results. The effects of the temporal aggregation are minor because the ex¬ 
planatory variables are annual averages and the long-run effects are there¬ 
fore estimated consistently (see (icweke’s theorem 2). Third, the temporal 
aggregation and the forecasting of factor prices are gtxxl rca.sons to expect 
the homogeneity of degree zero of (la) in nominal variables to break down in 
practice. Consequently, m and Wj are entered separately in (la) and the 
homogeneity constraint tested (and rejected). The coefficients have a “real" 
interpretation, for the entire history of each nominal variable is held constant. 

The same comments apply to the uncovered .sector regressions, though 
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with quarterly (lata the temporal aggregation problems are absent. To purge 
the series of seasonal ctfecls, each regression included seasonal dummies and 
the prtxluds of .seasonals with trend. 

Sttial Correlation 

In the annual covered-sector regressions a first-order Cochrane-Orcutt cor¬ 
rection was used; in the ((uarierly uncovered-sector regressions ad hix 
prefilters were ii.sed together with fifth-order G(x:hrane-Orcutt. 

Exogeneity 

T able Bl reports the outcome of batteries of Sims's (1972, 1977) exogeneity 
test. T he test consists of the appropriate F- or /-test on lead coefhcients. The 
pattt ii y of rejections indicates no .serious problems with a lack of exogeneity in 
either set of results. Since lead values of w and enter the covered-sector 
equations for other reasons, the test’s power is obviously low in that set of 
results. 
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Taxes, Inflation, and the Durability of Capital 


Andrew B. Abel 

Hcu-uard L'nivtnsUy 


The choice of capital durability is affected by the rale of inllation 
betausc ntiminal depreciation deductions are based on historical cost 
rather than re[)lateinent cost. This paper analyzes a model with 
competitive firms and intertemporafiy optimizing consumers and 
demonstrates that an increase in the rate of inflation will lead to an 
intrea,se in the durability of capital, if the ratio of the nominal 
discount rate to the rate of depreciation exceeds a critical value. 
However, if this ratica is less than the critical value, this effect is 
reversed. The effect of costs of adjustment on the critical value is 
analyzed. 

I. Introduction 

Since depreciation deductions for tax purposes are based on historical 
cost rather than replacement cost, inflation can affect the investment 
and capital durability decisions of firms. This problem was recently 
analyzed by Auerbach (1979), using a general equilibrium model in 
which investment and durability arc determined by optimizing firms 
and consumption is determined according to a proportional savings 
function. To analyze the effect of inflation on the choice of asset 
durability, Auerbach simulates his model, using a Cobb-Douglas 
technology with reasonable parameter values.' His simulation results 
indicate that higher inflation leads hrms to choose more durable 
capital, although we ate left with a caveat about the robustness of the 
results. 

This paper was wriucti while I was an assistant professor at the University of Chicago 
Department of Economks. [ thank Alan Auerbach, Dennis tiarlton, Stanley Fischer, 
Robert LaLonde, Frederic Mishkin, and Michael Mussa for comments on earlier drafts 
of this paper. 

’ Auerbach also atisumes values for (he share of government expenditure in net 
output, the fraction of disposable income saved, and the corporate (ax rate. 

{Journal of PolUuol Economy, 1981, vol 89. no 5J 
© 1981 by The Umveniiy of Chicago 0022-Bft08/8l/8905-0001$01 M) 

548 



DURABIMTV OF CAPI'FAL 


549 

By modifying Auerbach’s model to make consumption decisions 
based on intertemporal utility maximization rather than on a propor¬ 
tional saving function, we can derive analytically the effect of inflation 
on durability without having to specify the form of the production 
function. 4 he direction of the effect of inflation on durability is 
determined by whether the ratio of the nominal discount rate to the 
depreciation rate exceeds a critical value which depends solely on the 
corporate tax rate and the rate of the investment tax credit. If the ra¬ 
tio of the nominal discount rate to the depreciation rate exceeds the 
critical value—as is the case in Auerbach’s simulations^—then an in¬ 
crease in the rate of inflation leads to an increase in durability. How¬ 
ever, if the ratio of the nominal discount rate to the depreciation rate 
is less than the critical value, then an increase in the rate of inflation 
leads to less durable capital. This reversal of Auerbach's finding is 
more likely to apply to durable equipment rather than to structures 
since equipment has a higher depreciation rate than structures,^ 

A second modification of Auerbach’s model is the incorporation of 
adjustment costs into the model. The presence of convex costs of 
adjustment removes the indeterminacy of the individual firm’s rate 
of investment in the Auerbach model. It should be noted that one 
complication of introdut ing adjustment costs is that the critical value 
of the ratio of the nominal discount rate to the depreciation rate is no 
longer a function of only tax parameters. 1 he critical value of this 
ratio will, in general, depend on technology, although we tan place 
bounds on this critical value. Using these bounds on the critical value 
of the ratio of the nttminal discount rate to the depreciation rate, we 
can still derive fairly strong results about the effect of inflation on 
durability based only on this ratio and on the lax parameters. In 
general, if the nominal discount rate exceeds the rate of depreciation, 
an increa.se in the rate of inflation reduces the rate of depreciation. If 
the nominal discount rate is “enough ” less than the depreciation rate, 
where enough depends only on tax parameters, an increase in the 
rate of inflation increases the rate of depreciation. In the region 
between the tipper and lower bounds on the critical value, the critical 
value depends on technology. Note, however, that this complication 
arises from the introduction of adjustment costs. In the absence of 
adjustment costs, the critical value of the ratio of the nominal discount 
rate to the depreciation rate depends only on tax parameters. 

* For a triore precise disrtission ot Aiierbarh’s siiniilalion results, see Set'tion III 

” This statement implicitly assumes that the investment tax credit rate is the same for 
structures and equipment, so that the critical value is the same for Ixnh types of capital. 
However, under the current U.S. tax laws, the investment lax credit is applicable to 
equipment but ntM to structures. I'heretore. as will tecome dear in Section III, the 
(ritical value is smaller for equipment than h>r structures. This efFccl may weaken or 
reverse the statement in the text. 
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In Section II we develop a model of a competitive firm and analyze 
the investment and durability decisions. I’he model of the firm is 
incorporated into a general equilibrium model in Section 111, where 
we analyze the effect of changes in the rate of inflation. Section IV 
presents concluding remarks. 

II. The Model of the Firm 

.Since the model of the firm in this paper differs from Auerbach’s 
model only by the inclusion of adjustment costs and the investment 
tax credit, the description of the model will be brief; further descrip¬ 
tion may be found in Auerbach. 

We assume that the firm is a price taker in its output market and in 
the labor market. I'he price of output/) and the nominal wage rate w 
both inflate at rate 77 , so that the real wage is constant over time. Let 8( 
be the (constant) rate of exponential decay of capital goods installed at 
lime/. F.x ante, 8, is a decision variable of the firm. We assume that the 
instantaneous flow of capital services from a unit of capital is an 
increasing concave function of S, A(8), with A' > 0, A” < 0. 

Investment in physical capital consists of purchasing a unit of the 
homogeneous output and installing it in place at some cost which is an 
increasing convex function of the rate of investment. Let/, be nominal 
investment at time /, so that I,Ip, is the number of units of output 
bought and installed as physical capital. We assume that the real cost 
of installation is an increasing convex function of real investment, so 
that the nominal cost of installation at time t is p,c{l,lp,), with <(0) = 
c'(0) = 0, and c' > 0, c" > 0 for I,Ip, > 0.'* We assume that these 
installation costs can be expensed for tax purposes. Note that the 
assumption that the purcha.se of uninstalled capital goods is amor¬ 
tized over time while the installation cost is immediately expiensed is 
similar to Auerbach’s case in which a fraction e of the cost of invest¬ 
ment is expensed.^ 

Capital services at time s are 

KS, = f [ A{8,W,lp,y-’>^‘‘~'^dl (1) 

since (/,//),is the amount of capital installed at time t which 
remains at time .v and .4(8,) is the How of capital services per unit of this 

* Aclju.siTiieiii costs have been intorporaced into models t)f investment by Eisner and 
Strotz (1963), Lucas (1967), Gould (1968), and Treadway (1969). Abel and Blanchard 
(1980) distinf^iish the cost of purchasing uninstalled capital from the cost of installing 
capital, as is done in the text. 

’ T here is a slight difference, however, in that the fraction of the total cost which is 
expensed in this model is endogenous, whereas the fraction e is exogenous in Auer¬ 
bach's model. 
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capital. Gross output at time s is given by Kf = mKS„ L,), where H is 
first-degree homogeneous inXS and L. We also assume Hks, Hi., Hksx 
> 0 and HKs,xs> ^ 0- 

Taxable corporate profits, which are revenues less wages, installa¬ 
tion costs, and depreciation allowances, are taxed at rate t, 0 < t < 1. 
Let D(x, 6) be the depreciation deduction per dollar of purchase cost 
of capital of age x which decays at rate S, and let A (0 « A < 1) be the 
investment tax credit rate applied to the purchase of uninstalled 
capital. 

The objective of the firm is to maximize 

T = |(1 - r)lp,y[' - w,L, - p,c(I,/p,)] - (1 - A)/, ^ 2 ^ 



with respect toL„ and /„ where r is the rate at which equity holders 
discount after-tax nominal flows.® Note that (2) is the same as Auer¬ 
bach’s equation 3 except for the inclusion of installation costs and the 
investment tax credit. For simplicity of notation, let 

z(S) = f"D(x,S)e-^'dx (3) 

J 0 

be the pre.sent value of depreciation deductions, and let p = r - tt be 
the after-tax real discount rate. Differentiating V with respect to S,,/,, 
and Lt and setting these derivatives equal to zero, we obtain;^ 

(1 - r) J" [T'(S,) - (s - 

^-[1 - A - Tz(S,)], 

OOf 


(I - T)r' (I,ip,) = (1 - r)A(S,) - 


(4b) 


[I - A - t2 (8,)]. 


HdKS„L,) = w,/p,. (4c) 

These equations may be solved recursively.® At any point in time, /, 
KS, is fixed and the firm chooses L, to satisfy (4c). Note that, since the 


* We assume that firms are financed entirely by ecjuiiy. so that interest payments are 
not included in the expression for cash flow. 

'The first-order conditions are identical to Auerbach’s e^^, 4, except that (I - 
r)c’{f,/p,) in (4b) and * in (4a) and (4b) are zero in Auertiach’s framework. 

* If the installation costs were amortized over time rather than immediately ex¬ 
pensed, then the system would be only block recursive. Eq. (4c) could be solved first, but 
the analogues of (4a) and (4b) would have to be solved simultaneously. 
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real wage, wjpt, is assumed to be constant over time, Ht.(KSt,Lt) is 
constant over time. Sinceis linearly homogeneous, the con¬ 
stancy of Hi^{KSi,Lt) implies that//,(s(^'S<,i^() is constant over time. Let 
be the constant value of Hgs(^S,,L() and define X(6; <f>) as 


\(8,; 0) = 


(1 - T)A(S,).i> ^ ^ 

f} + 8, 


+ TZ(S,) - 1. 


(5) 


Note that [(1 - t)/ 1 (dt)tf>]/(p + 8,) is the present value of the stream of 
after-tax marginal products accruing to the undepreciated portion 
of a unit of capital installed at time /. In addition, by installing a unit of 
capital at time /, the firm reduces the present value of its taxes by A -I- 
7z(8,). 'I'herefore, \(S; </>) is the excess of the present value of the 
after-tax cash flow accruing to the marginal unit of installed capital 
over the cost of an uninstalled unit of capital. 

Note that the first-order conditions given in (4a) and (4b) may be 
expressed as** 


X'(5,; 0) 


(1 - 

p + 8, 




P + Sj 


as 


(6a) 


(I - ry(/,/p,) = \(d,; if>). (6b) 

lb summarize, the firm may use the following sequential derision 
process. Given a real wage rate, the firm chooses a KS/L ratio to equate 
Hi_ with the exogenous real wage. This KS/L ratio uniquely deter¬ 
mines <f>, the marginal product of capital services. Given <f>, the excess 
of the shadow value of installed capital over the net cost of uninstalled 
capital is a function, \(8,; (p), of 8,. According to (6a), the firm chooses 
8i to maximize \(8,; Given the resulting maximized value of X(8,; 
<p), the firm will choose the real rate of investment to equate the 
marginal installation cost with X(8,; <p), as in (6b). This solution proce¬ 
dure applies at all points in time, even when the capital stock has not 
yet reached its steady-state value. Since the real wage rate is assumed 
constant, the marginal product of capital services, <f>, is also constant. 
Therefore, the X-maximizing value of 8 is constant over time, and the 


“ If we imjKwe ihe constraint that grtcss investment is nonnegative, the first-order 
conditions with respect to/ are; (u) (I — r)i'{l,lp,) '3= A(6; <l>); and (ft) (/,//i,)l(I - t)c' {t,/p,) — 
<I>)1 = d. If f,lp, > 0, then (ft) implies etj. (fib). 

If z = S/(p - 1 - S), as will be the case later in this note, then A(6; <^) = (ft - 1) -H 1(1 - 
T)A{li)4,/{f, + 6)] + [t8/(p -(- 8)1, \'(8; <ft) = [l/(p -I- 6)q{<ft(l - t)[A'(S){p + 6) - W(8)] H- 
Tp). and \"(S; <ft) = [-2/(p + 8)q{(I - T)4.[/t'(6)(p + 8) - ,4(8)] -H rp} -t- [(1 - t)<M"(6)/(p 
- r 6)]. Therefore, when X'(8; <t>) equals zero, the term in braces in the expression for 
A"(S; <P) IS zero and A"(8; <ft) = [(1 — T)<fta"(6)/(p + 6)) < 0. Hence, setting A'(S; (ft) equal to 
zero maximizes A(8; (ft) with respect to 6. Note that later, when we examtne the case in 
which z = 16/(p -I- TT 6)1, a similar line of reasoning will establish that A’'(S; (ft) < 0 when 
A'(S; (ft) = 0, as long as zr is not too large. 
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real rate of investment is also constant." Given the constant value of 
real investment, say (Up)*, the steady-state capital stock is { \ /d)(lIp)*, 
and the steady-state flow of capital services is [A(d)/d](I/p)*. 


III. A General Equilibrium Model of Investment and Durability 

In this section, we examine the effects of changes in the rate of 
inflation within the context of a general equilibrium model. Rather 
than posit a proportional savings function, as in Auerbach, we assume 
that consumption is determined as the outcome of infinite-horizon 
intertemporal utility maximization by identical households. As noted 
by Auerbach, the after-tax discount rate p will be constant in such a 
model. 

To keep the analysis simple but without any essential loss of 
generality, assume that the number of households is constant, and 
each household owns one firm and inelastically supplies L units of 
labor to its firm. Each household selects c, and m, to maximize 

I* oe 

U(c„ (7) 

subject tt) C( = y, + (wtlpt)L — (M,lp,), where c, is real consumption 
at time l, m, is real balances at time t, M, is the growth in nom¬ 
inal money balances, antl y, = (1 - t){H (KS,,L) - (w,lpi)L — c{I,lpt)] ~ 
(1 — k)I, + T ,/pie~’'''~‘''D (t - s,d,)ds is the after-tax real cash How of 
the firm. The maximization problem is based on the assumption that 
each household receives the net cash flow of the firm in addition to 
wage income and uses its total income either to consume or to add to 
its money balances. We also assume that the government uses tax 
revenue and the issuance of new money to purchase current 
output—which does not affect the utility function U(c,, m,) '* 

Solving the maximization problem (7), we can show that, in the 
steady state, equations (6a) and (fib) continue to hold. However, in this 
general equilibrium model with labor supply constant, 4>, the margi¬ 
nal product of capital services, is a decreasing function of the steady- 
state level of capital services {A(8)/f)](IJp). Therefore, for the steady- 
state values of the variables of interest, (j>, 6, and Up, we have the 
following system of equations; 


" Gould (1968) showed that, tor a prue-laking firm with constant returns to scale, the 
rate a< investment is constant if the rea! wage rate is constant. 

Abel and Blanchard (1980) present a more tompleic discussion of a general 
equilibrium model with taxes and adjustment costs. However, in that model the depre¬ 
ciation rale is fixed exogenously, and money balances do not appear. 
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g’<0, 


(8a) 


X'(S.4.) = 0. (8b) 

k(d.(t>)-(l-Ty(I/p) = 0. (8c) 

Note that the system (8a)-(8c) could also represent the behavior of a 
single profit-maximizing firm with constant returns to scale with 
either stime monopoly power in its output market or some monop¬ 
sony power in the labor market. 

To examine the effect of a change in the rate of inflation, we 
linearize the system (8a)-(8c) around its solution to obtain 



X.i, 

k,t. 


d'(S) 

0 


d(8) ] 

8 J 


j1(8) 
^ 8 
0 

-(I - r)c" 



d(}> 


0 


d8 

= 

-x; 


d{//p) 




(In, 


(9) 


where = [(1 — t)A(&)/(p -I- S)J > 0, and XJ, = (1 — t)/(p + 8){A'{8) — 
M(8)/(p + 8)J} < 0, and Xji < 0 because of the second-order condition 
(or the optimal 8.*^ Let A be the determinant of the square matrix in 
(9), so that 


A = - 


[xi + x;g' --^[/('(S)- 


//(8) 


H-ry'+g' 


/I (6) 


XjjXfi ^ 0. 

( 10 ) 


Note that A > 0 regardless of whether c" is positive or zero. 

Given the linearized system (9), Cramer’s rule can be used to de¬ 
termine the effect on 6 of changes in the rate of inflation n, 


dS 

dn 


A 


(1 - T)c"K-g' 


A(S) 




( 11 ) 


wherefi^ = “XiX„ + In analyzing (11), we will also consider the 

following two special cases: (a) the absence of adjustment costs, so that 
(" = 0; and (8) the polar opposite case of extremely convex adjustment 
co.sts, so that c" = ». In the case without adjustment costs, which 
corresponds most closely to Auerbach’s model, it can be shown that 


d8 ^ ^ , 
dn A ^ 



{€" = 0 ), 


( 12 ) 


where the coefficient on is positive. In the polar opposite case, it 
can be shown that 


Note that the element in the third row. second column of the square matrix in (9) is 
which is zero according to (8b). 
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sign 


dS 

diT 


signX; (c" = oo). 


(13) 


In order to analyze the expressions for ddidn, we must explicitly 
model the effect of inflation on the shadow price of installed capital. 
Following Auerbach, we assume that depreciation deductions reflect 
physical decay of capital but are based on historical cost, so that’* 


2(8) = 


8 

P + IT + S 


(14) 


Using this specification of depreciation deductions, we can easily show 
that 


X = iLz ^ —ig_ 

P + 8 P + TT + S 


- 1 , 


X' = 


(1 - r)<f> 
p + 8 


A'(8) 


^(8) 1 ^ t(p + n) 
p + 8j (p + 7r + 8)* 


(15a) 

(15b) 


Now we must calcviiate X„, XJr, and B„. Differentiating X with respect 
to TT, we obtain 


X„ 


— t8 

(p + IT + 8)* 


< 0 . 


(16) 


Note that X,r is negative because an increase in the rate of inflation 
reduces the present value of real depreciation deductions and hence 
reduces the shadow price of installed capital. Next, we differentiate X' 
with respect to tt to obtain 


x; = 


(p + TT + sy 


■ (8 — p — tt) ^ 0 as 8 ^ p + TT. 


(17) 


Thus the sign of XJ, depends on whether the rate of depreciation 
exceeds the nominal discount rate. As shown in figure 1, X^, is negative 
in region I and is positive in regions // and HI. 

Recall thatfl,r = that X* is positive, and that and Xi 

are negative. Thus in region /, where 8 < p + tt and is negative, 
is negative. To determine the sign of when 8 > p + tt requires 
more calculation. Using the expressions for X*, Xi, X„, and k'„, we can 
•show that 


1 — T t/ 4(8) g 

A’m 

1 

1 8 - p - TTl 

p + 8 (p + 7r + 8)*( 

A(S) 

p + 8 

p + TT + 8 


(18) 


[f depreciation deductions are based on replacement cost rather than historical 
cost, then z(S) = 8/(p + 5). In this case, changes in the rate of inflation have no effect on 
the firm’s decision problem. 



556 


JOURNAL OF POLITICAL ECONOMY 



Fig. 1 


It will now be useful to define q, the net cost of installed capital, 
ignoring the depreciation allowance, as 


(/ = I — A + (1 — t)c' . 
Using this expression for we obtain** 

1 - 7 _ tA(8 ) _£ 


( 19 ) 


ti. = 


(j + 8 (p + IT + 8)* q(p + 7T + 8) — t8 


(l - —)6** - (p + ttY 


( 20 ) 


Note that has the same sign as the term in brackets, so that 

B„^0asi?^^S - A (21) 

Thus the sign of depends on whether (p + ■ 7r)/8 exce eds the critical 
value V1 — {r/q). Note that the critical value V1 — (r/q) is less than or 


” Rewrite cqq. (6a) and (6b) as 

(1 - T)(t> 1 ^ -r(p + tt) 

f> + S 1 p + Bl (p + 7r + 6)* ’ 

and 


(1 ~ t)4> 
p + B 


.4(6) = 


V - 


tB 

f> + n + S 


Dividing (6a') by (6b'), we obtain 


.4'(B) _ 1 __ -t(p + tt) _ 

(8) p + S y(p + j7 + 8)* — t8(p + 77 + 8) ’ 
which may be substituted into (18). 


(6a') 


(6b') 
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TABLE ( 

The EfFFCT of Inflation on Uurability 


Sign of 

1 

n 

/// 

„ ^ 
Rei.ion S V 


fi V i - k 

C*eneral r" ^ 0 — 

> 

+ 

No adjusinient costs - 

- 

+ 

c" = W - 

+• 

+ 


equal to one, so that if p + tr > S, as in region /, then (p + ttJ/S exceeds 
the critical value, and is negative. Also note that, since q ^ 1 — k, 
with equality holding in the absence of adjustment cos ts, the critical 
value must be greater than or e qual to Vl — [t/(1 — ^)]. Therefore, if 
(p + 7r)/8 < V1 - [t/( 1 - /()] as in region III, B„ is positive. 

Recall from (11) that, in the general case, if and5„ have the same 
sign, then dS/dn also has that sign. Thus, as summarized in table 1, 
dbldrr is negative in region I and is positive in region III, In region 
II, is positive, but the sign of is indeterminate without fur¬ 
ther assumptions. If we assume that c" = »=, then the sign of dSidn 
is the same as the sign of XJ, and hence is positive. However, if we 
as sume ihat ther e arc no adjustment costs, then the critical value is 
V 1 - [t/(1 - ii)l, and is negative. Hence, in the ab.sence of adjust¬ 
ment costs, d^/diT is negative in region II. 

To summarize, if the nominal discount latc exceeds the rate of 
depreciation, then an increase in the rate of inllaiion reduces the 
optimal 8 and hence leads to more durable c apital. On the ot her hand, 
if the nominal discount rate is less than Vl — 1t/(1 - k)] times the 
depreciation rate, an increase in the rate of inflation leads to less 
durable capital. In the case in which there are no adjustment costs and 
the investment tax credit is zero (as in Auerbach), an increas e in th e 
rate of inflation increases the durability of capital if p + tt > Vl - tS 
but reduces the durability of capital if p -t- rr < Vl - r 8. 

It is interesting to compare the results for the case with zero ad¬ 
justment cost and zero investment lax credit with the simulation 
results in Auerbach. Note that these (wo models are not identical in 
that they treat consumption decisions and money-holding decisions 
differently. The essential difference is that, in the optimizing model 
of this paper, p is held constant, whereas p is a variable in the 
Auerbach model. In Auerbach’s table 1, the nominal discount rate 
exceeds the depreciation rale in every row except the first. Consistent 
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with the results derived above, an increase in the rate of inflation 
increases the durability of capital in Auerbach’s simulations. 

In the first row of Auerbach’s table 1, the nominal discount rate 
falls short of Vl - r S, and the results derived above would indicate 
that dfildiT is positive. However, S decreases when tt is increased from 
row 1 to row 2. There are two possible explanations for this apparent 
contradiction. First, although the ratio (p + 7r)/6 begins below the 
critical value V1 — [t/(1 — k)] in row 1, it ends up above this critical 
value in row 2. Hence, in raising the rate of inflation from zero to 5 
percent, dSIdn changes sign, and it may be that the negative values of 
ddldiT dominate the positive values over this range. Second, Auer¬ 
bach’s table 1 indicates a drop in the real discount rate p when going 
from -TT = 0 to TT = 5 percent. The results derived in this paper have 
assumed that p is constant. However, as shown below, a decrease in p 
will tend to cause a decrease in 8, consistent w'ith the results in 
Auerbach’s table 1. 

To calculate the effect of p on the depreciation rate S, simply 
replace X.„ by kp and XJ, by Xp in (11). Differentiating X with respect to 
p, we obtain 


_ -jl -T)A{8)d> t8 . 

(p + 5)* (p + TT -I- 8)* 


( 22 ) 


Note that Xp is negative since an increase in the real di.scount rate 
reduces the presetit value of future rentals (including depreciation 
deductions) to capital. 

Differentiating X' with respect to p, we obtain 


K = 


-d - T)d> 

(P + 5)2 


^'(5) 


d(8) 1 (1 -r)d>A(8) a-p-TT 

P -(- sj (p 5)2 (p -I- TT -t- 8)2 ■ 

(23) 


Evaluating this derivative where X' = 0, we substitute r(p + 7r)/(p tt 
+ 8)2 for l-(l - T)d>l{p -t- 8)1{/1'(8) — td(8)/(p -t- 8)]} to obtain 


X' = 


(1 - r)A{8)<i> 

(p + 8)2 


_r_ 

(p -I- TT -t- 8)2 



p + v 

p + 8 



(24) 


which is positive for tr ^ 0. To calculate Bp, substitute Xp and Xp into Bp 
= —XiXp -t- k^kp and use the substitution mentioned in footnote 15 to 
obtain 


Bp 


(1 - r)A(8) 


(p -t- 8)(p A- TT + 8)2 


iq - TZ) 


-t- - 

\ P + 8 j 


-f 


q(q - t) 

q — TZ j’ 

(25) 


Under the reasctnable assumption that r < 1 - A, it is the case that q - 
T > 0 and, hence, Bp > 0. Therefore, since Xp > 0 and Bp > 0, we 
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determine from the appropriately modified version of (11) that dbldp 
> 0. Therefore, as mentioned above, a decrease in p tends to decrease 
the optimal value of S. 

We have shown that the direction of the effect of inflation on 
durability depends on relative magnitudes of the nominal discount 
rate and the depreciation rate. We now examine the effect of inflation 
on the steady-state flow of capital services by examining the effect on 
4), the marginal product of capital services. Using Cramer’s rule, we 
obtain 


d<]> ^ \_ 
d-n A 


(1 - rY'g' 




5 



( 26 ) 


Note that the coefficient of XJ, is positive and that the coefficient of X,, 
is negative. Furthermore, recall that X„ is negative. Therefore, d(j>/dTT 
is positive if either r" = 0 or X^, is positive. Since </> is a decreasing 
function of the flow of capital services, it is clear that, in the absence of 
adjustment costs, an increase in the rate of inflation reduces the flow 
of capital services, even though the durability of capital may increase 
or decrease.'* 


IV. Concluding Remarks 

By replacing Auerbach’s proportional consumption function with 
consumption behavior determined by intertemporally optimizing 
consumers, we can derive strong results about the direction of the 
effect of inflation on durability without appealing to a simulation 
analysis. If, in the absence of installation costs, (p + 7r)/8 (the ratio of 
the nominal discount rate to the rate of depreciation) exceeds a 
certain critical value, then dbldn < 0, so that an increase in the rate of 
inflation leads to more durable capital; if this ratio is less than the 
critical value, then ddldtr > 0. As explained in Section 111, these results 
are broadly consistent with Auerbach’s simulation results in which 
dbld-n < 0 since Auerbach's simulations are confined to the region in 
which (p + 7r)/6 exceeds the critical value. However, for less durable 
capital, the ratio (p 7r)/8 may fall short of its critical value, and the 
model presented here indicates that rfS/t/rr would be positive. Regard¬ 
less of whether an increase in the rate of inflation increases or de¬ 
creases durability, we find that, in the absence of installation costs, an 
increase in the rate of inflation reduces the steady-state flow of capital 
services. 


'* For the case in which c" = », sign ddddir = sign X', = sign d&Jdv. In this case, the 
sieady-,state rale of investment is invariant to the rale of inHauon. H an increase in jr 
increaiies (decreases) 8, then the steady-stale flow of capital services [A{B)l6](Ilp) is 
reduced (increased), thereby increasing (decreasing) <)> 
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Uncertain Lifetime, Consumption, and 
Dissaving in Retirement 


James B. Davies 

Vninerstty of Western Ontario 


1 his pap<*r asks whether the continued accumulation^ or mild dis- 
saving, observed among the retired can be explained by uncertain 
lifetime. In the absence of annuities, after an initial period in¬ 
fluenced by borrowing constraints, under constant relative risk aver¬ 
sion, uncertain lifetime depresses consumption by a propiortion 
increasing with age if the elasticity of intertemporal substitution in 
consumption is “small." Illustrative compulations, based on actual 
income and survival data, show that plausible elasticities are suf¬ 
ficiently small to give this effect. The reduction in consumption is 
large enough to explain much of the lack of decumulation by the 
elderly. 


I. Introduction 

It has frequently been observed that the elderly either continue to 
save in retirement or decumulate much more slowly than would be 
predicted by the crude life-cycle model without bequest motive under 
certainty. Most recently, Mirer (1979) has found that, controlling for 
education, in cross section the mean net worth of American couples 
continues to rise in retirement.' When secular growth is taken into 

I would like lo thank (ilciin MacDonald, Nigel romes, Geoffrey C’.arliner, Ghns 
Robinson, and John Whallcy, as well as the participants in the Labor Workshop at the 
University of Western (Ontario and a seminar at McMaster University, for helpful 
comments. The responsibility for any errors or omissions is iny own. 

' Shorrocks (1975) points oni that the slope of the age-wealth profile i.s biased upward 
by the lower mortality of the wealthy. Mirer (1979) argues that controlling for educa¬ 
tion largely corrects for this. While il is not clear that explicit corrections based on 
•Estimates of differential mortality by stK'ial class provide more reliable results (Shor¬ 
rocks 1975, p. 163), It should be noted that Shorrocks obtained fairly rapid decumula- 
tion in retirement for the United Kingdom by using this alternative. 

Uoumal III l‘aUtual kemumj, IDSl.vol SO, iki 31 
® 19(11 by The llnivertily of ChKngo. 002S-3808/Hl/89OS-0(IO6»0l 50 
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account, this implies an increase, a fortiori, for a cohort. Almost as 
startling are the positive, or only slightly negative, rates of accumula¬ 
tion found in surveys of saving.* However, few, even among the old, 
,wv they arc saving for bequest.* Recent work suggests that this may be 
because human as well as nonhuman intergenerational transfers are 
possible, and many families are at a corner where only the former are 
desired (see Drazen 1978; Becker and Tomes 1979; Adams, forth¬ 
coming). Thus a bequest-motive explanation of the slow dissaving of 
the elderly is not as attractive as it might seem. 

This paper shows that the observed slow decumulation of the el¬ 
derly may to a large extent be explained by uncertain lifetime. It does 
so by developing a model in which the effect of this kind of uncer¬ 
tainty on consumption by persons without a bequest motive can be 
assessed at different ages. Constant relative risk aversion is assumed 
for simplicity, but an appendix (obtainable from the author) shows 
that the results also hold under constant absolute risk aversion. Non¬ 
pension annuities are ignored on the argument that a no-bequest 
life-cycle model is consistent with their observed unimportance in 
household portfolios only if available annuities are not sufficiently 
attractive to dominate regular saving instruments for most consum¬ 
ers, With the most plausible taste parameters and rates of return, in 
this model in the absence of pensions uncertain lifetime will not only 
depress consumption at all ages but will also have an increasingly 
severe proportional impact beyond middle age. Illustrative computa¬ 
tions, based on actual earnings and survival data, show that this 
outcome is also likely with realistic pensions and that the effects are 
sizable. On conservative assumptions, uncertain lifetime may more 
than halve the mean rale of decumulation among the retired. 

The paper is organized as follows. Section II reviews previous work 

^ In three major .studies, the following ratjo.s of mean saving to mean net worth fur 
those over b,') were fotiiul: 


Rate 


Sdui 11’ 

(V 

V c.ii 



1 vci.tii pp. 147) 

iiiul p 1 lO) 

T k 

19,12 


-1 3 

cUkI Pr<))c<i()i p 213) 

SuuiMKs (Canada (M)73r/. [) H4. 

t' S 

1 963 


+ 2 

\91U, p. 140) 

t.anada 

1970 


+ .8 


Only 4 percent of the resfx*ndent.s to the 1962 Survey of Finannal Charoflervitics in 
the Uniiet] States cited "providing an estate" as a saving objective (Projector and Weiss 
1966, table A30). The 1964 Brookings Survey ot alfluent families (incomes above 
510,000) found only 23 percent (of all ages) who were saving to make a bequest 
(Barlow, Bra/er, and Morgan 1966, p. 198). 



DISSAVING IN RETIREMENT ^ 

563 

on the impact of uncertain lifetime on consumption. Section III 
presents the model, Section IV the illustrative computations. 

II. Previous Work 

Yaari (1965) studied the impact of uncertain lifetime on the planned 
rate of change of consumption, examining four cases defined by the 
presence or absence of a bequest motive and actuarially fair insur¬ 
ance. Except in the cases with a bequest motive, insurance takes the 
form of either life-insured net borrowing or annuities. Since under 
actuarial fairness the interest rate on the latter exceeds that on "regu¬ 
lar" savings, consumers without a bequest motive hold their entire 
wealth in the form of annuities.'* 

Omitting a bequest motive, Yaari assumed that a consumer aged t, 
with probability P{t\i) of surviving to age t and a maximum lifetime 
of T, would have the expected utility integral: 

= f p(7\t)a{T)U[C(r)]dT (1) 

where C(t) is consumption at t, q((t) is a subjective discount factor, and 
the rate of time preference is 1 - /*(T|/)a(0- 

Maximizing (1) subject to the appropriate constraints yields the 
following results. In the absence of insurance, since the rate of time 
preference is higher than under certainty tP(r|0 < 1], consumption 
tends to grow more slowly throughout the life cycle. However, ac¬ 
tuarially fair insurance makes the price of future consumption P (t j <) 
exp [—r(r - <)]. where r is the (constant) rate of interest, that is just low 
enough to offset completely the higher time preference. The result is 
that, with insurance, consumption grows at about the same rate under 
uncertainty as under certainty.® 

It is important to note that Yaari’s conclusions concerned only the 
shape of the lifetime consumption path and not its level. Levhari and 
Mirman (1977, referred to as LM below) have extended Yaari’s work 
by analyzing levels in the absence of insurance. They assume constant 
rates of subjective discount and relative risk aversion, giving: 


* I'he rare of return on an annuity of a year's duration would equal the rate on 
regular savings plus the probability of death during that year (see Vaari 1965, p. 144). 

' The actual results are that CU) = -(r + d(dl(t'’/t/") under both certainty and 
unceriaitiiy with insurance, while c(<) = -(r + <»(/) — P(t j under uncertainty 

without insurance. (A dot denotes a nnie derivative and a hat a proptortional rate of 
change with time; V and U" are first and setond dcrivauves oft/ with respect to Cl/].) 
Since C'(r) will differ in the three cases, and U'fl!" is in general not constant, we can say 
only, loosely, that under uncertainty consumption tends to grow at the same rate as 
under certainty with insurance but more slowly without. 
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F= rV(T|/)exp[-p(T-0]-^^^^^T. (2) 

j t I — y 

Here —y is ihe elasticity of marginal utility with respect to consump¬ 
tion, and y is the index of relative risk aversion. The elasticity of 
intertemporal substitution in consumption, tr, is equal to l/y. 

Maximizing (2) when resources are composed exclusively of non- 
human wealth, LM find that the impact of uncertain lifetime is a 
priori indeterminate. In the special case where r = p = 0, initial 
consumption will be higher than under certainty if y < 1 and lower if 
y > 1. When r and p are nonzero, there are a number of distinct cases. 
It is argued in Section IV of this paper that the most “realistic” is 
where y > 1. In this case, unless r or p (or both) is very high, increased 
uncertainty always leads to a decline in initial consumption.* As shown 
below, this conclusion does not extend to the case where resources are 
largely in “human" form. 

Barro and Friedman (1977) have drawn attention to the fact that, 
under (2), w'ith actiiarially fair insurance, consumption grows at the 
same rate under uncertain lifetime as under c ertainty. They have also 
pointed out that if earnings did not change with age, and r and p were 
zero, consumption would be constant throughout lifeaf the Mine level 
under certainty and uncertainty. This special result appears to be the 
only available analysis of the impact of uncertain lifetime on the level 
of consumption, in the presence of insurance.^ 

These previous contributions might be criticized for focusing either 
on a world without insurance and annuities or on one in which they 
arc actuarially fair. Both situations are clearly “unrealistic.” Must we, 
then, model insurance with an equilibrium load to make further 
progress? In the present context, the answer is no. Aside from con¬ 
siderations of analytical difficulty, the exercise is not necessary if we 
wish to discover whether the life-cycle model without a bec^uest motive 
can explain the slow dissaving of the old. Nonpension annuities are in 
fact of little importance for savers of any age." There are two possible 


* I-evhan and Mirman 1977, p. 275. Note that LM did not rcgaid this case as 
especially .signifiiatit. In tact, in their abstract they state "the major result is that if the 
utilily function is Cobb-Douglas and the rate of return is not too large relative to the 
ainouril of future dLscouiiting then liletiinc uncertainty will always increase consuinp- 
iion,” which refers to the case where y = I. 

' An appendix to this paper which analyzes the case with actuarially lair insurance is 
available from the author. The illustrative compulations show that, although, iheoreti- 
cally, a positive or negative impact of uncertain lifetime on consumption is possible, a 
slight negative impact is likely at all ages. ITiere is no tendency, however, for this to lie 
more severe for the elderly, 

" In the 1962 Survey of Financial C.hararlrruUrs only I percent of households reported 
any private annuities, and on average these composed less than 0.1 percent of the 
household portfolio. Surprisingly, the figures are not any higher for those aged 65+ 
(Piojector and Weiss 1966, tables A8 and A31). 
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explanations. Either there is a widespread strong desire to make 
bequests, or insurance markets fail to provide annuities sufficiently 
attractive to outweigh the greater transactions costs and incon¬ 
venience of saving in this form.® Thus the only case in which the 
life-cycle model without a bequest motive could explain the low con¬ 
sumption of the old is that where regular saving instruments gen¬ 
erally dominate annuities. Accordingly, for analytical simplicity it 
is assumed here that regular savings always dominate annuities. 


III. The Impact of Uncertain Lifetime at Different Ages 

The object of this paper is to determine the impact of uncertain 
lifetime on consumption at different ages. One might suppose the 
apptopriate procedure would be to compare complete lifetime plans 
under certainty and uncertainty. Such a comparison would not, how¬ 
ever, tell us the impact of uncertainty on an individual of arbitrary age 
with given current wealth and other characteristics. It wiiuld indicate 
only differences in projected con.stimption at various ages for indi¬ 
viduals with given wealth at the beginning of their lifetimes.'* 

['he impact of uncertain lifetime on con.sumption by an individual 
o( aget with wealth, W(t), and a noninvestment income stream, 

^ r ^ 7', ran be measured by the difference between what he would 
con.sume if his lifetime were certain, and if it were uncertain, 

C“(/). (1 he argument t will be omitted in the text below when no 
ambiguity arises.) In making this compari.son we must vary only the 
degree of uncertainty. Iti pariicular, the expected lifetime, 

m( 0 =J^^/"(TlOdr + t, (3) 

must be held constant. 


(Consumption under (Certainly 

Under certainty a consumer of arbitrary age, /, would maximi7e (2) 
subjet t to the constraint that discounted consumption up to p,(/), K' (t). 
could not exceed total current resources, £''(/). Re.sources would be 
composed of current wealth, lV(f), and the discounted value ol nonin- 


" riicic is a sigiiificuiu problem <>( adverse selciiinii since sursivai ihances clearly 
ililler widely but may Ik- quite dil'lituh lor an insurance lompain to .wenam 

As mentioned previously, Yaari (I96.'>) showed that under uncertainly consump¬ 
tion tends to grow less quiekiv than under eertaiiuv. Thus, it respective of the dificrcncc 
in initial consumption (say, at age 21)), by relireniciil, consumption under a complete 
liletiine plan will almost certainly l>e lower under uneerlaini). This has no Iscaring, 
however, on whether an individual aged, say, 65, ol given wealth, consumes more or 
less than he would under certainly. 
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vesimeni income, E(t), t ^ t ^ fjL{t), denoted Adding the termi¬ 

nal condition W[/i(/)J = 0, we have the constraint: 

J M(/> 

exp L-r(T - t)]dT = 0, (4) 

or 

«<■{<) - K'-(l) = 0. 

t his is a standard problem in optimal control. (Consumption will grow 
according to 


(y^r) = = g. (5) 

where the hat denotes a propxirtional rate of change with time (as it 
will throughout). F.quation (5) has the obvious solution: 

= exp lg(r - t)Yy(t). (6) 

Substituting (6) into (4) we obtain;" 

= |J - »)(t - /)]dT| = PC''{l)R<'{t). (7) 

The tei in P('.', the_average ‘’propensity to consume" out of resources, 
is just P(y{t) = LAC'(!)/(>’'■(/))“' = that is, the inverse of dis¬ 

counted lifetime consumption normalized by initial consumption. It is 
helpful to view PC’ in this light since it is usually easy to see how a 
change in uncertainty affects "normalized” lifetime consumption. 
The latter simply reflects the shape of the consumption path and the 
strength of discounting. 


ConsumpUon under Uncerlatnty 

Under uncertain lifetime, the consumer maximizes (2) subject to the 
constraint that wealth may never become negative. ( To lend without 
set urity to a person whose lifetime is uncertain would be to provide a 
form of insurance.) As Yaari pointed out, under this constraint the 
optimal plan consists of a series of intervals where the constraint is 
effective and others where it is not. 1 refer to these as "blocked” and 
"free,” respectively. In the LM analysis, where all resources are in the 
form of an initial stock of nonhumah assets, the entire lifetime is a 
free interval. Here matters are more complicated. 

Over blocked intervals we have, of course, C” = E. In addition, it is 
clear that the consumption plan is continuous at the beginning, t*, 

" Eq. (7) IS not simplified, in order to empha.sir.e Us similarity to the corresponding 
expressions under uncertainty, cqq. (10) and (13). 
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and end, t* + m, of any free interval, with €“{1*) = E (t*) and C'"(I* + m) 
= E(e* + m).’* Over the course of a free interval, consumption 
changes according to 

C“{T) = -S-[r-p +/'(T)] (8) 

y 

(Yaari 1965, p. 143).That is, consumption grows less quickly than 
under certainty because expected utility from future consumption 
declines without any corresponding drop in price. Solving (8), we 
obtain: 


= P(T|t)''^exp |g(T - ^ t ^ r ^ t* + m. (9) 

Recalling that wealth must be exhausted over the interval, we have: 

(;■(,•) - . If-'" -HI 

1 exp l(g- - r)(t - t*)}dt 

Equations (9) and (10) completely specify the free interval when t* + 
m = r. In the case where /* + m < T, we have a further relationship. 
Since C’"(i*) = E{t*) and C“(l* + m) = E{t* + m), given (9), E(l*) and 
E(t* + wt) are related as follows: 


E(l*) = 


Pfll* + m) I exp igm) 


An indication of the range of plans that may be obtained with 
realistic E and P paths and plausible values of r, p, and y is provided 
by figure 1. The plans shown are the result of the illustrative compu¬ 
tations of the next section. Note that in all cases life begins with a 
blocked interval and there is just one free interval.'^ Part A indicates 
that with E = 0 beyond age 65 (no pensions), the free interval would 
extend until T. I hat is, wealth would not be run to zero as long as 
there was a nonzero probability of surviving any longer. In partB, the 
consequences of realistic pensions are shown. With a high y, the 
consumption path is relatively Hat (see (9J) and a free interval lasting 


If expected utility could be increased by raising consumption at t* 

and reducing it just before. A similar argument holds for t* + m. 

” Note that, since P(t|t) = 1, P(t) = /^(t| r). 

In general, the consumption plan will not always have these features. The initial 
blocked interval is caused here by E being low at the outset. A second free interval docs 
not succeed the first because, beytmd I* + m, the desired growth rate of consumption is 
declining because of the rapid fall ofP, whereas pensions (if present) fall a consunt 
rate of 2.25 percent per year. (See the next section for an explanation of tfeila rale.) An 
increase in the ratio of the growth rates of desired consumption and E would be 
required for a second free interval to emerge. 
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Fk.. I.—Scle< tf <1 optimal fonsuinption paths, r = .03. p = ,015 


until / is obtained, whereas with y — 1 the desired growth rate of 
consumption declines quickly and a second blocked interval occurs. 

Behavior of C“/C' if Free Interval Lasts until T 

How does C" compare with (F at different ages? Consider, hrst, 
situations where the free interval lasts until 7'. At any point in the free 
interval, discounted consumption up to /' cannot exceed total “re¬ 
sources,” 7?“, that is, the sum of current wealth and the present value 
of the/s stream uptoT,//". Adding the terminal condition WfL) = 0, 
we have: 

\V(t) + H-{t) exp l-r(T - Old/ = 0, (12) 

or/?"(/) - /w“(/) = 0. Substituting (9) into (12), we obtain: 

C“(/) = Tp (t I /)■'’■ exp [(g - r)(T - /)]dT ' «“(/) = Pr;“(/)ft“(/). 

^ (13) 


Again, PC“(/) = A“(/r‘. 
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g and 

»■ y 

PC" 1 PC' 


<1 

>1 if y sufficientJy below 1, 
otherwise < \ 


= i 

<1 

J 

>1 

<1 


<1 

>1 

^f ! 

= 1 

=1 

1 

>1 

<1 


1 

>1 

g<’' 

= 1 

>1 


1 

< 1 if y sufficiently large, 
otherwise >1 


The behavior olCJC^ is determined by that oi'R‘‘/R'^ and PC'/PC^'. 
When the free interval lasts until 7', in the presence of pensions we 
have > I. However, as will be shown, we may have PO'IPC < 1, 
so that C"/C'' < 1 is a possible outcome. 

From e<}uations (7) and (13), we see that PC/PC^ depends on the 
relative size of^ and r and the value of-y. In Section IV it is argued 
that y i.s typically considerably greater than unity. It is observed that 
apparently non-credit-constrained households consume at an in¬ 
creasing rate. As long as utility is additively separable over time, this is 
only consistent with an intere.st rate greater than the subjective dis¬ 
count rate.'® Thus rc*alisiic restrictions indude y > 1 and p < r. 
Further, if-y is typically considerably above unity, unless p is far below 
zero, which seems unrealistic, we have g < r. 

Levhari and Mirman have provided a full analysis of the relative 
values of PC" and PC'". Their conclusions are set out in table 1. When 
g = r and y = 1, recalling (3), we find that (7) and (13) imply that/'f." 
= PC^. laiwering y below 1 raises 1/y and increases T’f-'VPC'', while 
raising it does the opposite. Withg > r and y = 1, PC^IPC'' < 1 since 
greater relative weight is placed on the more remote future in the 
denominator of PC" than in that of PC''. Raising y simply reinforces 
this. On the other hand, although redut ing y below 1 tends to in¬ 
crease PC"IPC', the ratio will not rise above unity until y is sufficiently 
small, so that a simple result is not obtained. 

Now, in the apparently realistic situation where g < r. we have 
PC/PC^ > 1 with y « 1, again because more weight is placed on the 
more remote future in the denominator of PC". However, raising y 

“ Iti Yaari's framework, which does not require conslani risk aversion or constant 
subjective discounting, consumption wilt not rise unless the rate of return exceeds the 
subjective rate of discount; see Yaitri (1965), pp 143, 147, 
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above unity reduces 1/y and tends to make PC fall. Clearly, with 
sufficiently large y we obtain PdPC < 1. In the next section we find 
that with actual earnings and survival data, y does not have to be far 
abf)ve unity for this result to occur. 

In order to study changes in PdPC with age, one may compare 
the proportional rates of change:'* 

PC{t) = PC{t){\ - exp {{g - r)[ju,(/) - /]}A(0) + “ f). (14) 

and 

/^"(/) = PC"(<) ++ (g - r). (15) 

To interpret these relationships, it is helpful to note that PC* = —K*. 
Consider first a situation where the lifetime was actually fixed andg = 
r. In this case, normalized lifetime consumption, K*, would simply 
equal the remaining life expectancy, fi{t) — t. Each year K* would fall 
by a percent equal to its own inverse. Hence PC = (A'')~' would rise by 
this same percent, that is, PC itself. 

When allowingg r, sayg < r, we give A' some tendency to rise as t 
increases since weaker discounting (compounding, ifg > r) is applied. 
1 his gives rise to the (g — r) in both (14) and (15). Finally, in fact 
lifetime is not fixed. Under certainty, the extension in possible 
lifetime increases normalized lifetime consumption, reducing PC', 
while under uncertainty, the increased probability of survival to any 
future date also raises A* and reduces PC. 

Equations (14) and (15) show that, in the main, whether PiyiPC 
tends to rise or fall depends on whether it starts out greater or less 
than unity. When PC‘/PC‘' < 1, a decline may not necessarily occur. 
Htiwever, since /i(/) and P(t) are both typically cjuite small, in our 
"most realistic” case, of g < r and y well above unity, it is likely that 
PC‘/P(? will be low enough to ensure its continuous decline. Whether 
this translates into a decline in ('."IC depends entirely on the behavior 
of «"/«'•. 

In the absence of pension income, since most people retire before 
the age at which death was expected, even from the viewpoint of the 
start of adult life, is typically the same as /?*'. Hence everything said 
above about PCCIPC applies to C‘K7. That is, with the most plausible 
parameters CKy < 1 and declines iiionotonically with age over the 
free interval. In addition, it is clear that (71(7 < 1 during the initial 
blocked interval (('7 will be less than the “free” value given by eq. [13|), 
although the ratio would likely typically increase with age prior to t* 
since E rises quickly for the young. (Recall C'“ = E in the blocked 

'• In deriving (15) let 7r(T) = the unconditional probability of surviving to age t. Then 
= 7r(r)/7r(t), and dP(T\t)ldt = —n{r'ytr(t)~'‘Tr(t), or -P(T\l)P{t). 
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interval.) These conclusions are particularly interesting since many of 
the studies indicating slow decumulation by the old were conducted 
when pensions were less sizable and widespread than they are today. 

When pension income is present, we of course havei?“//i'' > 1; and 
even when the free interval continues to last until T , the above conclu¬ 
sions will hold only with a higher value of -y. In the next section we 
find that the required level of y is not, however, implausible. Here we 
should note that, although «"//?'■ > 1, it has some tendency to fall with 
age. The R* has two components, fV and the former of which we 
hold constant in the comparison of certainty and uncertainty. Since 
most often W is rising and H' is falling, R" and R'' have a basic 
tendency to become more equal. In the next section we find that in 
practice this dominates, and 7?“//?*^ typically falls with age. It should be 
noted, however, that the behavior of //“///'' is in opposition. We have 


!h(t) = r 


FAD 

//“(<) ’ 


(16) 


and 

ry-,., , , £Lm( 01 exp - t]}iX(t) -FAA ( 17 ) 

« (0 - r + 

Since£[M(0]exp { - t]}/i(0 is usually small relative to£(0 and 

//“ > this means that H" tends to fall more slowly than 


Behavior of C“/C“ if Free Interval Ends before T 

As we saw in figure I, in the presence of pensions the free interval 
may terminate before 7'. In this case, it is likely that CVC > 1 
throughout the free, as well as the following blocked, intervals. 

In the computations of the next section, in cases where the free 
interval ends before T, we usually find that n(t*) < I* + m. Hence fiit) 
must rise and become equal n>t* + m at some pioint in the interval. At 
this age C“ exceeds 6''. In general, we have: 

== ^^(0 + JT^" - DVj. f* ^(18) 

" £(r 10*'’' exp [(g - r)(T - Wt 
and 

f ^ 1^(0 + 0 ^ ^ r. (19) 

exp [(g - r)(T - OJdr 

for /jL(t) = (* + m relevant resources are the same, but the de¬ 
nominator is smaller under uncertainty, and C > <&*'. 
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What can we say about C‘‘/C’' at other points? Before the age where 
fi{t) = t* + rn, the longer horizon under uncertainty makes the 
denominator of (18) larger relative to that of (19). However, relevant 
human resources in the numerator of (18) are greater than those in 
(19). Similarly, above the point where = I* + m, the denominator 
of (18) falls relative to that of (19) but the numerator does the same. 
In the illustrative computations, the net result is often that > C' 
throughout the free interval. 

Finally, after ihe free interval we are likely to have C'‘ICy > 1 again. 
To see this, consider a case where + m occurs in retirement and the 
pension changes at a constant percentage rate. As long asg exceeds 
this rate, which appears likely, po.sitive .saving will be desired under 
certainty, and C'' < E — C. 

IV. Quantitative Impact 

I'his .section provides a quantitative study of the likely impact of 
uncertainty on consumption at different ages. It does this by com¬ 
puting hypothetical consumption plans under certainty and uncer¬ 
tainty at different ages, using actual data on noninvestment income 
and survival probabilities and a range of values ofr, p, and y. The first 
step is to compute the complete lifetime consumption plan under 
uncertainty. I his getierates a time path for W(0- Next, at intervals of 
1 year I ask how' much a person would consume, given the W(t) 
computed in the first step and a ceitam lifetime equal to life expec¬ 
tancy at that age. 

Figures 2 and 3 plot the data used. An inverse logistic curve (its the 
P path in figure 2 well and allows the u.se of continuous time in the 
computations.*^ Similarly, the lifetime path of mean noninvestment 
income in figure 3 is approximated from ages 20 to 70 by a fourth- 
order polynomial.'” Beyond age 70, it is assumed that E declines at the 
observed trend rate of 4.25 percent per year. It is also assumed that 
secular growth shifts the entire cross-section profile upward at a rate 
of 2 percent per year. This explains why the E path for an individual 
shown in Hgure 1 differs from the cross-section prohle of Hgure 3. 

The computations were carried out with alternative values ofr, but 
I focus mainly on the case where r = .03, suggested by popular 
estimates of mean household rates of return (see, e.g., Boskin 1978, p. 
S19; Feldstein, Green, and Sheshinski 1978, p. S64). The subjective 


The tui vcA’tO = 1(1 + a) exp (-it)]/(l + a exp (r/)] was fined by least .squares, using 
numerical methods (>?’ = .999). Values of the parameters are a = 1.00031, h = 
-0.002114, and c = 0.146.'51. 

'« l hecuive,£(0 = -36,999.4 + 3520.22< - 101.878d + 1,34816/’ - 0.00706233/’, 
was htted by ordinary least squares (/f* = .999). 
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SourcB Stotlsifn Conodo |l974,pp 14-IS) 



Fu.s. 2-3. — Fin. '2, Male survival piobabililies (Canada 1971). Fig. 3, Average male 
nonmvc.stmeat income (Caiiacla 1971). 


rate of'discount i.s limited to the arguably “realistic” range 0 s p s r. 
(It was suggested in the previous section that p > r is implausible, 
while allowing p < 0 simply rciiiCorces the results obtained below.) 
The parameter y is allowed to vary from 0,5 to 5.0, rellecling both a 
desire to cover the alternatives y g 1 and considerable evidence that y 
is typically much above unity. 

A variety of evidetice on y is available. Friend and Bliime (1975, p. 
920) estimated it in the context of risk aversion, using data on house¬ 
hold asset httldings, and found that it was likely to be in excess of 2.0. 
Fat her (1978), attempting to infer the preferences of members of the 
United Mine Workers from the results of caillcctive bargaining in the 
coal inclusiry, also estimated y as the index of relative risk aversion, 
obtaining alternative estimates of 3.0 and 3.7 (p. 935). Further evi¬ 
dence is provided by Ghez and Becker (1975, p, 133) and MaCurdy 
(1979, p. 34), who, in studies of the allcK ation of goods and time over 
the life cycle, estimated upper bounds on cr of .28 and .33, re- 
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TAB1.F. 2 

C“/C'' WITHOUT Pensions 


At.i- 





y 






r = .03 



r = .015 

r = .045 

.5 

1 

2 

3 

4 

5 

4 

4 





P 

= 0 




20 

1 060* 

.441* 

.306* 

.274* 

.260* 

.252* 

.236* 

.285* 

;t() 

1 264 

1.000 

.903 

.872 

.857 

.835* 

.794 

.886* 

40 

1.305 

1.000 

875 

.833 

.811 

.798 

.748 

.863 

.50 

1 375 

1.000 

.832 

.775 

.746 

.728 

.684 

.800 

60 

1 498 

1.000 

.771 

695 

.656 

.633 

.600 

.709 

70 

1 678 

1 000 

.702 

609 

.564 

.538 

.518 

.609 

80 

1.815 

1.000 

668 

571 

.525 

.499 

.490 

559 





P 

= r/2 




20 

.441* 

.306* 

.260* 

.247* 

241* 

237* 

226* 

257* 

30 

1 216 

986 

.857* 

.820* 

.802* 

792* 

.798* 

810* 

40 

1,261 

1.028 

903 

.855 

.830 

814 

760 

.885 

50 

1.342 

1 032 

862 

.798 

.765 

.744 

.695 

.823 

60 

1 481 

1 038 

.802 

.718 

.674 

.648 

.610 

.733 

70 

1.675 

1.041 

.731 

.629 

579 

550 

.526 


80 

1 823 

1 039 

.691 

.586 

.537 

.508 

.496 

576 

SB 

■■■■■ 

mm 

IHIHI 

imiimiHQ 

= r 






.224* 

224* 

.224* 

.224* 

.224* 

.217* 

.234* 



752* 

.752* 

752* 

752* 

.752* 

.769* 

743* 



1 044 

927 

.876 

847 

829 

.771 

.904 



1.054 

.889 

820 

782 

.7.59 

.705 

845 



1 067 

.830 

.739 

.692 

.662 

619 

7.56 



1.076 

.758 

.649 

594 

.562 

.533 

.652 



1.074 

.714 

602 

.548 

517 

502 

.593 


• N(inii<*ffdtivc wrallh <'oiis(raiii( rlle^uvr under iinrenainly 


speciively, implying y ^ 3.6. Finally, recent studies of the elasticity of 
aggregate saving with respect to the real rate of interest suggest high 
values of y as well. Boskin (1978, p. S4), for example, estimates the 
uncompensated elasticity to lie in the range 0.3-0.4 in the United 
States. Including pensions, with r = .03 and p = .015, my model 
implies uncompensated aggregate saving elasticities of 0.9, 0.5, and 
0.3 with y = 3, 4, and 5, respectively.'* 

To sum up: The available evidence suggests that y may typically be 
above 3; that a value of 4 may be a “best guess” at the true figure: and 
that even y = 5 could not be rejected out of hand as implausible. 

Table 2 presents the illustrative computations for the case where £ 
is .set to zero beyond age 65, that is, where there are no pensions. A 
value of y below 1 gives C“/C‘’ > 1 and rising with age in the free 

** Alternative values of r and p yield similar results. Using y = 4, with r = .06 and p = 
03 the elasticity is 0.6, while with r = .03 and p = .0 we get a figure of 0.4, e.g. 
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I ABLE 3 

C'‘IC' WITH PENSIONS 


y 




r = .03 



r = .015 r = .045 

A OK 

.5 1 

1 2 3 

4 

5 

4 4 


p -0 


20 

.997* 

.415* 

288* 

.258* 

24.5* 

.237* 

.217* 

.274* 

.30 

1.346 

1.089 

.909* 

.827* 

.790* 

.769* 

.740* 

.836* 

40 

1.397 

1.109 

.992 

.946 

.922 

.908 

.915 

.935 

50 

1.491 

1.145 

.988 

922 

.888 

.867 

.886 

.899 

60 

1 669 

1 211 

.983 

.885 

.835 

.805 

.842 

.840 

70 

1.965 

1.337 

994 

.844 

772 

.730 

.795 

.766 


1.808 

1.500 

1 065 

.843t 

748t 

.6967 

.7937 

.7267 

p = r/2 

in 

.41.5* 

288* 

.245* 

.232* 

.227* 

.223* 

.207* 

.247* 

HH 

1.232* 

.909* 

.790* 

.7.56* 

.739* 

.730* 

.714* 

.764* 

r^i 

1.330 

1.168 

1.043* 

974 

945 

.926 

929 

960 

r^i 

1.437 

1.178 

1.030 

9.59 

.917 

.891 

.904 

.933 

60 

1 618 

1.266 

1 046 

937 

.874 

.835 

.865 

.888 


1.705 

1 397 

1 090 

917 

.822 

767 

.825 

.827 

El 

1.222* 

1 230 

I 193 

943t 

807t 

.7377 

.8307 

.7987 


p = r 


20 

.210* 

.210* 

.210* 

.210* 

.210* 

.210* 

.199* 

,224* 

30 

693* 

.693* 

.693* 

.693* 

.693* 

.693* 

.688* 

.701* 

40 

.935* 

.935* 

.935* 

.935* 

935* 

.935* 

.943 

925* 

50 

1.138* 

1 138* 

1.056 

.991 

.944 

.914 

.921 

.964 

60 

1.398* 

1 263 

1.089 

.985 

.912 

866 

.889 

.934 

70 

1.123* 

1.318 

1.154 

992 

875 

.807 

.857 

.895 

80 

1.088* 

1 088* 

1.220 

1.053 

.8767 

7847 

.8717 

.8877 


* N<)niic|(aiiv« wealth constraint effective iimicr uiKCiiainiy. 
t No blcKketl iiiterv«ii succeeds free interval. 


interval, while 7 > 1 yields the opposite, as predicted in the previous 
section. With 7^3 dramatic results are obtained. Using the “best 
guess” figure ot 7 = 4 and the combination r = .03 and p = .015, for 
example, maximum negative impacts on consumption are 23.5 and 
46.3 percent for the middle-aged and old, respectively. With the 
same, quite conservative, parameters, the mean ratio of dissaving to 
net worth for those aged 65-85 declines from 9.0 to 3.7 percent 
with the introduction of uncertainty. Note, finally, that the results are 
robust with respect to changes in r. 

Table 3 shows that when noninvestment income in retirement is 
introduced, the ratio C/C’’ rises in most cases. However, with 7^3 
and p < r, the impact of uncertainty remains negative and much 
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larger for the retired than for the middle-aged. When we focus again 
on the r = .03, p = .015, y = 4 case, we see that, while uncertainty 
reduces consumption in middle age by up to 8.3 percent, it depresses 
that in retirement by up to 19.3 percent. Although this is a more 
modest effect than obtained without pensions, it is still impressive, as 
shown by the fact that with the same parameter values the average 
rate of decuniulation between ages 65 and 85 falls from 7.0 to 2.9 
percent with the introduction of uncertainty.*® Furthermore, isolation 
of the r = .03, p = .015, -y = 4 case is conservative. Small changes in 
parameter values can make the effect even more sizable. By reducing 
p to 0 or by raising y to 5, for example, we increase the maximum 
negative impact on consumption for the elderly to about 25 percent 
and reduce the mean rates of decumulation under uncertainty for 
those aged 65-85 to 2.4 and 2.1 percent, respectively. Finally, note 
that the results are again robust with respect to r.*' 

V, Conclusion 

1 his paper has shown that the life-cycle mttdel without a bequest 
motive can help lo explain one of the most interesting features of 
actual consumption behavior, the typical low rate of dissaving among 
the elderly, when uncertain lifetime is taken into account. Theory 
shows that, with the tnost plausible parameters, a negative impact of 
uncertain liletime which increases in proportional severity with age is 
likely in the absence of pensions and may also occur when pensions 
are present. Illustrative computations based on actual income and 
survival data show that this effect is, in fact, obtained with or without 
pensions. Fut ihermore, the magnitude of the impact is sufficient to 
suggest that uncertain lifetime could provide the major element in a 
complete explanation of the slow decumulation of the retired. 
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What Kind of a Science Is Economics? 

A Review Article on Causality in Economics 
by John R. Hicks 


Christopher A. Sims 


As computational costs have declined, economists have used increas¬ 
ingly complicated statistkal models in analyzing business cycles. Most 
ol these tnodels have been based on extensions of the IS-LM 
framework developed by Hicks, combined with the simultaneous 
equatiotis statistical framework which was the central subject matter 
of econotneirics. These large ccottomelric models were, and still are, 
successful iti certain important respects. By making possible the sys¬ 
tematic use of large amounts of statistical information and the 
codification of forecasiitig and projection procedures, they provided 
good forecasts—as good as or better than averages of “judgmental” 
forecasters, [ he models and their services have become commercially 
valuable. Nonetheless, they seem to many economists to have out¬ 
grown their underpintiings in economic and statistical theory. 

I'he economic theory entering the models does not in itself explain 
the existence of recurrent business cycles. In practice, the htted mod¬ 
els usually explain much of what might be labeled “the character of 
the business cycle” as a consequence of properties of the “error terms” 
in the tnodel about which theory has little to say. This is not entirely 
the fault of the economic theory. I'he simultaneous equations statisti¬ 
cal theory tnakes heavy demands in this area of application on sup¬ 
posed a priori knowledge. It is therefore perhaps not surprising that 
the actually available economic theory leaves large gaps to be filled 
with conventional or ad hoc assumptions. The burden of ad hoc 
assumptions and implicit statistical procedures carried by the models 
for these reasons weakens them somewhat as tools for forecasting and 
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p»olicy projection. It weakens them still more as scienfific instruments 
when important economic issues are subject to dispute. 

Reflecting these sources of dissatisfaction are two current devel¬ 
opments in macroeconomic research. On the one hand, some 
economists are building stochastic theoretical models which have test¬ 
able probabilistic implications for the behavior of major aggregate 
time series as stochastic processes. On the other hand, econometri¬ 
cians have been looking at methods for reducing the ad hockery of 
the standard methodology by formalizing the procedures for control¬ 
ling the dimensionality of models and for accounting for drift in 
model structure. 

These developments seem to me natural and hopeful. I hey do, 
though, point to an increasingly complicated and intense interaction 
between theory and data, using probabilistic models. What is going on 
is not quite like what has gone on in any other science, though one can 
see some analogies. Is what is going on useful? What are the criteria 
for confirming or disconfirtning models of this type? Are the formal 
and infortnal standards for reporting results in this area reasonable? 

According to its introduction, the btxtk under review grew out of 
Hic ks’s meditation on a 1974 conference on the microfoundations of 
macroeconomics, which he attended and saw as a failure. It also 
reflects his view that “econometrics is now in some disarray.” The 
book discusses scientific method in general and then in this light the 
special character of economics, especially macroeconomics, as science. 
As is evident from the book’s length (124 + xii pages), the argument is 
concise, and it is presented in a conversational tone. The central 
conclusions, as I see them, are that “as economics pushes on beyond 
'statics’ it becomes less like science, and more like history” (p. xi) and 
that "the probability calculus is no excuse for forgetfulness” (p. 122). 

I’o explain completely what Hicks means by these phrases would 
require more space than I have in this review, but some explication 
is obviously required. The former idea is that dynamic economic 
theories must inherently be incomplete, imprecise, and therefore 
subject to variation over time. One reason for this is that economic 
cause-effect relations involve a “recognition delay” about which 
theory has little to say and which may be expected to be variable. For 
example, when a price change causes a change in an amount pur¬ 
chased, there will be a delay between the actual price change and 
consumers’ hearing of it, as well as a f>ossible further delay before 
consumers decide to act on the information. It is wrong, then, to 
expect economic theories to be complete, mechanical, and divorced 
from reference to specific historical circumstances. 

T he latter idea is that, particularly in economic time series, prob¬ 
ability models are often inappropriate because the data are interde- 
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pendent in ways which make treating them as “random patently 
unjustifiable. Hicks argues that probability judgments in ecdnomics, 
both in microeconomic decision making and in evaluation of com¬ 
peting economic theories, often result in incomplete orderings, with 
many possibilities lying in a gray zone of noncomparable probabilities. 
Thus the usual axiomatic arguments for the probability calculus are 
unjustified in a wide range of econotnic applications. He cites as an 
example of this objection to the probability calculus in inference the 
situation in which a new fact which seems not to fit existing theory 
emerges, and a new but ad hoc theory is put forward to “explain” it. 
Because a priori and likelihood considerations point in different 
directions in this case, he argues that the two theories lie in the gray 
zone of noncomparability and that the reasonable action to take is to 
reserve judgment. Similar con.siderations, Hicks argues, suggest that 
economic behavioral models of decision making under uncertainty 
ought not to be formulated taking the usual probability calculus for 
granted. Probabilistic models and methods of inference are often 
awkward and misleading tools in application to economics. 

Hicks’s discus-sion is valuable for the insights it presents into the 
special character of economics as a science. Its conclusions, though, 
seetn to me deeply mistaken, in part because the argument too easily 
resolves some of the problems it raises. 

My dissatisfaction with the Hicks book is easier to explain if we 
compare it with another recent book on a closely related subject— 
C.hauce, (lanse, Reason, by Arthur W. Burks (1977). The Burks book is 
expansive (almost 700 pages), where Hicks’s is concise; it is densely 
formal iti parts, w here Hicks’s is persistently infoimal; it very seldom 
gives the impression of having chosen an easy resolution of a 
diffic ulty, where Hicks’s does. Burks, for example, considers at some 
length the objection to the probability calculus given central place by 
Hicks: that probability judgments do not yield a complete ordering. 
Burks argues that these objections are correct in principle but that 
they are in the nature of qualifications to the range of applicability of 
the calculus—when no decision has to be made, or when the decision 
to be made is unimportant, then it may not be “economic” (to use 
Burks’s word) to rank all probabilities and utilities. But (this is my 
conclusion, not Burks’s) in important issues of scientific inference or 
in economic decision making which involves substantial amounts of 
money and utility, it is both norinatively and descriptively rea.sonable 
to treat probabilities and utilities as completely ordered. In taking 
decisions, we implicitly rank them in any case. 

I could go on citing Burks in rebuttal to parts of Hicks’s argument 
with which I disagree, but it would be better for the reader of this 
review to read Hicks and struggle with Burks (I for one cannot “read” 
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Burks in the ordinary sense of ihe term) and reach his own conclu¬ 
sions. For despite my sympathy with Burks’s style and with many of 
his ai guments, 1 do not .see his book as in any sense a substitute for or 
a refutation of Hicks. Burks is constructing a model of scientific 
inference, and it seems that the hard sciences are the focus of his 
interest. He rightly points to the need for .some degree of uniformity 
in sdentific laws over space and time if inference is to be possible and 
also rightly (in iny view) claims that his motlel of inference applies 
even if there is some variation of laws in space and time. He rightly 
claims that there is a sense in which empirical probability statements 
(e.g., “the probability that this coin will come up heads is .5 ") can be 
true, and from this point builds up an intersubjcctive (as opposed to 
Savage’s personalistic) theory of inference. Economists do often aim 
at inlluencitig professional consensus as to what causal structure 
plausibly underlies historical data. If we are to think about how 
economists dti this or ought to do this, an intersubiective theory like 
Burks's fills an impoitant gap in Savage’s subjective theory. 

But Hicks is right about some of the central differences between 
economics and other sciences—temporal instability of our theories is 
not a marginal qualification to our methods of inference but a central 
aspect of our difficulties iu determining what is true. And treating 
"einpirital truth ” as a pi imiiive notion seems too simple it) lead to a 
ihetii'y applicable to economics. 1 ho problem of the “ad hoc theory,” 
ai'isitig from the dearth of “empirically true” probability statements in 
economics and the near impossibility of generating them by experi¬ 
ment, is teniral to us. Burks’s system, while lelevam to economics, 
fascinating on its own terms, and stimulating as an example, docs not 
confront in detail some of the central problems of inference in eco¬ 
nomics. 

Hicks is not explicit about how his broad conclusions apply to 
specific types of cut rent wot k in econoinics. (Indeed, except for the 
author’s owu works, uo publications in macroeconomics, probability, 
or statistics less than 15 years old are cited.) He does in his final 
paragraph appear to condemn all u.se of probability methods on cross 
.sections not generated b) random sampling; 

1 bus it is not at all sensible to take a small number of 
observations (sometimes no more than a cloven observations) 
and to use the rules of probability theory to deduce from 
them a “significant" general law. For we are assuming, if we 
do so, that the variations from one to another of the obser¬ 
vations are randotn, .so that it we had a larger sample (as we 
do not) they would by some averaging tend to disappear. But 
what nonsense this is when the observations are derived, as 
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not infrequently happens, from different countries, or lo¬ 
calities, or industries—entities about which we may well have 
relevant information, but which we have deliberately de¬ 
cided, by our procedure, to ignore. [Pp. 121-22] 

1 think Hicks is wrong here. Any use of experience to reach better 
decisions must involve “deliberately ignoring” information, whether 
the experience is the historical record or a sequence of repeated 
experiments. In physical experiments there is always observational 
error, which is treated as random. A good experimenter may have 
many bits of information he judges relevant to explaining the appar¬ 
ent random error in a sequence of experiments. In writing up the 
experiments, though, he does not report which research assistant was 
on duty at each meter reading, together with an analysis of each 
assistant’s bias atid mean square deviation as a meter reader. In 
physical sciences there is substantial consensus as to what is to be 
treated as random. 

In social sciences, with hi.story the most extreme instance, there is 
much less consensus. It is nonetheless necessary to ignore some in¬ 
formation, to treat it as unsystematic or random, if the past is to be 
useful. Because there is inevitably room for dispute about what is 
systematic and what is tiot in analyzing history, historians do not 
dogmatically suppress discussion of information which some might 
consider "random” or unsystematic. Nonetheless, when a decision has 
to be made, if history is to be relevant, some choice must be made as to 
what is ratidoin. A political decision maker might well need to judge 
how likely it is that a Marxist revolution in his country would be 
followed by political democracy. He might think that the historical 
record concerning previous Marxist revolutions is relevant to this 
question. However, it is obvious that every previous revolution has 
special characteristics which are well known and which are potentially 
relevant to explaining whether the postrevolutionary government 
emerged as politically democratic. In applying the historical record to 
his own problem, the decision maker will have to treat at least some of 
these special characteristics as random. 

The .same type of argument justiBes, in clinical trials of a new drug, 
reporting statistical summaries of the outcome of therapy rather than, 
case by case, the best guess of the attending physician about why the 
result of therapy was what it was for this case. And the same type of 
argument again justifies using statistical methods for observation on 
countries Or regions even when we could adduce a great deal of 
special information about each observation which might possibly help 
explain its “random"’ component. 

Economists must inevitably try to sort out systematic patterns from 
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random variations in the past—if only because, unassisted, policy¬ 
makers would do the same thing more naively. In doing so econ¬ 
omists will need probabilistic models and statistical methods of 
inference. Like historians, though, they must accept that a single 
agreed view of the causal structure of the record they examine will 
never emerge. Perhaps Hicks's stimulating but unsatisfying book will 
lead someone to work out a philosophy of inference which confronts 
both these aspects of economics as science. 
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To Save or Savor: 

The Rate of Return to Storing Wine 

Elizabeth Jaeger 

aj Vir^nia ajuL Freemark Abbey VVi»/*r^ 


For $28,000 one can buy a Jaguar XJ6, 308 shares of McDonnell 
Douglas stock, a one-tenth interest in an Oklahoma gas well, 4i 
years of undergraduate life at Stanford, or one bottle of Cihatcau 
Lafite, vintage 1806. “That’s $ 1,166.67 an ounce. At this moment gold 
is worth only $275 an ounce, and you can’t smell or drink it” (Saturdfiy 
Revieii’ of Literature 1979, p. 38). Nine years ago, $70 could persuade a 
Paris restaurant to part with this same bottle of li(|uid gold (Goldwyn 
1979, p. 4). 

But for balance admire the growth of a slightly more pedestrian 
vintage, a 1959 Beaulieu estate-bottled Ciabernet. Today a case of it 
may be purchased for $1,200. For the more modest budget, $950 will 
buy a ca.se of 1970 Beaulieu George Latour Private Reserve C^abernet. 
All this goes to show that wine, the right wine, in the last 10 years has 
provided the prudently daring investor with a spectacular return. 

The virtues of wine as a lk]uid asset have been tested recently by 
Krasker (1979). Krasker cannot reject a hypothesis that wine offers no 
extraordinary rate of return; the wine investor can expect no greater 
return than that whit h accrues to riskless assets. While Krasker hnds 
the concept that an investment in wine outperforms Treasury bills in 
nominal return “incompatible with the way economists believe most 
assets are priced,” his result seems counter to intuition. Wine as a 
subcategory of riskless a.ssets seems contrary to wine’s very nature. 

Wine is a living substance. Its maturation process extends well 
beyond the bottling day, lasting as long as 20-40 years in the case ol 
some reds, while a few others refuse to retire at 65. Butt/ic proper age 
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at u'ltich to consume a wine is not some known date but rather a peak 
which must be discovered by the bulk purchaser through ex¬ 
perimentation. When Iretiuenily posed with the question when to 
drink, Harry Waugh, the famous wine writer, would respond, “Just 
imagine looking around a room full id people and somebody saying 
to you, ‘Tell me at what date each of these individuals is going to 
ciie' " (Waugh 1968. p. 1), 

This element of uncertainty is reHciied in ihe development of the 
1966 vintage French burgundies. Thc-se wines were considered years 
avvay from full ripeness when, in 1975, out went the cry, “Drink your 
!966’s!”; they had unexpectedly gone over the hill (Robards 1976, p. 
38). 

Such events explain .some of the sudden downturns in wine prices. 
Figure I traces both the expected and actual maturation of a 
hypothetical vintage and its price through time. If. at some point in 
the life of the wine, investors discover that it has peaked prematurely, 
then they readjust their offer price to rellect this new information. On 
the other hand, a publii i/.ed tasting which reseated much life in a 
wine of old vintage could be expected to raise its value to the investor. 

Quality uncertainty need not, by itself, imply that wine will com¬ 
mand a risk premium unle.s.s the variance in the return to wine is large 
relative to the total variance of a portfolio. If wine speculators do not 
hold a “market” portfolio, such a possibility may exist. 

In addition to unanticipated (piality changes, wine is affected by 
cyclical demand fluctuations which can severely aflcct wine prices. For 
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examp/e ihc- 1974-75 recession coincided with a decline in mn 
prices This covariance with other asset returns is an additional reaso 
that wine will tommand a risk premium. 

In view of the uncertainty associated with wine’s growth and devel 
opment and of wine’s covariance with other assets in a potential 
portfolio. Krasker’s empirical finding that investors receive no risk 
premium for holding wine is surprising. This paper suggests an 
alternate method for estimating the wine investor’s return. 

My approach differs from Krasker’s in three important respects. 
Krasker’s data on wine prices are taken from the annual Heublein 
Wine Auction and cover the 4 years 1973-74, 1974-75, 1975-76, and 
1976-77. By limiting his data to these 4 years in particular, he has 
biased his procedure toward hnding a subnormal rate of return. 'I’wo 
years of bis sample of 4 were extremely poor from the perspective of 
the wine industry; in early 1974 and through 1975, newspapei head¬ 
lines proclaimed that “rivers of surplus wine’’ existed, reflecting the 
high inventories of wineries and their distributors. In these 2 years, 
wine prices, the French especially, fell precipitously below those prices 
of many preceding years. The negative trend of these Hgurcs could be 
expected to impact heavily on wine returns calculated during this 
period. My data include the prices determined by the Heublein Auc¬ 
tion over 8 years, beginning in 1969 and ending with the 1977 auc¬ 
tion. Observations covering 4 more years may offset the dampening 
effect of the 1974-75 and 1975-76 auctions on the total return to 
wine storage. 

Second, I eliminate an unknown in Krasker’s regression: the cost of 
storing wine. Where his regression estimate of the cost to the wine 
investor is $ 16.80 per case per year, I use the cost accounting figure of 
Freemark Abbey Winery, an annual per case storage ccjsl of $.449. 
I'his number is calculated on the basis of the opportunity cost of the 
storage facility divided by the number of cases. The Freemark Abbey 
storage cost figure should reflect the cost for the auction participants 
who, in general, store wine in commercial cellars. 

For comparison, however, I estimate the annual cellar costs and the 
rate of return to wine storage; in this way I have a basis for comparing 
Krasker’s and my results. 

Finally, my estimation procedure will differ from Krasker’s gener¬ 
alized least-.squares process. Limitations on available computer soft¬ 
ware forced me to employ an alternative estimation procedure which 
I explain below. 

'The Model 

Following Krasker, I restrict my attention to include only red Bor¬ 
deaux and C>alifornia Cabernet Sauvignon produced since 1950. 
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Thus, I concentrate on those wines which are commonly found to 
benefit by extended aging, and 1 exclude those wines which Krasker 
beheved were traded as anuques.' An observation is recorded every 
time a particular vintage wme is traded in consecutive years; then over 

K T" be represented by 

(Pti*i, PnVp,i< where p„ is the price of wine i I year prior to the price 
observation on the same wine, 

Krasker tests the hypothesis that the expected rate of return to 
storing wine is equal to the rate of return to riskless assets; that is, 

^^Pt-m ~ pH ~ h 

P,i 


r = 0 = 0. 


( 1 ) 


where 0 is the risk premium, S is the cost of wine storage, and r 
denotes the average rate of return t<i 3-month Treasury bills for the 
four quarters which precede the auction. Equation (1) implies the 
following regression equation; 


Pt+ii Pti 
Pa 


r, = $ + 8 —^ + -HmL.. 

P,i P,i 


( 2 ) 


where the error term u,+ ,( is serially uncorrelated as expectations are 
assumed to be rational, and is assumed to have mean zero and 

constant variance. 

Because the disturbances are correlated within each year, Krasker 
employed a GIJs procedure to derive the estimated equation (stan¬ 
dard errors in parentheses): 


Pt+lt pH 
Pit 


- r< = -.059 + 1.4 — 

(.037) (.72) 

= .03; SE = .24. 


-t- 


Pii 


(3) 


The estimate implies that wine costs $1.40 per bottle ($16.80 per 
case) to store. Although not significantly different from ^ero. the 
return differential between wine and Treasury bills is negative. 


The Fixed-Effects Model 

Because the sample data involve different numbers of observations 
for each different year, the GLS approach is not straightforward. 
Krasker does not mention how he estimates the off-diagonal compo¬ 
nent of the variance-covariance matrix for his GLS estimator, and 


' Although prestige and publicity over bottle content assuredly guided the purchaser 
of the 1806 t’hateau Lafite. many truly fine wines have been excluded by the arbitrary 
iboicc of 19,60 for a cutoff point. Nonetheless. 1 strive to maintain the similarities of 
ihese two papers where I can, so 1 have adopted this same breakpoint. 
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instead of writing my own seemingly unrelated regression program, I 
shall consider an alternate method to account for disturbances which 
are correlated within each year; least-squares estimation with dummy 
variables, d'he dummy variable technique exploits the special feature 
of the combined cross-section/timc-scries data by explicitly accounting 
for the “ctmstant” portion of each error term for each given cross 
section. 

As in the GLS procedure, this method presumes that each residual 
iit+iJpii for observations in a given year may be decomposed into two 
independent components: an individual effect, and a remainder, 
c,. Then, u,+ufpu = €, - 1 - v,i, where, in terms of the present example, e, 
represents a nonrandom addition to the rate of return to all wine sold 
in the year I, and v„ becomes the specific classical error term associ¬ 
ated with the ith wine, that is. 


(T?,, for / = /' and i = 
0 , for t 7 ^ t' or i 9 ^ ] 


riic e, component of the rate of return can be incorporated into 
Krasker’s regression equation by the introduction of dummy variables 
to represent given years. In this way, each year is t haracterized by its 
own special intercept. The regression equation is then a fixed-effects 
model, using Scheffe’s (1959) terminology. 

Foi ease of exposition, I will denote the dependent variable as y,(. 
The vector (/t = 1, . . . , m) will contain all observations on the 
dependent variables for year k. The variable l/p,( will be denoted z„, 
and Zfe will be the vector of observations on 2 ,, for year A. The vet tor 

(A = 1. m) will contain all of the ob.servations in for year A. 

Finally, (A = 2. m) represents a vector with entries of one in the 

elements associated with the Ath year and zeroes elsewhere. Stacking 
the vectors, let 



I'he relationship to be estimated from the combination of time- 
series and cross-set tion data can then be expressed in the form 

y = c + /Sjia + ... + (3jn, + 6 * + v, (4) 

where (c, , /3m. 6 ) is a veettir of unknown parameters. 

Equation (4) can be estimated efficiently with ordinary least squares. 
The parameter estimates can be used to estimate the quantity (r + 
[I/tw] XII 2 / 3 (); the risk premium that one would expect to accrue to a 
wine is randomly selected from the total sample. 

The application of this procedure to the wine data indeed reveals a 
premium to the wine investor with one qualification. Running four 
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TABLK I 

I'hk Rciiikn to Wine S'lOttAt.i-, 
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K^gkkssions Using Rkori-ssions IIsinc, 

4-VtAK Uaia {N = 140) S-Ykak Data (iV = 109) 



Regression 
K.stiiiiaic of 
Slot age Cost 

•Storage t:o.st 
Ksiimatc 
$.449 

Regrc.vsion 
F-stimaie of 
sSiorage (-ost 

Storage < 5>st 
t stimate 
$.449 

Average premium 
lo wine investor 

- 01367 

.03812 

.08.542 

.16614 

Standard ei ror 
(average premium) 

.0,3399 

.0661 

.03379 

.03296 

/-statisiic 

-.4021 

.5767 

2.-5279 

5.0392 

d 

17.2,'i 


)6.60 


(SK) 

(H.76) 

.29 

.27 

(5.13) 

.41 

.38 

Standard ctror 

24 1 .S 

.2439 

2532 

2,587 

Mean depondciit 
vari.tble 

0182 

0168 

.1036 

.1068 


separate regressiotis, I have distinguished between the effects on the 
rate of return of (1) increasing the period of observations f rom 4 to 8 
years and (2) estimating (on the right-hand side) versus incorporating 
(on the lc(t-hand side) the storage cost in the regression equation. 
Spec ifically, 1 run two regressions on ccpiation (4), one using Krasker’s 
140 observations taken over 4 years and the second using 199 obser¬ 
vations from an 8-year lime span. In an identical fashion. 1 run two 
more regressions on the following version of cejuation (4), 

y - 8z = < + 4- V, (5) 

where 8 = .449, the Freemark Abbey storage cost estimate. Using the 
estimates for (r, /Sj, . . . , /§„). I have calculated c + (l/«) Pi 
together with its standard error. The results appear in table 1. 

Not surprisingly, one estimate confirms Krasker's findings that the 
expected return to wine lies beneath the return to Treasury bills This 
regression utilized Krasker's data over the pericxl 1978-77, where 
1973-7,'i were very poor performance years for the wine industry. 
And like Krasker’s result, the return is biased downward by the 
unrealistic storage cost estimate produced by the regression: an an¬ 
nual $17.2.5 per case. The inffucncc of this estimate can be seen in 
the second regression, where the cellar cost is accounted for using 
the freemark Abbey winery estimate; a 3.8 percent premium to 
oenophiles separates the wine and Treasury investment. However, as 
in Krasker’s study, neither premium is significantly different from 
Zero. 

Kxpanding the data to cover the 8-year span (1969-77) greatly 
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instead of'writing my own seemingly unrelated regression program, I 
shall consider an alternate method to account for disturbances which 
are correlated within each year: least-squares estimation with dummy 
variables. I'he dummy variable technique exploits the special feature 
of the combined cross-section/time-series data by explicitly accounting 
for the “constant” portion of each error term for each given cross 
section. 

As in the GLS procedure, this method presumes that each residual 
u,+ tJp„ for observations in a given year may be decompo.sed into two 
independent components: an individual effect, v,i, and a remainder. 

Then, = €, + i’,i, where, in terms of the present example, e, 

represents a nonrandom addition to the rate of return to all wine sold 
in the year /, and i’,i becomes the specific classical error term associ¬ 
ated with the ith wine, that is. 


F.i’iiVrt = 


(Tr. for / = t' and i = / 
0 , for t ^ t' or i ^ j. 


1 he e, component of the rate of return can be incorporated into 
Krasker’s regression equation by the introduction of dummy variables 
to represent given years. In this way, each year is characterized by its 
own spet iai intercept. 'The regression equation is then a hxed-cffects 
model, using Scheffc's (19.59) terminology. 

For ease of exposition, I will denote the dependent variable as y„. 

The s et tor (k = 1. m) will contain all observations on the 

dejiendeni \ariables for year A. I he variable Mp,, will be ilenoled 
and will be the vector of observations on 2 ,, for year A. The vector 
{k ~ I, .... m) will contain all of the observations in Vt\ for year k. 
Finally, i* (A = 2, . . . . m) represents a vector with entries of one in the 
elements associated with the Ath year and zeroes elsevvhere. Stacking 
the vectots, let 



The relationship to be estimated from the combination of time- 
series and cross-section data tan then Ire expressed in the form 

y = c + /Szii + . . . + + 6* + V. (4) 

where (c, /J,, . . . , d) is a vector of unknown parameters. 

F.(iuation (4) tan be estimated efficiently with ordinary least st|uares. 
Fhe parameter estimates can be used to estimate the (juanlity (/ + 
[1/rnJ S.'lj /3i); the risk premium that one would expect to acc rue to a 
wine is randomly selected from the total sample. 

The application of this procedure to the wine data indeed reveals a 
premium to the wine investor with one ejualiheation. Running four 
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Regression 

-SiDi.itte Cost 

Regression 

Siorajfc Cost 


Ksiiitiale <>f 

F.sliinate 

Ksuinate o( 
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Storage <d>si 

$.44',t 

Sioragu ('osl 

$.449 
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to wine investor 

-.01367 

.03812 

.08542 

.16614 

Stantlaid einir 





(average prcinuim) 

.03.3‘)‘l 

.0661 

.03379 

,03296 

f-SldllSllC 

-.4021 

.T7t>7 

2.3279 

3 0392 

8 

17 23 


16 60 


(SE) 

(8.76) 


(.3 13) 



20 

27 

.4 1 

38 

Standard enoi 

241.3 

.2439 

2 3:S2 

2387 

Mean de|>ciictcm 





\ Ai lable 

0182 

.0168 

. 1036 

,1068 


separate iegressions, I liuve distinguished between the effects on the 
rate of return of (1) iru reasing the period of obsei vations from 4 to 8 
years and (2) esdinaiing (on the right-hand side) versus incorporating 
(on the left-liand side) llie storage tost in the regression eiiuation. 
Spec ificaliy, t run two regressions on ecpiation (4), one using Krasker's 
140 observations taken over 4 years and tfie second using 199 csbser- 
vaiions from an 8-yeai time span. In an identical fasJiion. 1 run two 
more regressions on tfte following version of e()uaiion (4). 

y - Sz - c + + ... + + V, (5) 

where S = .449, the Freemark .Mjljey storage cost estimate. Using the 

estimates for (c, . Pm)’ * have c.ilculaied r + (1/m) )3, 

together v\iih its standard error. 1 tie results appear in table 1. 

Not surprisingly, one estimate conlii nis Krasker’s findings that the 
expected return to wine lies beneath the return to I'reasury bills. I'his 
regiession ulili/ed Krasker's data over the period 1973-77, wliere 
1973-75 were very poor performance years for the wine industry. 
And like Krasker’s result, the return is biased downward by the 
unrealistic storage cost estimate produced by the regression: an an¬ 
nual $17.25 per c ase. 1 he inHuenc e of this estimate t an be seen in 
the second regression, where the cellar cost is accounted lor using 
the freemark .Abbey winery estimate; a 3.8 percent premium to 
oenophiies separates the wine and Freasttry investment. However, as 
'll Krasker’s study, neither piemiuin is significantly different from 

/.CTO. 

Expanding the data to cover the 8-year span (1969-77) greatly 
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improves the calculated return to wine. A low dividend of 8.5 percent 
refiects the high estimated cellar cost of 16.60. But wine outperforms 
Freasury bills by an impressive 16.6 percent for the commercial 
establishment for whom the Freeinark Abbey cost gauge applies. Both 
premiums are significantly different from zero. 


Reinterpreting Krasker’s Regression Equation 

A somewhat disturbing <|ucstion which 1 have yet to address is w'hy 
Krasker’s and my regression equation with unrestricted storage costs, 
that is 


-r, = e + S 

fill fit! 

give the implausible estimate, attributed to l//h,, of $16.60 per case 
annually. However, note that 16.60//>(, is a quantity on the order of 
magnitude of a rate of return, since the average value oi pu over the 
8 -year period is $410. An alternative interpretation of this result is 
that 6{\lfi„) does not represent annual storage costs but rather an as 
yet unexplained portion of the rate of return to wine storage. 

If the premium of the wine investor can be segregated into two 
effects, one w hich atcrues to the wine regardless of prit e and a second 
effet t related to its nominal price, then 0 + (B/pn) will represent the 
total return to storing wine. In this case 8/fi„ captures the premium as 
a function of price. Krasker’s positive estimate for 6, interpreted in 
this light, implies a higher premium to lower-priced wines. The alter¬ 
native postulate that high-priced wines receive the better rate of 
return would be supported by a negative 6. 

The possibly surprising conclusion that lower-priced wines receive 
higher premiums than their cminterparts can be explained by risk. 
Wines attached to smaller nominal price lags for some reason entail 
greater risk and return for the investor. I hus price stands as a proxy 
lor risk, where risk can be measured by the variance of the rate of 
return. The validity of this explanation for Krasker’s “cost” estimate 
relies on the variance of the rate of return of high- and low-priced 
wines, and support for this interpretation of the regrc.ssion equation 
would be gained by a measurably higher rate of return variance in the 
low-price group. Ordering the 199 observations from the 8 auction 
years by nominal price, I have computed the mean rate of return and 
variance for the 65 highest- and 66 lowest-priced wines. While wines 
in the high price range offered the investor an average rate of return 
of .03 (3 percent) with a variance of .051, the lowest one-third 
provided a mean rate of return of .3587 (36 percent) with a variance 
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more than twice as large as the highest third,. 1162. Performing an F- 
test, I can reject the hypothesis that the two variances are ec]ual at the 
95 percent level of significance, since the value of the/"'- statistic is 2.28. 

Repeating the procedure using 140 observations from the 4 auction 
years considered by Krasker yields similar results. The 47 highest- 
priced wines provide a mean return of —.0267, with a variance of 
.044; the47 lowest-range wines earned anaverage return of .1843 (18 
percent) with a variance of .086, double that of the high-priced wines. 
Again, ba.sed on an F- statistic of 1.95, I can reject the hypothesis that 
the two variances are equal at the 95 percent level of significance. 

Apparently, then, wine has an even higher return than allowed 
earlier in this paper, since previous calculations did not incorporate 
the return associated with a wine’s own price but rather attributed 
that portion of the rate of return to cost. Using the cellar cost statistic 
computed by Freemark Abbey Winery, we can infer f rom Krasker’s 
regression that 

7.h... - r„ - .449 — = (tf + 6 ~] + (6) 

P,i \ Pt, / P» 

1 he amended risk premium to storing vyine for the 8-year period, 
derived by computingc +• (llm) + (S/p), witbp == $410, is .12.39 

with a standard error of .0672. 

Before proposing an intuitive explanation for the greater risk asso¬ 
ciated with lower-priced wines, it should be emphasized that “lower 
priced” is relative to this sample of expensive premium wines. By no 
means do I mean to imply that risk is associated w ith less costly wines 
in general; consistency (i.e., zero risk) is the by-line of Ernest and Julio 
Gallo's jug wines. 

One plausible interpretation of the result is that the most expensive 
wines, heavily represented by the older vintages, offer greater cer¬ 
tainty to the investor because of their age. Their more youthful 
( ounterpart, with a shorter trac k rec ord, provides the wine buyer with 
unrealized potential, and thus risk, at a discounted pi ice. 


Conclusion 

In reestimating the rate of return premium to wine storage using a 
larger data set, 1 have reversed Krasker’s findings that wine specula¬ 
tion yields no greater return than do Treasury bills. In addition, I 
have shown that the use of an actual storage cost measurement allows 
reinterpretation of Krasker’s estimated “storage cost" as part of the 
return to wine. Adding this return to Kiasker's original estimates also 
I produces a positive premium to wine storage. 
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On the Relationship between Commodity Price 
Changes and Factor Owners' Real Positions 

James Classing 

Atisttahfin Nntutmd 


Recent contributions to iratle theoiy have iiiili/.ed the gencrali7crl 
variable propoi lions model in f)rder to identify ‘natural ft iends" and 
“natural enemies” of particular relative commodity-price rbanges 
(see, e.g., Jones and Scheinkinan 11977) aiul Jones [1979, chap. 8] tor 
a relevant survey). In the case where the numtx^r of factois equals 
the number of comnnxhiies, it is now known that every factor has a 
natiti al enetny in somecotnmodiiy in the sense that any factor's teward 
is inversely related to sotne commodity’s pi ice. Also, it is knerun that, 
111 general, a given factor need not have a natmal iiiend in the sense 
that the factor’s rewatd may not increase moic than propoitionately 
to atiy comitioclity’s puce nurc-ase. 

However, this result should licit be interpreted to imply that in the 
» X n case some individual whose enefowinem consists onlv of a given 
factor may not be .ilile to iiu rease liis real income through .in appro¬ 
priate comntodity-pi iie use. On the conirary, when we take into 
ai count an iiKfividu.il's expenditure p.ittein. it can be shown that any 
individual can improve his re.tl position through some commodity- 
pi ue rise. 

In order to sc-e this, consider the standard « X ii variable propor- 
lions trade model, l.el O denote the ii x » disinbuiive share matrix, w 
the II X 1 vector of f.ictor ivvvauK, and p the ii x 1 vector of 
(ommodiiy prices. Using ” to denote a proportionate change, v\e 
have the usual price equation 

B'w’ = ^ (1) 

I hw pajK-r li.is l>cnc-filcil from ilu- hclptiil oiiiiincms of Peter Wan amf an anon\- 
relerct*. 


\liwtuil of houimy, I'lHI, vt)l KM. imv SJ 

>n Jhc rinvpMitv t.f ( hK^gt. 
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Wc assume ihai 0' is iionsingular so that 


w = S'-'p. ( 2 ) 

)ories and Scheiriktnan show with an example that tor any f'actori the 
ah row of 0'^' may contain elements all less than unity. 

But the change in an individual’s real posiiitin also depends upon 
expenditure patterns. Suppose that agent I's endttwment consists only 
of the tth factor. Denote by the /th agent's expenditure share of 
income on commodity k. Then the real income effect for individual I 
o( a change in any commodity k's price is given by 

ri’i ~ <f>',kpk- (^) 

.Substitution from (2) yields the real income effect 

(0iV' - <(>lk)pk, (4) 

where 0,V' is the i — Ath element of 0' '. 

fhe teal position of agent / improves if the term in parentheses in 
et|tiation (4) i.s positive. Wo know that this term is negative for some 
k, since all factors have natural enemies. But the sum of these 
coelficients over all k is zero, since 0' is row stochastic, so that 0'“' has 
unit row sums, and since all income is spent. That is, 

n n 

X 0.V' - X = I - ' = (5) 

t = l Ar-I 

rhc'iefore, since the summation has at least tme negative term, it must 
have a positive term as well. The real position ot any imlividual can be 
improved through an appropriate commodity-price rise. 

As it stands, this lesull is not the exact counterpart of the "existence 
of a natural enemy’' result which depends on only the technology. A 
natural friend would lx* a commodity for which a price rise induces an 
improvement in a factor’s real wage, independent ol the expenditure 
shares of its different owners u.sed Icj compute each owner’s price 
deflator. Rather, we show here only that in the n x ft case for each 
individual / whose endowment consists only of a given factor, there 
exists a comtiiodity such that, when this commodity price rises, the 
real wage of that factor, when deflated by /'s price iticlex, rises. 1 hits 
every individual is a proponent of .some—-not necessarily the same— 
commodity-price increase. And, of course, if all owners of a single- 
factor happen to display the same expenditure shares, then theic 
would be a sense in which every factor can improve its real position 
through some commodity-price ri.se. 
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Confirmations and Contradictions 


Stochastic Implications of the 

Life Cycle-Permanent Income Hypothesis: 

Evidence for the U.K. Economy 

Vince Daly and (ienrge Hadjimatheou 
Kinffston Polytf'ihiiic 


In a previous issue of this Jourtuil, Hall (1978) pi esenled a simple life 
cycle-permanent income model of the consumption decision ac¬ 
cording to which consumption expenditures arc predicted to follow a 
random walk with trend; that is, 

C, = \C,_, -t-€„ X>1, (1) 

where e, is white noise. Hall’s slated purpose in deriving this equation 
was to provide a testable implication of the pure life cycle-permanent 
income hypothesis which avoided the econometric problems associ¬ 
ated with a consumption function in which measured income is in¬ 
cluded as an explanatctry variable. 'I'he testable implication is that, 
apart from this period’s consumption expenditure, no variable 
observable in this or in earlier periods should show any predictive 
power for next period’s consumption expenditure. For U.S. cjuarterly 
data 1948(1)-I977(1), Hall found this to be the case, with the excep¬ 
tion of movements in the real value of corporate stock.* Given the 
importance of the hypothesis it would seem worthwhile to examine 
the extent to which it is substantiated by alternative economic his¬ 
tories, and so we present here the results obtained from U.K. c|uar- 
terly data in the postwar period. 

' In Hall's pd|ier the sij^nifu antc nf the lorporale sttx k vaiiable i.s not cotisklered as 
c()ntiadiclf>ry lo the icstfd hypoihrsis when the m<Mlel is m<Hlihed lo allow for a lagged 
dfJjuslrnt'/U oi tonsuJTiplifjn U) d <ha»go in pt*rniari«"fH intornc*. 1 he iinplk i( d.ssu/nption 
IS ihdi, given ihe random walk naiure of <<)iporale stexk values, changes in them arc* 
expected lo fx* correlated with changes in permanent income. 

ttf Pntuual him4*my. 1981. vtti 89, rv» 3| 

1981 by IjiHrrsiiy .if (.hi.a*go. IXr22'3H(«/HI11 SO 
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1 he data are taken from katnomic frends—Anmud Supplement 1980, 
and, following Hall's example, variables were, where appropriate, 
measured per capita and deflated by the implicit price deflator of 
nondurable consumption expenditure, the latter being the measure 
of consumption adopted in the original study. Since the test prcKe- 
dure involved searching for variables which could supplement the 
predictive power of ecjuation (1), a large number of regression results 
were generated which we do not report below; we simply note that 
unemployment, the inflation rate, and the rate of interest, all of which 
have been proposed on occasion as determinants of C,, did not con¬ 
tribute significantly to the explanatory power of a regression ecjua¬ 
tion containing ('u-i- 

Our substantive result is that regression specifications which in¬ 
clude lagged values of disposable income or liejuid assets, or the 
period-to-jteriod changes in these variables, and regressii^ns which 
include cctnsumpiion expenditures lagged by more than one period 
can sigtiifkantly ouipeiform the simpler spec ification in which con¬ 
sumption expenditure is predicted only by its own lagged value.* 
I hus sotne doubt is cast oti both the validity of the j>ure life cycle- 
pertnanent income hypothesis, in the case of the United Kingdom, 
and the policy prescriptions that might follow from an accej^tance of 
the hypothesis. 

It tan be seen in the Appendix that both consumption exjjenditure 
lagged more than once and lagged income appear with coefficients 
which are sigtiilicanily dillerent from zero. Furthermc^rc, for the 
shorter period 19(54- 78 (or whic h data on licjuid assets are available, 
c hanges in the latter in the recent past seem to contribute signifit anily 
to the exjilanatcn y jjower of the regression. In all cases the size of the 
f i-statistic suggests that the h\ pothesis that the additional variables do 
not contribute to the legression should be lejecied.* 

Finally, repeated estimation for diffetent suhjseiiods of the sinij^le 
regression of on ; indicates that the coeffuieni on the latter 


* the iclcv.ciHc <)l lu|iii<l 111 cxiil.iiiiiiic; omsumcr',' cv[kmkIiuiu' Ii,w liecn 

sugj;eMc-<l ill a iiuiiiIk-i of ific iit slu<lk-x; sec. c.j{ . tcmiicncl IU 7 () 

^ When to obceiv.Kioiw ol e.ufi .s.iiii|)le .lie lel.iiiieit toi a j«>slsani|>le par.aiiieiec 
Maliility lest of die alternative ,s(>e(ifii,itioiis, the (D.iiidsoii ei .il. 1978, p. 67-1) and 
Chow's/-'-teM suggest that in .ill i.ises the h\|>i)these.sof |josi.s.iiii[)le [>.ii.iiiieiei sl.ihiliic 
.mil of no striii liiral cli.iiige should not he leieiteil. What is more impori.aiil. however, 
for the pre.seiit study is that ihe speiitii.iiioii which allows lor the iiiHueiiie ol ih.iiiges 
111 Ikfuid a.s.sels .seeiri.s to pioiide more anirr.ile lorec.ists than liie pure life cycle— 
Jiei iiidiicaii iiit'oiiie hypotheses, Ihe siijK’iioruv ol the .literiialive sjrei ili<,uioii w .ilso 
uiaiiitaiiicxl when siahihly is leslt'd lor |x>st-l972 ii.u.i—,i c'ritu .il jx*i iixl for cx oiiciiHei¬ 
ne foreeasliiig, coiniidnig with .i dnimatie iiicic-ase in the II K. s.iviiig latio ttf.cci- 
liiiiaihcou 1979). Foi this [xuixt, however, the v.iliie of x* suggesiv that the hvjxiihcsis 
ol poslsamplc parameter slahthly should lx* rc'icxted foi both s[x*eihialions while the 
/■'■value is still not siaiislii.illy sigiiituaiii. 
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variable is unstable. More specifieally, as can be seen from the follow¬ 
ing values of X, the greater the time span of the sample, moving 
backward toward the less recent past, the higher the level of the 
coefficient; (i) 1973(1)-H)78(4), 1.00125; (ii) 1964(1)-1978(4), 

1.00374; (iii) 1956(1)-1978(4), 1.00413; (iv) 1964(1)-1972(4), 

1.00483; (v) 1956(1)-1972(4), 1.00497 (C^how test: iii and v,/'’(24,66) 
— 3.14; ii and iv, F(24,34) = 2.21). In the context of a more con¬ 
ventional consumption function this is not surprising, given the 
marked downward trend of the average propensity to consume in the 
United Kingdom, especially in the 1970s. I'his observation, however, 
does nor .seem to lend support to Hall’s statement that constancy of A 
is expected to "be a good approximation, at least over a decade or 
two" (Hall 1978, p. 975). 

In conclusion, the present evidence .seems to suggest rejection of 
the pure life c)clc-permanent income hypoibesis in the U.K. case. 
Furthermore, the latter cannot explain the rcvealefl diffcretices be- 
tweeti the two countries. It may be tempting to suggest that Finglish 
people arc less "rational" than Americans, but such a pronouncement 
would certainly need an explanation. 

What is probably needed is a more genetal theory of the consump- 
tioti decision which encompasses the different experience of the two 
coittiiries as special ca.ses. It may be that binding financial constraints, 
th.tnges in the age stritcture and the distribution of income, as well as 
chatiges iti interest rates and the composition of wealth, are tco 
itriportaitl to be ignored. I 'he importance of some of these factors has, 
alter all, been recogtii/ed, either explicitly or implicitly, by the life 
cycle atid/or jjcrtnanetn iticome hyjtoihescs. 


Appendix 

Dependent Variable is C 

Ih-'iHd)-1978(4), ,V = 91: 

l.OOtlSt;, ,, (Al) 

(1055) 

= 0.994, 8 = 2.103, SSR = 398.2. 

0.819t,,., + ().2197t;,_j + 0.12(>1 f, , - 0.14921^2, 

(7.9) (2.1) (2.9) (3.6) (Al.l) 

«= = 0.995, .8=1.986, SSR = 343.1, F|(3,87) = 4.66. 

1964(1)-1978(4), A' = .59; 


1.00374(.-,_, 

(780) 


(A2) 



CONKIRMATIONS AND CON I RADICTIONS 

= 0.971, S = 2.441, SSR = 346.0. 

0.497 K.',, + 0.23.5.3r;,..i + 0.273^,., 

(4.1) (1.67) (2.3) 

+ 0.089.'i4K,-, + O.H!SA/.A,^,, 

(1.92) (5.2) 

= 0.9884, S= 1.881, SSR =191.0, f',(4,54) 

0.604CV, + 0.442r:, 2 + 0.0831 
(4.8) (3.6) (1.60) 

- 0.10,5)Vi + 0Am6LA, , - 0.0926/yl,_2, (A2.2) 

(1.79) (3.8) (4.0) 

«2 = 0.988, 8=1.96, SSR = 204, /•■,(5,53) = 7.38. 

Numbers in parentheses aret-statislirs; S = standard error of estimate. .SSR = 
sum of the s(]uarcs of the residuals, and /■', is a lest for the signifitanre of 
additional paiameters; Al',., ~ Y,., ~ Y, j and A/..4,_i = — LA,.^' ~ 

scletled li(|uid a.ssets of the personal set tor as given in Ftnannal Stathlirs. 


(A2.1) 

= 10.96. 
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Miscellany 


Why There Are No Risk Preferrers 

Somewhat more than 100 years ago, Smith’s celebrated diamond- 
water paradox was solved by the observation that however valuable 
water might be to someone who had none, if it was produced and sold 
in unlimited quantities at a price p consumers would adjust their rate 
of consumption so that its marginal value to them was at most p. The 
same argtiment implies that behavior under uncertainty depends not 
merely on the actor's utility function but also on the cost of producing 
(and absoibing) risk. However fond I may be of uncertainty, 1 will not 
buv it at a high price (by choosing a profession whose returns have 
high variatice but low expected value) when 1 can get it at a low one (in 
I.as Vegas). 

Risk can be produced inexpensively; existing .sellers in Las Vegas 
and elsewhere charge only a few percent of the expected value of the 
gamhle, part of which must be regarded as payment not for the risk 
itself hut for the palatial sitrroundings in which it is produced. Hence, 
absent special circumstances (such as a government monopoly of risk 
production), we may expect individuals in etpiilibrium to exhibit at 
most a small degrt'e of task preference, for the same reason that they 
exhibit at most a small marginal utility for water. 

Insurance, unlike risk, is inherently costly to produce because of the 
problems ol moral hazard. Moral hazard is a major cost of insurance 
hut a minor cost of gambling because a gambling game, being created 
for the purpo.se of producing risk, can be designed to almost elimi¬ 
nate the ability of ]3articipaiits to spend resources on iiiHuencing the 
outcome. I'he .seller of insurance laces a preexisting situation in which 
it is likely that the buyer (of, say, unemployment insurance) can and 
will affect the odds by his actions. Hence insurance is costly to pro¬ 
duce and individuals in equilibrium may exhibit a substantial degiee 
of risk avoidance. 


Virginia Folytechnit Imtilute 
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Book Reviews 


The Ref^tlatory Process and Labor Earttin^s. By Ronai.d G. 

New York: Aca(leini< Press, I97‘>. Pp. xx + 204. $15 00. 

The irony is lhai tins snidy was noi published in lire fled journal of Eionotnus. 
Khretiber}{'s argument is that the wage rates of workers in ihc New V'ork 
'[ elephone Gonipany arc above those ol other etjuallv (ptalified workers anti 
that theiefore the New York Pitblie Service Goininission should not allow 
lurtber wage itureases to be passetl oti as higher telephone rates. As I read 
through his argument 1 wondered how Khrenberg would have treated a 
ictpiest to cover increased ptiblualion costs by the eilitoi's ol ihc flell journal. 
A related tactic did not escape tiotice by the leaders of the (.lomiTiunieation 
Wotkers of Aitierita (GWA), however, who were on the olhei side of the 
tegulatorv proceeding in which Khreiibeig participated. In addition to im¬ 
plying that Khrenberg w.isa incie hired gun for Alfred Kahn (then chairman 
of the New York State Piihltc .Service CommisMoii), accoiding to Khrenberg, 
". . . |>olttical pressure was put on niy ctnployer, the New York .Stale School of 
Industrial and l.abor Relations (ll.R) at Gornell University, by the New York 
Slate AKI.-GIO, which cancelled a sc hedtiled ccnilereiue at Gornell, iinphcilly 
suggesting that it would he wise i( ILR laculty did not get involved in similar 
(ases 111 the future " (pp. 147—IS). I he hero ol this |>art ol the story was 
.ipparenlly Bob McKersie (then dean ol the ll.R sc bool but now at .Ml I ) who 
refused to lake iiisiilulional responsibility lor an individual laculty ineiiibers 
lesearch coiulusions. 

I think most econornisis will lind this a lascinating sludv In lact, 1 would 
urge any ecotioniisl IcKiking for a good read while on holiday to take this 
(relatively inexpensive) Ixxik along. I look it and a preity gextd paperback tor 
a week on Martha's Vinevarc! (spending a whole suminei theie is apparently a 
pieiecjui.siie tor becoming prc'stcleiil ol the .AKA) and actually hnished 
KhrenlK-rg's Iwiok first. Kor one thing, it is well written. Regtession ecjuations 
and the like are kept in an appendix. .As a conseciiience the book would make 
excellent supplenienlary reading lor graduate oi undergiaduate coiiises in 
lalior economics, industrial organi/aiioit. public policy, oi applied econoiiiet- 
I its. 

Kor another thing, the Ixiok tells a coherent sioi v alxiul .i coniplete po iiy 
orienictl research projcxl Iroin start to liiitsh, I he 1971) inception ol Khren- 
Urg's study resulted Irom an iinpiecedeiiled rec|Uesl h\ the New >"rk 
1 elephone Gonipany lor a rale incrc-a.se lo tover liilure, vet niinegotiated. 


l/nurnci/ of /’.tfliutffrv, IVtHI. vo| HQ. lio 

C) b> Ihc oi ( hk jgo 
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increases in wages anti benefils. The conclusion of ihe study comes a year 
later when the I’ublit Servite Onninission declines to accept Ehrenberg’s logic 
and therefore finds his empirical lalculations inappropriate to the decision at 
hantl. In the meantime, however, the union-management negotiations had 
been tompleled undei the shadow of a possibly adverse decision on cost 
pass-through, and, 1 would bet, this was just the purpose for which Ehren- 
berg's study was initiated by the Public Service (Commission staff in the first 
plate. 

Ehrenbetg's study is also coherent in another dimension. It reffecls a 
remarkable breadth of substantive argument over a range of subjects cover¬ 
ing lalaor law, the delenninants of unioni/alion and bargaining structure, as 
well as some very convnuing empirical economics. For labor economists there 
are some s|)e<ial treats. Ehrenberg tells us that he btlled the Public Service 
(’.otnmtssion staff for4f) daysof tonsulting lime (at an undisclosed rate), while 
National Etonoinit Research Associates billed New York rdephone lor 87 
days of consulting time (at an undisclosed, but higher, rate). 1 am not sure 
wh.'it this is supposed to imply for public polity, but it does rai.se a larger issue. 
As the real academic incotnes of economists have fallen throughout the last 
tietade (1 recently taltitlated that jiinitir faculty taking jobs at tny university 
this tear will reteive real salaries 22 pertent below the starting salaries 12 
ycais agti), their efforts ftave naturally turned towaicl the more praclital and 
remiiner.iiive. One cannot help wondering what effect this tretul will have on 
the profession's research output as this income stpiee/e intensifies during the 
detatle of falling college enrollments that we are enteritig 

file core of Ehrenberg's study is his empirical toinpanson of wage rates 
and fringe benefits at New York I'elephone with tfie wage rates of other 
workers. Ehrenberg deltberately chooses “other workers" to be olhei noniin- 
loit workers with similar skills, a decision the Public .Service ('.ommission 
ultiinatelv found inappropriate. Wage compan.sons are first made using sev¬ 
eral surseys from the Bureau of laibor .Statistics and other sources. .Since 
worker fiuiitan capital is difficult to control in these comparisons, standard 
wage legie.s.sioti.s from the l/IOO .sample of the 1970 ()cnsti.s data for New 
Yoik .State ate also computed. The lele|>hone company is so large an em- 
jiloyer, and New York so large a state, that this procedure produces reliitively 
precise wage-differential estimates. Taken together, I found Ehienberg’s 
empirical case convincing, especially when bolstered with the exiraortlinaiily 
low relative cjiiit rates at New York 'felephone. In stim, I think there is little 
doubt that wage rates at New York Telephone had by 1976 been increased 
above market-clearing levels and that many of the employees of this c cimpany 
weie earning eccmomic rents. Thei|ueMion that naturally ari.ses, iheielore, is 
what attitude the New York Public Sc-rvice ('.ommission should have taken 
toward this stylized fact on the occasion of a rate-increase ret|uest. 

The acimmisirafive law judge in this regulatory procc*erlmg recommended 
that this "fad " be tgnoreef because to do orhcrwi.se would be to interfere in the 
collective-bargaining process in violation of federal labor law. Thus, the ad¬ 
ministrative law judge concluded that whether to use Ehrenberg’s wage com¬ 
parisons was mainly a "legal i.ssue.” As anyone who has consulted with lawy ers 
will understand, a "legal is.sue" is primarily one lor which there is no accepted 
aiiswet. In fact, Ehrenberg shows that as a matter of law the administrative' 
law judge was simply incorrect on this jKiint and the Public Service Toinmis- 
sion cone lucled similarly. Despite the commission’s cone Itision that disallowing 
excessive labor-cost increases was legally permi.ssible, however, the commis¬ 
sion explicitly rejected Ehrenberg’s wage comparisons as inappropriate. 



BOOK REVIEWS 


603 

In effect, Khrcnberj^ ci>mpare(I the average wage rale in a unionized 
monopoly firm against the average of all other wage rates, union and non¬ 
union. Since the majority of all workers are nonunion, this is primarily a 
comparison of the wage rales of unionized workers in a monopoly lirjn 
against the wage rales of nonunion workers. The commission argued that the 
average wage rate of the workers at New York Telephone should have been 
compared against the average wage rate of other unionized workers. Since 
there is an empirically e.slablished overall union/nonunion wage differential, 
this comparison w<juld essentially have uncovered whether the union/ 
nonunion wage dilferential was systematically greater in a particular 
regulated-monopoly industry. If this had been the case, the commis.sion 
appatently would have been prepared to disallow further labor-cost increases 
as within the scope of its mandate to regulate monopoly power but not union 
power. 

Khrcnberg really offers no convincing argument against the commission's 
position. Moreover, although he does not offer the necessary empirical work 
to test the appropnate factual issues, I suspect that such a study would not 
demonstrate that union/nonunion wage differentials were higher among 
unionized emphiyees of monopoly firms than among other unionized work¬ 
ers. Hisiorually, for example, unioii/nonumon wage differentials have been 
highest among constriKiioii workers and mine workers, both relatively com¬ 
petitive industries. 

The difficulty here is that the commission's position essentially takes the 
presence of uniomzatton as exogenous and asks merely whether the presence 
ol regulated mono|)oly exacerbates union wage pressures, d'his position ig¬ 
nores the really iinpotiaiit eptesuon of whether an industry might not have 
become unionized /iciwio' it isa regulated mon())>oly. In this rase the presence 
ol the union is endogenous to the regulatory process, and the appropriate 
wage compari-son is akin to the one Khrenberg offers, i he anecdotal evidence 
he preseius ol the evolution of the Bell System bargaining structure as well as 
the well-known anecdotal evidence regarding the trucking and railroad in¬ 
dustries and the liistory ol the Davis-Bacon Act in coniraci consirui non all 
support this iiiierpieiaiion. .As Khrenberg's experience deaily demonstrates, 
however, making this case before a regulatory commission is likely to be a 
thankless task, and perhaps nol very remunerative cither. 

ORI hS AsHF.NtKLIER 

I'niKi'Um UiiivrrsUy 


I dw, l.rfpslaHon ami l.thfilw ,1 Stall MU’nt aj thf l.thiral Principles oj J nslicc ami 
Pahtical l'.famiiii\. Vol. ‘i: 'I he Palilieal Onlcr a) a Pici' People. By KRitDRti.ii A. 
Hassk 

(Chicago: University of (ihuago Press, I97‘) Pp. 243. $14.00; London and 
Henley , Routlcxlge & Kcgan PanI, 11)79. Pp. 243. £5.95. 

Bill to avoid this (destroying our civilizationl we must shed the 
illusion that we can deliheiaic'ly create the future of mankind. 

This is the hnal coiuhisioii ol the forty years which I have 
cfevoted. . . . 

1 want to lhank kaj Areskoiig, Fritz Maihlup, fngo Wallet, aiicf lee Wohlfeit lor 
lommeiits on an earlier draft of this icview. 
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These dramatic words end ihc third and last volume of L«re, Lef^slalion and 
Liberty: A New Statement of the Liberal Principles of justice and Politiial Ltonomy 
(hereafter cited asLZX) by Nobel laureate Friedrich A. Hayek. The recently 
published volume. The Political Order of a Free People, analyzes ihe undermin¬ 
ing of democratic principles and liberty in parliamentary deniotracies and 
(ont hides with an outline of a new constitutional framework. 

The message of Hayek's voluminous work since World War 11' is, to put it 
briefly, that the current political system of Western democracies is btiund to 
taiise increasing coeii ion of individuals by governments, private groups, and 
firms, and that governinetilal decisions will increasingly be conirary to the 
opinions of the majority. He argues in volume 3 that the unlimited powers 
(sovereignty) of govei nmenis—combined with the belief and expectation that 
goveininetits can direct production, coiisurnplioii, and the distribution of 
income to maximize stxial welfare—are the necessary and sufficient condi¬ 
tions for this development. I'he constitutional proposal is designed to subject 
the gosernment to substantial constraints in its use of coercive powers. 

The previous two vohiines of/././. (Hayek 1973. I976h) provide the foiin- 
daticm for the analyses of the political process in volume 3. but the last can be 
read independently. Knle\ and Order analyzes the lole of rules and law in a 
"f ree " society—one in which no individual can be coerced to act in accordance 
W’l til th e will of otfiers. Law must rc|>rcsent general principles of just conduct 
on wide h there is common agreement. Coercion must lx‘ used only to enforce 
sue h genet al principles applicable to all individuals. The Mitage of Sotial Justice 
discusses the limits of reason and rationality in sotial design and the im|)o,ssi- 
biliiy of defining social )ustite. Some arguments in these two volumes will be 
referred to below to clarify the discussion of volume 3. 

The “Bargaining” Democracy and Logrolling 

Hayek uses the concept of the bargaining democracy to de,stribc a system of 
re()reseniativc assemblies within which diflerent interest groups can trade in 
vote support for each other's proposals. In the theory of public finance, this is 
called logrolling" and is generally regardexi as an improvement in the demo- 
c ralic process because 1 he strength of preferences can be allcswcd 10 af tec l the 
oulcome Hayek, on the other hand, argues that the trade in vote supiiort 
tontribuics to results that do not reflect the opinions of the majority. 1 hese 
views are clearly contradictory, yet the abstract nature of I layek’s analysis of 
the democratic process makes a coinpatison with conventional public finance 
difficult. However, Hayek's criticism of trade m vole support within repre- 
sentaiive assemblies is directed at the wide range of issues on which such 
practices arc possible rather than at the principle of logrolling itself Ihe 
argument is derived from the distinclion between votes on general rules ot 
just conduct applicable to all citizens (i.c., legislalton proper, such as private 
and criminal law) and votes on the distribiiiion of payinents and benefits 
among individuals and groups in sexiety. 1 he latter include all voles on taxes 
and transfer payments, ullcxatiou decisions with income redistribution impli¬ 
cations. and ''directives'' that discriminate in favor of ceilain groups. I he 
repiesentativc as.semblies in all dcincxracies do in fact practice logrolling on 
both types of isstics. It is, thetelorc, possible that the distribution of benefits 

' 1 his work includes such classics as I'he Rocul to Serfdom (1944) and I he Constitution of 
Liberty (I960). 

" .See, e.g , TuIIck Ic 1969. 
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on which a majority decide at liie same lime violates general rules on which 
the same majority agree. For example, a principle of uxation can be ac¬ 
cepted while special benefits (e.g., laiifts and subsidies) with implicit positive 
(»r tiegativc tax equivalents are pa.sscd by the same a.sseiTibly. violating the 
agreed-on principle. It appears that the theory of logrolling is valid lor votes 
on the distribution of benehisand paymentson/y when no general agreed-on 
principle is applicable. (July pure collective goods are likely to satisfy this 
condition when a principle of taxation has been laid down, iiecaiise inclividii- 
al.s tan be a.ssigned lax e(|uivaleiils lor other puhlicly provided services and 
benefits, 

I he coercive powers of governments to benefit some groups at the expense 
of others, in .addition to enforcing generally agreed-on principles, is the 
source of what Hayek calls “the miscarriage of the demociatic ideal" (p. 98), 
These powcis make it possible and advantageous to trade in specific benehts 
as well as to trade in lienefils lor sup|)ort of general principles. .Moreover, 
well-orgaiii/ed interests are likely to gain dispro|)ortionalt' weights in the 
democratic priKess—-“there is no iiiiiit to the blackmail to whit h governments 
will Ire subject" when "no superior pidiciarv anihority can pre vent ihe legis¬ 
lature Ironi gianting pnvileges to pailicular groups" (p. I 1).‘ Ihe final 
outcome of the deniocranc process need not, then, correspond to anihodv's 
opinion on what is right or to the will ol the majority.' Hayek argues, in fact, 
that thi,s concept am he siatc>d only in ic-rms ol geneial 1 ities and ptmciples.'’ 
Behavioral incentives ol the mles are lost and the vmH of ihe majorily caiinol 
he given coiilenls when representatives decideon geneial rules and on 
acljitslments to the outcome ol inclivichials' acinilics that ate performed 
subjecI to these itiles, 

A Model Constitution 

Hayek outlines a ccmstnuiioii designed IkiiIi to make die ouicome ol the 
deiiiocialic piocess coincide with principles held bv the majorils and icr 
mininii/.e the degree ol coeiiioii in society.'’ Ihe piojrosal is vers simple: 
nisiinguish between legiskiiiuii propel and dtxisions on govcnimem spend¬ 
ing, aclnunisliation, and legulalioii by sepaialiiig the two Inni lions into two 
distinct asseinhlies elected by entirc-ly dilleieiit procecluies. .Make the gov- 
einmenl body subject to the geneial rules ol condncl decided uiroii hv the 

' suggeslioii like Kriecliii.in .end Filedin.iii's (19811) to Icgailt jirohibii die u.‘)t' of 
laiilfs ten the beiielil ol spc-c lal-iiileresi gioujis can oiniously ncil get to ilie loot of the 
jitohlciii ol the deiiicx r.u v in llayek's vic-w .is long as die govei nineiil has ihe coei cise 
power lo use a inullilude of alternalive means of > ri-aiing special hciielits loi ihc same 
gioujrs 

‘ Note ifial Havek rcgaids die slate ol dcmcKiacv in die L’liited States as ihic-atenc'd 
as arc the parliaiiienlaiy clenicxi.uies m Europe Ix’cause die judicial\ branch of the 
.Aincikaii goieiiiinem docs geiiei.ilh not subjec I dexisions of ( aingicss to a lest against 
geneial |n incijiles—lliougli this iii.iv have'tx-en ihe iiiieiinon ol the Foundiiig Fathers 

' This argiiineiil is cleseliipeci in col 2 ol /,/./, 

* Frilii Mac hluji has jxiinied onl 10 me that one could argue lhai I lavek s profxisal for 
constiluiioiiai reform coiitradic Is his own views on the crc'alioii ol lumiaii institciticins 
Willie Hayek argues that liuinan msliiiiiioiis c aiiiioi lx* designexi nilionallv and delilier- 
ately for the heiielit of mankind, he afso argues that they cannot develop spciiuaiieously 
in accordance with jxxiplc's prelerences uiilcvs the conditions for such developments 
exist. The proposal cousiilutes a delilx-rate design lor the puipo.se of creating such 
< ondiiions. 
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legislative assembly. Hayek’s nuxici coiistilulion also tontains a general decla¬ 
ration of rights and an imfioriaiu definition of what tjualifies as law—a 
gctieral rule of just condticl. A eoiisiilulional court would be established to 
test the appropriateness of the legislaltire’s decisions against such a definition. 

I he legislative assetnhly would be res|>onsible for the Ixtdy of criminal and 
private law, the principles of taxation, general regulations for safety and 
health, rules to secure competitive markets, corporate law, and the like. 1 he 
coercive powers of governinenis would be limited to the enforteinenl of these 
general rules and principles. 

1 be government assemhly would decide on the use of material and human 
resources eiiti listed to the public set tor. The si/e and the general purposes of 
expenditures would lie limited only iiidireclly by the general rules of conduct 
set down bv the legislative assembly and by people’s willingness to pay taxes. 
I he general principles of taxation decided upon in the legislative assembly 
would make citi/.ens aware of their share in payments for specific services. 
1 his would prevent the currt'nl practice of disguising lax burdens to "make 
itiose who will ultimately have to bear it (the burdenj as little avvaie of it as 
possible ” (p. 127). 

Would the proixised constitution achieve its puiposcs? A critical issue 
seems to he wfictfier the legislative a.ssembly can be pi evented from instituting 
laws favoring large gtoups or wealthy iiiterests. Hayck presents a number of 
suggestions in this regard. Mote important, however, is the definition of "law’’ 
and the role of the constitutional couit in evaluating the constitutionality of 
legislation against this dehniiion. Hayek characteri/es (p. lOf)) a law as nega¬ 
tive in the sense that it does not "atm at at hieving concrete purpo.ses,’’ it must 
be iipplitahle to an "indefinite number of unknown future instances.”’ and it 
must ext hide 'all provisions inieiicletl or known to aff et 1 print ipallv panic u- 
hir idemifi.ihle individuals or groups." This definition would appatently rule 
out legtslalion covering such issues as minimum wages and price controls. 

1 lowever, the Ixmiidary between tneasuresto improve inlortnation spreading 
in tfie market and tfie protection of particular linns or groups is sometimes 
extremelv vague, as in the ca.v of legislation on health standards, cemsumer 
protection, and occ up.itional safely It is therefore easy to imagine situations 
m which the members of the const it u 110 nal court assume a critical role. The 
expenente with tfie U..S Supreme Coiiil indicates tliat the existence of .1 
consiilutional court does not necessarily providea complete safeguard against 
legislation that favors large groups m society. A simple addition to Hayek’s 
proposal could help achieving his purpose—Wicksell’s old suggestion 
(Wic ksc'll ISyti) of cpialified inajoriliesand relative unaimnily. A principle that 
legislation must be passed by, for example, 9(1 jieicfiil of the assembly hardly 
seems olijec tionable since decisions on general rules must reflect the stan¬ 
dards of most citi/ens to be respected iiiicl ju.slify coercion in enforcement. 

VV'heie does Hayek’s proposal lead? Debate on it cannot be carried out 
without reference to more specific results. .Since the author provides only 
general and abstract clues, 1 will here attempt to illtisiiaie some potential 
consequences by relating it to a lew economic political issues. 

1 he inflalionarv bias of current deincx racies has been a theme in Hayek’s 
earlier writings and is cleatly linked to the subject of this volume since 
inflation is one way lor governments to disguise the true tax burden, file 
independence of the central bank from the government seems guaranteed 
under the proposed constitution because directives covering the central bank 
will have to take the form of general rules unrelated to specific jioliiical and 
economic circumstances. I'his provides a check on the monettzalion of gov- 
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erninenl fiscal dcfirils.^ liiHalionaiy pressures euiilfl also be reduced under 
the proposed tonsiilulion because the government would be unable to Itear 
the costs ol unemployment and business failures. Most market inlet veniions 
in the form of subsidies to labor or capital are likely to violate the geneial 
rules by which the government must abide. Ihis could contribute to higher 
wage and price Hexihilily." I‘ttrthermore. governments must consider its ex¬ 
penditures more carefully when the cone.sponding tax burden for each 
individual has been established by the Icgi.slature." 

A more controversial consequence of the proposed tonstilution would be 
that developments in production, consumption, and income distribution to 
which a majority object have to be accepted unless a general tule has in fact 
been violated. However, the outcome ol economic activities may shed new 
light on a particular principle and lead to a change therein. The extremely 
far-reaching implications of this can lie illustrated with the examples of 
immigration laws and ei|uuy-orienied (xilicies. 

Immigration restiictions appear to be unconstitutional, as do larifts or 
import cjuotas, under Hayek’s proposal. Sutb policy measures are clearly 
directed at idenliliable individuals 10 achieve concrete purposes—economic 
benefits lor national residenl.s and Hrms. Opening up the Western indus- 
iriali/ed economies to unlitniied iininigratioii would lead to a scxial transfor¬ 
mation of enormous proportions. Most people would probably find the out¬ 
come unacceptable. Ncvciihele.ss, they would have to accept immigration as 
the "will of the majciniy” unless a general principle could be found to prevent 
immigration without thwarting other desirable outcomes, siirh as imports of 
competitive foreign goods. 

Most policies aiinecl at securing certain letels of incomes for individuals or 
groups also have to be given up by governments that must abide by general 
principles. Disc rimination is ncces.sary to guarantee the outcome of economic 
actitiiies periaimiig to an individual. 1 his is ilhistiated in volume 2 of/,/./, by 
a revealing and shatply satirical analysis ol thcconiciiisol the United Nations 
I 'nivi'nal Ded/nahun 0 / Huiiuin Higlih. I his declaration contains jxisitive rights 
(to outcomes) as well as traditional civil rights (to equal treatment before the 
law as deliiied by geneial rules ol conduct). The two kinds of rights are simplv 
contradictory. Fit her governments treat unequal people crqually, abiding by 
generally accepted rules, or ihev treat unet)ual people uneqiiaiK to .secure 
cei'iaiii outcomes. 1 he only available (Kility insiruiiicnt aimed at securing a 
cfilain income distribution would be the general t.cx structure. I'luis. a 
negative income lax would have to Ire subsiinned for large parts ol the social 
.scxuriiy .system. 

I lie examples abene suffice to show that the consiitiiiional proposal is far 
from polilicallv fea.sible in the loresceable future. 1 here is probabK no 
significant political gioup in any clenuxracy that bases its political targets on 

’ Hayck earlier suggested the denationalization ol the right to issue money (Hayek 
I976n). Though a possibility, this suggestion is not part of the proposal here. 

"■'.Social responsibility" imisl lie the major check on wage and cost increases in an 
economy m which the goveinmeiil guarantees emplovment independem ot the wage/ 
co.st stiucturc (cf. Wihlixji'g 1978). 

" llayt'k's projxisal can here be compared with die cuitciu discussion abcnii a con¬ 
stitutional aniendnieiit prohibiting an unbalanced government budget. Just as tariffs 
cannot reduce the political pressures to discriminate in favor of a particular industry, a 
omsiiiulional amcndiitem to balance ibe budget docs not reduce the political incentive 
to tax via iiiHaiion. Thus, the political prex^ess can be used, e g , to manipulate the 
definition of the government budget—rendering the amendment meaningless. 
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gentTiil principles rather than on desired ouieomes. Moreover, the proposal 
does not address the question of whether any outcome under a certain 
principle is acceptable. For example, is it not necessary to di.scriininate in order 
to secure survival of an individual who spends the negative income tax 
payments almost on receipt? Should hospital care be denied individuals who 
have chosen not to invest in medical insurance.^ Hayek has previously sug¬ 
gested ways of resolving these questions with a minimum of government 
coercion (Hayek 1960). It is not clear, however, that these solutions are 
commensurate with strict adherence to the constitutional principles outlined 
heie. Despite such questions, Hayek's proposal deserves serious consideration 
as the starting point lor lurther work by economists and social philo.sophcrs. 
A fundamental constitutional change may indeed be the only route to pre¬ 
vent further erosion of democratic values and individual liberty. Hayek has 
argued—convincingly, in my view—that the coercive powers given to gov¬ 
ernments, in the belief that progress can and should be planned and directed, 
are the cause ol a sell-generating prexess leading to the destruction ol the 
decentralii'ed market economies in which a high degree of individual liberty 
remains feasible. 

Hayek's constitutional proposal implies cjuiie simply that we give up the 
idea of sleering and planning the future direction of social and economic 
activities. Instead, we should ask how we can set up a systein of general 
principles under which mankind’s knowledge can ensure progress for the 
maximum benefit of all. In Hayek’s words: “To pretend to know the desirable 
direction ol progress seems to me to be the extreme of hubris. . . . All we can 
do IS tc3 create favoiable conditions for it [progress] and then hope for the 
best ’ (p. 169). 

Cl.AS Wllll BOKC, 

■W'.c' F(«A ('iimcmly 
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Studies in the luoninnus of Searih. Kdiied bv S'lfVKN A, I.ii’PMan and Iohn | 

MtXAi.i,. 

Amsterdam: North-Holland rublisliinfr Co., 1979. Pp. 225. $41.50 (iloih). 

As a sequel to their extelleni survey of the job search literature, Lippman 
and McCall have organized an impressive collection of new papers that 
materially enrich the subject. I he rolleclitm includes original contributions by 
nine young authors on topics tliat range from the esiimatton of the elastii ity 
of unemployment duration with respect to unemployment compensation to a 
theoretical demonstration that searching lor a bargain is the real world 
substitute for Wahass auctioneer. 1 he individual essays are of uniformly 
fiigh (jtiality as are the editing and production of the volume. 

According to the editors, each paper is motivated by a clesiie eillier to lest 
the empirical implications of the so-called “standard" search model or to 
extend its structure for the pur|)ose of deriving a richer menu of behavioral 
iinplicalions. Regarding these motives as noble', 1 have attempted to point out 
the sense in which and the exic'iil to whic h each author has been led by them. 
Since the essays are related to the standard model in i he inverse order cif their 
presetitation in the volume, my comments read from back to front. 

I he standard model in a lalior-niarkel cemlext supposes that an individual 
worker samples wage rales sec(uc'ntially from an urn of offeis with a known 
Irequency clistribution at a rale ol one per period. The worker wishes to 
maximize expected lifetime earnings net of the out-of-pocket and op|)ortiinity 
costs of search. As is well known, the opiinial sirategy is to accept the first 
offer in cxcc'ss of some resc'ivation wage. 1 hr- reservation wage ecjuaic-s the 
cost of taking one more s.imple to the expected gam aiiributahlc to another 
sample. Under reasonable conditions, ibo reservation wage is independent of 
search duration to date 

1 hat a ic'duclion in the cost of seaich yields an increase in the lescrvalion 
wage is a siiaighitorwatcl implication (iiven the standard model, an increase 
111 the reservation wage, in turn, implies an increase in the expected lime 
recjuirecl to find an acceptable wage and an increase in cxiveeted earnings. 
Because uiiemplcivmeiil compeiisauon must be lorgoiic when an otfer is 
accepted, it ;uis as a subsiclv to search in the standard model 

Kdlhleeii Cllasseii tests fot tlic-se precliclcd eflects by estimating the 
rccluced-fcirm relaliotiship Ix-lween Ixith unemployment duialion and posi- 
unetnploymeiu eaniiiig and tmcinployuient compensation. Both the weekK 
benefit received aiicl the m.vxiimmi duialion of Ixnehts ate included .is 
right-liaiid-side variables, flic .iiialvsis vielcls a strong positive lelaiicmship 
between duration and Ixiib measures of nneiuploy ment i omix-nsalioii liber¬ 
ality but no statistically significaiit relalionsbip belwcen eventual earnmg and 
liberality. Although the second result can be explained .ivvav, it is also true 
that .illeriialive explanations exist lor a positive effec t on duration. 1 he paper 
has many interesting things to say. but as a test of search iheoiv, the results 
are intone lu.sive. 

One would prefer to lest tliretily for the implied relationship betvvecn the 
reservation wage and the tost of scare h A major problem in doing so is the 
nonofi.scrvabilily of the reseivalion-vvage rate. Nick Kieler and Cieoige 
Neumann reporl on an allempl to gel aroniicl this problem. I heir niclbod 
lakes advantage of the siriicliirc of the standard model. In the paper, thev 
show that data on accejJted wage rales and realized unemployment cluraiioiis 
c an lie used to estimate simultaneously the struc tural parameters of Ixiili the 
reservation-wage equaium and the piobabilily of finding acceptable employ- 
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mem function. AkhouKh their esiiiiiaie of the unemployment benefit effect is 
positive in the Hi st case and negative in the second as expected, the estimated 
effects of niaxtmum benefit duration have the “wrong" signs. In spite of this 
ambiguity, the paper represents a valuable contribution to the methodology 
of estimating sequential, discrete choice mmlels. 

The standard model “explains” unemployment as a rational respcinse to 
wage dispersion. Critics argue that this explanation is of little importance, 
because workers with long unemployment sfiells, those that account for most 
iinempio)ment, do not turn down offers. They simply cannot find a job. By 
allowing unemjiloyed workers to alknate time between .search effort and 
nonmarket activity, Ken Burdett has extended the relevance of the search 
theoretic framewciik even to the case in which there is no wage dispersion. In 
his model, a reduc tion in job availability on unemployment is amplified by the 
rational allotation of a worker’s time. 

In the paper jiresented in the volume, Burdett shows that search intensity 
intrea.ses, and the reservation wage falls wilh unemployment duration, as ihe 
unemployment-benefit pericKl is c'xhau.stecl. This theoretical result suggests 

I he existence of misspec ilication bias in both of the empirical papers disc ussed 
above The allocation of lime among search, work, and leisure by an em- 
ployc'd wotkei is also investigated. This case is a natural synthesis ofclassical 
laboi supply and search theory. The principal contribution of the paper is to 
incorporate search theoretic considerations into a more general dynamic 
theory ol labor supply. 

rile pajiers l>y Jeffrey Hall, Steven Lipjmian, ancf John McCall and by John 
Danforth study the conseejuetues ol relaxing the assumjition ol risk neii- 
liality. I’resuming decreasing absolute risk aveision, the implied relationship 
between the leservatioii wage and nonhiiman wealth is positive. Hence, the 
papeis tepreseni a rigorous derivation ol the long-standing contention that 
the luiinmg down of financial a.ssets explains the obsersed negative associa- 
lioii between the resersation wage and duration of iineinployment. As a 
technical matter. Hall et al. show that the reservation-wage |)ropei ty need not 
hold when past of lei s can be recalled. Nevertheless, the jjrobahiliiy per 
period ol lindmg acceptable employment decrea.ses with wealth in tins case as 
well, finally. Danloiill's toiimilalion lepreseiils a synthesis ol the iheoiy ol 
lionseliold weallli accumulation and die theory ol wage seaitli. As Daniorih 
points out, die llieoi y implies a picKc.ss of increasing wealth dispersion ovei 
lime if llie assniiijiiion of decreasing absolute risk aversion is cmjiincally 
valid. Hence, the paper is also a coiiliibution lo ihc llieory ol income dis- 

II ibiitioii. 

fhe sianclard search model characten/es the optimal strategy of a jjiice- 
takmg agent seeking a bargain. The market effect ol such behavior is a 
reallcKatioii ol goods and services toward deinandcrs who value them most 
highly. One might expect that such a proces.s would eventually equalize 
marginal valuaiions, that is. converge lo a compelilive equilibrium. Joseph 
Wharton formalizc.s this idea in (he coiilexi ol a market for homogenectus 
labor services composed of workers whcj search on ihcjob for a higher wage 
and employers who adjust wage offers in rcsjroii.se lo scare h-iiiduced changes 

III employmeiil. The process is modeled as a Markov chain. When the cost of 
sc'arch is "small. ' a cdmjjelitive allocation of workers ac ross firms is the only 
abscjrbing state. Although the analy.sis is insightful and the approach devel¬ 
oped is amenable to further development, the employer’s wage-.seiting Ire- 
havior is not shown to be rational in any sense. Other work on this problem 
suggests that strategic behavior on the part of the price-setting agents can 
thwart convergence to comjretitive equilibrium. 
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Search-induced convergence lo a single price raises ihe following paradox. 
How can the standard search model represent an important contribution to 
positive economics if the market effetl of the behavior explained eliminates 
the price dispersion required to observe the behavior? Roger Kormendi 
provides a rc.solution of this paradox by formulating a market model in a 
pure-exchange context that exhibits price dispersion in equilibrium. One of 
the original features of his model is that every agent plays a role in the price 
formation process. The demonstration that mutually consistent (Nash) price¬ 
setting strategies exist which define a market equilibrium is a major contri¬ 
bution of the paper. Although a detailed discussion of the kinds of conditions 
one needs for equilibrium price dispersion in Kormendi's model is beyond the 
.scope of this review, it is impcjrtanl to note that nonstorabilily of the two 
commodities traded is implicit. It this assumption were relaxed. Kormendi's 
equilibrium is only “tempotary” in the sense that, once allcxations have 
changed in response to trade, the agents have incentives to play the game 
again. One suspects that the repeated game might well converge eventually to 
a fompeiiiivc single price equilibrium. 

It is curious that the matching of job and worker is not explicitly recognized 
in the standard tormulalion of the individual’s problem of finding acceptable 
employment. Izniis Wilde contributes lo this deficiency by distinguishing 
between two Job cbaracicrisiics' the wage, observable on initial contact, and an 
aiinbiiie that must be experienced on the job. The analysis characterizes an 
individual worker's acceptance strategy assuming knowledge of the joint 
probability distribution over jobs. The result is a formalization of the "job- 
shopping'' liypciihesis lypitally offered as an explanation for high quit rales 
among the young. A useful extension of this model would also recognize the 
employer's problem of learning about a worker's produt tiviiy. 

In the final paper, Charles Stuart applies the standard search model to an 
entirely different subject, the spatial clustering of sellers of Ixilh the same and 
dilfercni commodities. .Specifically, he shows that market centers arc more 
atiraciive lo buyers than spatially isolated .sellers, because the cost ol search 
per unit is smaller and the perceived heterogeneity of the goods offered is 
larger. These sources ot agglomeration economics offset the effects of trans- 
portalion costs and competition that would otherwise induce dispersion in the 
localioii ot sales outlets. Although there are many insightful observations on 
the subject, the author docs not attempt lo derive an ei)uilibriuin solution lo 
ihe location problem m which all cd these forces balance. 1 his would seem to 
be a fruitful area lor tuilhcr rcseatch. 

In sum. there is much of value in the volume both to students of labor 
economics and lo those inlet esled in applying search theory in other areas of 
economic iiu|uiry. 

D.M t T. MoHlVNSt.N! 

Sarlhtivslcrri 1.'itivciMly 


■ t Ihriity of huome Di'.lniftiltoii. By Harcu o I.vuai.i. 

New York: Oxford University Press, 1079. Pp. vti + 326 $32 '>0. 

Presicssor l.ydall, oi South African birth, tcceived his higher education at 
Oxford. He has taught primarily in Britain and Australia, worked in inierna- 
iional agencies in Geneva, and also visited extensively in the United Slates. 
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His methodology recalls ihe Burns and Mitchell pragmatic tradition of quan¬ 
titative institutionalism. That is to say. Lydall begins by searching out statisti¬ 
cal regularities in his data and then seeks one or more theories to explain 
them, preferably on more than an ad hoc basis. He looks with some suspicion 
on the opposite procedure ol deducing a general theory first and then 
seeking out data to test it. 

Although the present volume is indc'ed more nearly iheorelical and less 
exclusively statistical than Lydall’s well-known Slmilure of Edrnrtijr'i (1968), its 
title may seem misleading to the conventional economist. I myself expec ted a 
new "high theory" (Shackle's phrase), or at least a tasteful ragout ol existing 
high iheor/cs. But after seven "background” chapters of a cIck irinal-historical 
nature (which explain w hy the author (c‘els it desirable to strike out along new 
paths), l.ydall's independent coiuribulion rcTreats more than half way to 
enijiini ism. 

Once inoculated against anticipations of high theory—wage funds, iron 
laws, tiiaigmal productivities, widows’ cruses, degrees of monopoly—the 
readei is impiessed In l.ydall’s si.ilisiical generali/alions, both in themselves 
and as c halleiigc-s tor integration mtev the rival high theories of our rival 
conventional wi.sdoms. 

l.ydall finds, for example, the following relation beiwc-en value added per 
woiker, which he calls n, and the average si/e ol the employing firm as 
measured by employment, v = BL"'' (fl > 0, jS > 1). A simil.ir relation 
holds (or the average wage, tc: tc = AL"~' (.4 > 0, 1 < o < )3), and lor one-man 
firms (OMF), A = H. The size distribution ol firms follows the Pareto law, with 
,V as the mimbei of (iimst.V = ItL''' {fi > O.p > I). (In thescihree e(|u.itions, 
both capital and (Ireek letteis are siatisiical parameters.) 

Ati ingenious algehtaic development (pj). 297, 298) combines these resulls 
into ;i iheorelical l.ibor share.S'„ of (p - /3)/(p — n), derived under conditions 
l.vdall (.ills F.PM (entry and pioduct market) comjjetilion In F.PM, di¬ 
minishing returns to stale need never hold as in convenlion.il pure competi¬ 
tion. but freeclom of enlry tor tiny interstitial OMF is an adct|uate check on 
ihe abuses of monojioly and oligopoly—when coupled, I should mysell add, 
with plausible ihreals ol regiilaiioii or socialization, (l.ydall sees this KPM- 
(yMf frainewoi k as among his major contributions.) fhe FPM c ompeniion is 
atom I SI it but imjierfect. It [lei mils, m jrarlit ular. impel let lions in know ledge, 
uiiulih/ed economics of scale, and lieterogeneous c apital, li is ccmsistenl even 
in the long run, l.ydall believes, with such phenomena .is in.irkiip pricing, 
prodiKliori uiidei iiureasiiig retiiriis, and jH-rsislenl difIcrenIials in rates of 
iftuin on tnveslmeiits of different soils. As to the im|X)ilance of these 
modilualions. my reaction is wait and see, influenced more by the (Xist- 
Chamberlin history of the "large group c.ise’’ of monopolistic tompelilion 
than by any analytical (law in F,PM-()MF or its applic alioiis. 

Two further iiiiiovatioiis in Lydall’s approach and results relate to the 
chiiamits of distiibiition and development, I hey exploit' the Ku/nets con¬ 
jee tuie that, in l.ydall’s woids (pj). 129, 130), “the degree ol inec|nalily is fati ly 
low m the [tooresi countries, inc reases with per capita income up to a cei tain 
level, and thereafter follows a continuous downward trend.” In a cross¬ 
country study, a l.ydall studenl ("F. Das) derived an empirical estimate of the 
determinatits of the Ciini concetitiation ratio, 6’, with dependent variables .V 
(income per capita), its reciprocal, and four Dj or “dummy” variables:' (! ■= 
0.784 - B.d-WlngX - 12.184X-' + 0.0.510, + 0.124Oj - 0.0330;, - 0.0070, 


' I interpret O, as the proportion of incoiiie recipients (lo total population), D.j as ihf 
proportion ol ecoiioinically active peisons, O 3 as the projioilion in the rural (01 
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Lydall provides later (pp. 2)9, 220) tiis own explanation of the Ku/nels 
phenomenon. He has clivl<lert each economy under study into a traditional 
set tor, 1. and a modern sector, 2. The average iiicjmic, K,. of each sector is 
related logarithmically to overad average incciiiie, Y: log = 1.4,57 + 0.725 
log K, log = 3.86 + 0.55 log K; we also have K = (I - p)y, + pV.,, with die 
proportion^ risingas development progresses. Lydall assumes income dislrib- 
uleci lognormally in each sector with a unifoim logarithmic standard devia¬ 
tion. Ffom these results attd a.s.suinpiion.s he derives by simulation (his table 
12.1) nctmerital results which not only conform to the Kuznels hy|X)thcsis but 
go beyond it in one impottatit respect. 4 he share of the lowest 20 percent of 
the population eventually turns up as development proceeds, which it did not 
apfiear to do in Ku/nets’s original data. 

The most noteworthy feature of I.ydaH’s discussion of personal income 
distnhution (chaps. 12- 14) is its conscious endeavor to redress the b.ilancc 
between competitors already in the field. As economists know, the human- 
capital tlieories of 1. W. .Schult/, (iary Becker, and others have risen to 
dominance over the past ijiiaiter ceniury l.ydall represents a reasoned and 
good-natured rebuttal in the ongoing debate. At the same time, he avoids 
what I should call the major error of )an linbergeii’s 197,5 clisiribuiion 
monograph, wine h is to ignore iioiilabor incomes entirely as ala\ islic residuals 
destined to depail the scene in the near liiiiire. 

I’ostwar distribution theory h.is been marked by a number ol efforts to 
bridge the gaps and tonstiuci at least pontoon biidges between clcciuciive 
(unt'lional tht'ones and em])nMal personal one's. I heseeflorls haveranged in 
Ainerica alone from the "luunan-capiiaf approach mentioned above to 
"|o!)-ladclct" and "labor-mat ket segmentation" approaches generally more 
critical ol (be status c|uo. l.ydall's study is a ma)c)r and possiblv .i highly 
mflueiUial addition to this litei.iltiie. although Lyilall's own summars of his 
iiiiicisalive conclusions (chap. 15, esp. pp. 285-91) is cxcasionally less than 
generous to his luimercius |rrc’decesvirs. If i.ydalfs Ixiok is mteiidecl as more 
ih.in a conlrihution, but .is the lati'twenlielh-century ec|mvalc'nl of John Bates 
C.latk’s late mueteenlh-ceiUm y magnum opus (as that summaty chapter 
hinls), I Ic'ar that it falls somewhat short. 

M.SR IIN BrOVH-NBR t S.M I< 

Dukr i'lin’cnily 


agi ic iiliiirai) secloi, and /)< ,is the* propnrlion in the uihaii (nr iinnagriccclturai) scttni 
(I lie si.iiistic al Mgnificance* oi /t ) .ind /L .ippeai s chihinus in l.y dal! s ic-vl ) 1 he \ ,ilue 
(it I) 38 , while low, IS sigiiiticaiiilv grcaler lhan rc-rn 
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Keith B. Leffler 
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The conditions under vvhith transactors ran vise the market 
(lepeat-piiriliasc) nu'vhantMii o( loniratl cnloiremeni are exam¬ 
ined. liK leased pi'K e is shown lo be a iiiejiis o( assiii injf coiilrat lual 
pel lormaiRC. ,A. netess.iry and siiKuieiit tondilioii lor perlormaiicc 
is the existence of price sudkieiitly above salvageable piodviction 
tosls so that ihe iionperfoi ining (inn loses a disiounletl si ream ol 
leiils on lutiirc sales whuh is greater than the wealth increase Irom 
nonjK'iloi malice. Ibis ssill generally imply a maikel price greater 
than the perlevlly vompelitixe prive and raiionah/c insestmeius in 
hrin-spevihr assets, Adieitising iinesiiiienls thereby bevoiiie a pos¬ 
itive indicator ol likely pet lorin.ince. 


I. Introduction 

.An implicit assumption ol the economic paradigm of market ex¬ 
change is the preseticc of a goveinment to clehne property rights and 

VVe ihsink Armen Alihidii. 1 ln>in.is Boiiheidinj^, Haiold Deiniei/, James feTgiist)n. 
Jack Hir!»hleilei, Mall IJndsa), Ro) Kennc), |ohn Long. Ian M.une)l. Kevin Minphv, 
Phillip Nelson, Joseph Osiroy, Hcicr Pashigian, Sam Peli/man, (icorge Priesi, John 
Riley, Jonathan Skinner, (ieoige Stigler, Karl I'hc^mpson, and pariiapants at seminars 
at U(U.A and the l-iiivcisiiy ol ('liicago during 1977-78 lor hflpUd suggestions and 
(oiiimetiis. lonalhan Skinner also piovided valuable researeli assistance. The Foiinda- 
lion for Research in Economics and Education and the University of Uhicago Law 
Schotd l^iw and Kcoiiomics Progiam provided Klein wiih research suppoit. and ihe 
(anuer for Research in (ioveinineiu Policy and Business^ Univeisiiy ol RcKhcster, 
provided l-cffler with lescaich support. 
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enfbixe coiilratts. An iinfiortant element of the legal-philosophical 
tra<lition upon which the economic model is built is that without some 
third-party enlbrcer to sanction stealing and reneging, market ex¬ 
change would be impossible.' But economists also have long con- 
sideied “reputations” and brand names to be private devices which 
provide incentives that assure contract performance in the absence of 
any third-party enforcer (Hayek 1948, p. 97; Marshall 1949, vol. 4, p. 
xi). This private-contract enforcement mechanism relies upon the 
value to the linn <jf repeat sales to satisfied customers as a means of 
preventing nonperformance. However, it is possible that economic 
agents with well-known brand names and reputations for honoring 
contracts may liiid it wealth maximizing to break such potentially 
long-term ext hange relationships and obtain a temporary increase in 
profit. In particular, the determinants of the efficacy of this market 
method ol contract performance and therefore the conditions under 
which we ate likely to ob.serve its icse remain unsjx;cihed. 

This paper examines the nongovernmental repeat-purchase 
coutract-enlbrcenient mechani.sm. I’o isolate this force, we assume 
throughout our analysis that contracts are not enforceable by the 
govei nment or any other third party. Transactors are assumed to rely 
solely on the threat of termination of the business relationship for 
enforcement of contractual promi.ses.* rhis a.ssumption is most 
realistic for contiactual terms concerning difficult-to-measure prod- 
u( i chaiacteristics such as the “taste” of a hamburger. However, even 
when the aspects of a contract are less complicated and subjective and 
therefore performance more easily measurable by a third party such 
as a judge, spe< ifu ation, litigation, and other contract-enforcement 
costs may be substantial. I'herefore, explicit guarantees to replace or 
repair defective goods (warranties) are not costless ways to a.ssure 
contract performance. Market arrangements such as the value of lost 
repeat purchases which motivate transactors to honor their promises 
may be the cheapest method of guaranteeing the guarantee. 

While our approach is general in the sense that the value of future 
exchanges can motivate fulfillment ol all types of contractual prom- 


' H<>blx*s ((U351J 1955. pp. 89-lH)) iiiaimains that . . he that perfoiineih first, has 
lu) dsisuiance the oihei will peifoiin aliei; Intause the Ixiiids of words aie weak to 
hiidle rrieri\s aaihiriori, avarice, anger, and other Passions, without the fear oi srnne 
roertive Power; whuh in ihc condition oj here Nature, where all men arc equal, and 
judges of the jiisines.s ol iheir own Tears cannot possibly be supposed.” 

^ 1 his nssurnpiion is consi.sieni with the pioneering woik ot Macaulay (1963), wheie 
leliaiue on foi nial contracts and the threat of explicit legal sanctions was found to be an 
exiicMiiely raie elenicnl of interfirm relationships. Macaulay provide.s some skeufiy 
cvideiue that Inisiiiess firms jirevent nonfulfillment of coniraels by the use of effective 
nonlegal saiiciions consisting piimariiy of the hiss of future business. This “relational 
nature of (ontracts has lieen recently emphasized by Macneil (1974), and also by 
Goldlierg (1976) and Williamson (1979). 
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ises, we focus in this paper on contracts between producers and 
consumers regarding product quality. In order for a repeat-sale en¬ 
forcement mechanism to operate, we assume that the identity of 
firms is known by consumers* and that the government enforces 
property rights to the extent that consumers voluntarily choose whom 
to deal with and must pay for the goods they receive.^ In addition, 
managers of firms are assumed to be wealth maximizing and to place 
no value on honesty per se. 

In Section II, the conditions are outlined under which firms will 
either honor their commitments to supply a high level of quality or 
choose to supply a quality lower than promised. In order to em¬ 
phasize the ability of markets to guarantee quality in the absence of 
any government enforcement mechanism, a simple model is pre¬ 
sented which assutpes that consumers ctistlessly communicate among 
one another. Therefore, il a firm cheats and supplies to any indi¬ 
vidual a quality of product less than contracted for, all consumers in 
the market learn this and all future sales are lost. A major result of 
our analysis is that even such perfect interconsuiner communication 
conditions are not sufficient to a.ssure high quality supply. Cheating 
will be prevented and high quality products will be supplied only if 
firms are earning a continual stream of rental income that will be lost 
if low quality output is deceptively produced. The present discounted 
value of this rental stream must be greater than the one-time wealth 
increase obtained from low quality production. 

This condition for the “notorious firm” reix-at-purchase mecha¬ 
nism to assure high quality supply is not generally fulfilled by the 
usual free-etitry, perfectly competitive equilibrium conditions of price 
equal to marginal and average cost. It becomes necessary to distin¬ 
guish between production costs that are “sunk firm-specific assets 
and those production costs that are salvageable (i.e., recoverable) in 
uses outside the firm. Our analysis implies that firms will not cheat on 

^ Nonidenlitu'aiioii ot firm output tt> qu.ility <tfpicciaut»u via a standard rxlitr- 
iiality argumciti; i.e., supply !>>' <1 particular firm of lower than aiiiicipawd ipialiiy 
imposes a cost through the loss ol future sales not sole!) on that fit m but on all hi ins tn 
the industry (see Akerlof 1970; Klein 1974) 

* For simplicity, we .xsstiinc that "thett," as opptiscd to nonfulhllmeiil ol coniratt is 
not possible. While "(raud," in ihe sense of one paiiy to the iiansaciion inieniionally 
supplying le.ss than contracted for. is analytically similar to 'ihetl. we diaw a disiimiion 
along this continuum by assuming that the government only irermiis "soliiniary' 
iran.sactions in the sense that iran.saciors choose whom to trade with. Fherelore. while 
consumers cannot • steal" gtrods, ihev can. in principle, pay lor the goods dies receive 
with checks that bounce, and while firms cannot robconsunicrs, they can. in principle, 
supply gcxMis of lower than promised tpiality. Although we lecognue the great 
difhculty in practice of separating the underlying government eidorcement mecha¬ 
nisms, C.B., property law, from the private promise-enforcing mechanisms we are 
attempting to analyze, this distinction between theft and fraud is analytically unam- 
bigiious. 
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promises lo sell high quality output only if price is sufficiently above 
salvageable production costs. While the perfectly competitive price 
may imply such a margin al)ovc salvageable costs, this will not neces¬ 
sarily be the case. The fundamental theoretical result of this paper is 
that market prices alx)ve the competitive price and the presence of 
nonsalvageable capital are means of enforcing quality promises.'"’ 

In Section 111 our theoretical model ol quality-guaranteeing price 
premiums above salvageable costs is extended to examine how the 
capital value of these price-premium payments can be dLssipated in a 
free-entry ecjuilibrium. The quality-guaranteeing nature of nonsal¬ 
vageable, firm-specific capital investments is develc’iped. Alternative 
technit]ues of minimi/.ing the cost to consumers of obtaining an as- 
.sured high cpialily are investigated. We also ex|>lore market responses 
lo consumer uncertainty about quality-a.ssuring premium levels. Ad¬ 
vertising and other production and distribution investments in “con¬ 
spicuous" assets are examined as competitive responses to simultane¬ 
ous (|uality and produciion-co.st iincertainiies. Finally, a summary of 
the analysis and some concluding remarks are presented in Section 
IV, 

II. Price Premiums and Quality Assurance 

Assume initially that consumeis costlessly know all market prices and 
piodticrion technologies Itui not the tjualilies of goods offered lor 
sale. For simplicity, the good Iseing considered is assumed to be 
chaiacteri7cd by a single objective (juality measure, q, where quality 
refers to the level ol some desirable characteristic contained in the 
good. F.xainples are the <|uietness of appliance motors, the wrinkle- 
fiee or colorfast properties of clothing, or the gasoline mileage of an 
automobile. We also a.ssume that the economy consists of consumers 
who consider buying a product ,x each period, where the length ol a 
pericjd is defined by the life (repurchase peiiod) of product x, and 
who are assumed to costlessly communicate quality inlormalic^n 
among one another. Fherefore, if a particular firm supplies less- 


^ I notion (tiat aii iiRicascd price can sci vc <ts a means of assuring High quatny 
supply by giving ibc* firm a rental stream that wiH fie lost if future sales ate not rnaiie is 
n<*i new. Adam Smith {[1776] 1937, 105) suggested this foiic more than 200 years 

ago when he noted that “the wage^ of labour vary according to the* small or great trust 
which must lx* reposed in the woiknian. I he wages ol goldsmiths and jewellers aie 
everywheie supericjr lo those of many other workmen, not only of equal, hut of much 
superior ingenuity; on act ouni of the pretious metals with whu h they are iniilisted. W’c 
trust our health to the physician; our fortune aiui .sc^rnctitnes out life and re[iuiaiion to 
the lawyer and attorney. Such cimfidc'iHC could not safely he reposed in jx.‘ople of a 
very mean or low condition." Similar competitive mechanisms recently have been 
analyzed by Becker and Siigler (1974) and Klein (1974). 
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than-contracted-for quality to one consumer, the next period all con¬ 
sumers are assumed to know. In addition, this information i.s as.sumed 
not to depreciate over time.® 

Identical technology is assumed to be available to all entrepreneurs. 
Hence, there are many potential linns with identical total cost func¬ 
tions, C ~ c(xjq) + F{q), where F is fixed (invariant to rate) costs. 
Higher quality and larger quantities require higher production costs, 
F, > 0, c„ > 0, Cj, > 0, and marginal cost is assumed to increase with 
quality, Cj., > 0. Fixed costs are assumed initially to be expenditures 
made explicitly each period rather than capital costs allocated to the 
current period. For example, they may include a payment on a 
short-term (one-period) rental agreement for a machine but not the 
current forgone interest on a purchased machine or the current 
period’s payment on a long-term rental agreement—both of which 
imply long-term and hence capital commitments. 

We therelore are explicitly distinguishing between “fixed” costs in 
the sense employed here of constant (invaiiant to output) current 
costs and “sunk” (nonsalvageable) capital costs. The usual textbook 
proposition that a (inn will not shut down production as long as price 
is greater than average variable cost blurs this distinction and im¬ 
plicitly assumes that all lixed costs are also sunk capital costs. Our 
assumption of the tompleie absence of any long-tei ni cominiitnems is 
analytically equivalent to perfect salvageability ot all capital assets. If 
all long-term production-factoi commitments were costlessly resers- 
ible, that is, all real and Imaucial assets such as the machine or the 
long-term machine rental contract could be costlessly resold and 
hence perfectly salvagealtle, there also would not be any capital costs. 
Only the nonsalvageable part of any long-term commiimeiit should 
l>e considered a current sunk capital cost. 

If buyers are costlessly mfoimed alroul quality, the competitive 
pi ice schedule, /■’r. for altci ualive quality levels is given by the 
minimum average pitKliKlion costs lor each level i>l quality and is 
designated by Ibis is represented in figure 1 lot two 

alternative quality levels,!/;, andc/min, pricesF, and F Suppose, 

however, that the (pialily ol proclucix cannot be determined costlessly 
before purchase. For simplicity, assume prepurchase inspection re¬ 
veals only whether c|ualiiy is lieJow some minimum level, and that 

^ H wf iriodil) (he ti.ssuiii|»iion^ ot our lo inakt* iiiU‘rronhium,'i c(>iiuinini(.»uu>n 

less than perfect and allow intlows ol new ignoraiu consuiuers over nme and per- 
nit individuals to forget, the poiemial shoit-run iheating s®*”' tiims would Se 
nuiLased, riierelore, ihc ipialiiy-asMUing pine pieinivini would Ix' than we 

tlenve l>elow. In this case increaseti iirni si/e, l>y mailing it intue liLoK dial tiu* 
tndivnluals one is sliaiiiiR nioiliicl-qaality iiiloi inatioii with (0 g . laiiiil> aiict tueiub) 
tiave piirtiiascd from ilie Sfinic fii in, luwcrs the ptuc’iuial slunt-rnii cheating ga"* hy 
essentially retiutmg the repiirthasc (X-riod, 
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the costs are prohibitive of determining whether quality is above 
prior to purchased Obviously, whenever the market price that con¬ 
sumers will pay for asserted high quality exceeds the cost of produc¬ 
ing minimum quality output, the firm can increase its initial period 
profits by producing tfie minimum quality output and deceptively 
selling it as a higher quality product. 

If producers are to have an incentive to produce high quality 
products (in the absence of governmentally enforceable contracts), 
consumers must somehow reward high quality production and 
punish low cjuality production. We assume in this competitive 
framework that consumers will purchase from particular sellers 
randomly chosen from the group of homogeneous sellers over which 
consumer information is transmitted. If a consumer receives a prod¬ 
uct of a c]uality at least as high as implicitly contracted for, he will 
continue to purchase randomly from this group of sellers. On the 
other hand, if quality is less than contracted for, all consumers cease 
to purcha.se from the particular sampled “cheating” firm. 

’’ The quality o( the gixxi beyond the mininiuin level is therclore what Nelson ( 1970 ) 
ha.s labeled a.s an "experience" characteristic. Making the minimum quality level en¬ 
dogenous does not subsunlially change the following analysis. 
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Consider now an initial “competitive” equilibrium in which a single 
firm contemplates selling a quality below that expected by customers. 
Given the competitive market price for some high quality, 
in figure 1 , this particular firm could increase its initial period quasi 
rents by producing minimum quality and selling it at the high quality 
price. However, since buyers are assumed to communicate fully with 
one another, all future customers of high quality output, that is, sales 
at prices greater than PAqmSr,) = Pa in figure 1, are lost. That is, a firm 
that cheats will become known as a “notorious” cheater, and consum¬ 
ers will not purchase from the firm any product the quality of which 
cannot be determined prepurchase.* 

Whether sales of high or minimum quality will maximize the firm's 
wealth depends on whether the capital value of f uture quasi rents 
from continued high quality production exceeds the differential ini¬ 
tial period quasi rents from quality depreciation. In terms of figure 1, 
at the perfectly competitive price for high quality output, f*,, price is 
equal to the average costs of high quality production. Therefore, the 
quasi rents from continued high quality production are zero. If, 
alternatively, the firm were to deceptively produce minimum quality 
output, as a price taker it would expand its production to X 3 (where P, 
= and receive a one-period quasi rent, the present value of 

which is equal to: 




1 -f r 


(P, - P„)X 3 - - P„]d!x]. 


( 1 ) 


Therefore, at the perfectly competitive price for any quality above 
qrn\n firms will always cheat consumers and supply 

Faced with this possibility, consumers would recognize that re¬ 
gardless of producers' promises they will not obtain the higher quality 
product. Therefore, consumers would be willing to pay only the 
costless information price of the minimum quality product whose 
quality they can verity prepurchase, Pq. Because of such rational 
consumer anticipations, firms will not be able to cheat, but desired 
high quality output will not be supplied. 

There may, however, be a price higher than the perfectly compe¬ 
titive price of high quality output, P,, that il it were the equilibrium 
market price would (a) motivate honest production of the high quality 
good and {b) not completely dissipate the consumers’ surplus from 
purchase of higher quality, (ionsider a price such asPj in figure 1. A 


" A terminaicd firm cannot begin business in this industry under a new name. 
However, the highest valued alternative use ot the entrepreneurial skills is included in 
salvageable fixed production costs. T he firm considered hen is assumed to face the 
same opportunities elsewhere as the hrms that arc honest in production ot x. T here- 
lorc, the cheating firm can elect to enter a new industry. 
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firm supplying high quality output will now expand its production to 
Xj. The price premium P, defined as the increase in the price above 
minimum average cost of high quality, provides firms supplying high 
quality with a perpetual stream of quasi rents the present value of 
which (assuming unchanging cost and demand conditions over time) 
is equal to: 


W., 


1 - 

^ Px, - [MC^(x) 

r Jx, 



( 2 ) 


The price premium also increases the gains to a firm from supply¬ 
ing minimum quality at the high price. A firm that chooses to cheat 
will now expand its output (in terms of fig. 1 tox^) and earn the extra 
premium on all units sold.'* Therefore, the capital value of the quasi 
rents from supplying (juality less than promised is: 


IV, = 


1 + r 


[P + (P, - P«)|X4 


r 


- Po^<i^\- (3) 


A film will honor its implicit quality contiact as long as the differ¬ 
ence between the capital values of the noncheating and cheating 
strategies, W., - W^, is positive. Consider the quasi-rent flow of the 
cheating and noncheating alternatives, that is, the terms in braces in 
our ex|>re.ssions (2) atid (3). Define QP 2 equal to rW 2 and QP, equal to 
(1 + r)VV'|. A firm will theti elect not to cheat if and tinly if : 

_2^^_y_±£L (4) 

ClH -i r ■ ^ > 

Therefore, there will be a price premium that motivates firtns to 
lionestly produce fiigh quality as long as: 

L'sing L'Hospital's rule, equation (5) will be satisfied as long as 


I > (*^^4 - Xj) 

r v. 


( 6 ) 


for all P > P:i, where P, is some finite price. Intuitively, as the price 
increases it is otily the int reasc in t|ua.si rents on the additional units of 
minimum quality output that favors the deceptive strategy. Kquation 
(6) insures that price increases beyond some level increa.se W 2 more 
than IV, sucfi that eventually is greatei than W..,. 


" N’ole ihat allh<nij{h Xj Tiiay l>e greater or le;,., than x,, (lepeiiding on ihe price 
prcnrium cliown, given upwaicl-sloping supply functions and tile condition that 
1 ’ niust be the case that x, > x, > x,. 
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The condition specified in equation (6) is quite reasonable. It will fjc 
satisfied as long as a cheating firm does not accompany cheating with 
very large output increases. If, for example, the real interest rate 
were .05 we require only that the output increase by a cheating firm 
not be more than 20 times the total output that would be produced if 
the firm were not cheating. Hence, under very general cost conditions 
a price premium will exist that motivates competitive firms to honor 
high quality promises becau.se the value of satisfied customers exceeds 
the cost savings of cheating them.'“ 

While we cannot state broad necessary conditions for the form the 
cost function must take to imply the existence of a quality-a.ssuring 
price, “reasonable" sufficient conditions can be stated. In particular, 
all cases of vertically [larallel marginal cost curves, as illustrated in 
figure 1, where quality is produced by a fixed input not subject to 
decreasing returns to scale (such as the u.se of a better motor) and 
where the second derivative of marginal cost is greater than or equal 
to ^ero imply the existence of a quality-assuring price. T he Appendix 
cotuains simulation results under the more unrealistic assumption of 
isoelastic marginal cost functions. These simulations indicate the ex¬ 
ceptional nature of the conditions when e<|uation (6) is not satisfied. 
When a quality-assuring price does not exist, the cost 1 unctions are 
generally such that ;tt reasonable premiums cheating output would be 
expanded by very large factors (often f.actois of many thousands). 
Since marginal cost functions lor most products can be exix-cted to 
become cpiitc steep if not vertical at output expansions ol much less 
than these f actors, a quality-assuring price premium can genet ally be 
expected to exist. 

Throughout the remainder of the paper we assume the existence of 
a quality-a.ssuring piice. For given cost and demand conditions, the 
minimum (|uality-assm ing price will depend iqson the level ol quality 


'» I ho pou-iuial tun. Uun ul p. uv piemnuus as qual.u guaianlo. s is also applicable In 
nu.kcMs i’;. KhKb l.r.ns t.uc <|Lnwa.d-s...p.nK .f.-n.amis, tn .Ins case, 

Inins n, .....ease sales .mhnnt .eclmnons ... p.ue InnUs .he ga.ns asa.lablc li.m 

.leeepllve nn..i........ <|..alnv puKlnciun .as pii.c .n. .eases 1 he «'Us cn.^ Z 

s..lf„,en. «. g..a.anlee gualiiv ....« deiKM.ds .... .he elasuot) o '''" 1 "' ^ 

.he ms, savi.,gs I,on, quah.v .e.lno.ons sanous<,uan,...es. 

se.nchinK turns do no. have s.al.le Im.ue de.uau.ls, j 

inoent denia.id rondiuons .s ..... sulhoen. .o oun.a.e 
.,nn„paced In.u.e den.aud v.s-a-v.s .....en. .le...an.\ .s also > 

whe,e musu.nc s expect a growing demand to, .he ,,.,tpn. "V* ^ ,.d.tV o. t...n.e 
produce high .piahlv oiiipiit, ll.e laie ol .|.ias.-ieiii o.v ion ^ ^ ^ 

deecpuvc iinniniiiiii .pialilv) pi.KliKl.oi. .j „ j Lirger t.ipiul 

will, .he same inilial but lonsiani dcinaud. ihe glowing , inilial neviiwl Virtus 

value re.„m a, any p..ce Iron, high .p.ali.) prixlmuon 
tac.nKexpe.,e<i demand grow.l, w.ll therelorc 

p.emh.m.s. Sec Rle.n, Mclanghlm. an.l .Murphy (1980) luran anaivs.s ol .he 
claslic him clcmaiul tas<’. 
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considered and is denoted byP* — P*{g, f)- Our analysis implies 

that the qiiaJity-assiiring price will increase as quality increases, as 
minimum quality decreases (for all q greater than ^mui). and as the 
interest rate increases. These conditions are consistent with the 
familiar recognition that, given a particular quality level, quality- 
cheating problems are less severe the higher the level of quality that 
can he detected prepurchase and the shorter the period of repur¬ 
chase." 

Intuitively, the quality-assuring price treats the p>otentiai value of 
not producing minimum quality as an explicit opportunity cost to the 
firm of higher quality production. Hence the quality-assuring price 
must not only compensate the firm for the increased average produc¬ 
tion costs incurred when quality above that detectable prior to pur¬ 
chase is produced, but must also yield a normal rate of return on the 
forgone gains from exploiting consumer ignorance. This price “pre¬ 
mium” stream can be thought of as “protection money” paid by 
consumers to induce contract performance. Although the present 
discounted value of this stream equals the value of the short-run gain 
the firm can obtain by cheating, consumers are not indifferent be¬ 
tween paying the “premium” over time or permitting the firm to 
cheat. The price “premium” is a payment for high quality in the face 
of prepurchase quality-determination costs. The relevant consumer 
choice is between demanding minimum quality output at a perfectly 
competitive (costless information) price or paying a competitive price 


" We (dll complicate our model by dropping the assumption that nondeteiving firms 
are aiiiKipated to prcxlucc forever. 11 firms have a finite life, and the last period of 
production is known by both firms and convumers, there wifi be no premium sufficient 
to guarantee quality. No matter how high the premium paid by consumers for a high 
quality good in the last pericxl, firms will supply "deccplive" minimum quality because 
there arc no future sales to lose. Cxmsumers aware of the last period will therefore 
demand only tlie minimum quality in that period. But then the next to llie last period 
becomes ilie last pcntKl in the sense that, independent of the price premium, firm 
wealth 15 maxiiiii/ed by supplying minimum quality and going out of business. Con- 
.suiners will then only pay for miniiniiiii qiiaJiiy output in the next to last period, and so 
on. High quality will never be produced. However, the necessary unraveling of the 
piemium solution to assure high quality requires prior consumer knowledge of a date 
beyond which the firm will not prtxlucc. If consutners merely know that firms have 
finite lives but (aimot with certainly specify a date beyond which a particular hrm will 
not exist, price premiums may assure quality. While consumers are aware that some 
transactions will be with firms in their last peruxi and hence cheating will occur, the 
expected gain Irom purchasing high promised quality can he positive. Our price 
premium-repeat business quality enforcemeiit mechanism is analytically equivalent in 
form to the "super-game" solutions to the pri.soner’s dilemma problem developed in the 
game-theory literature. A general result of that analysis is that a cooperative solution 
can exist if one assumes either an infinitely long super game (as we have assumed in our 
nuxlel). or a super game of finite length but with transactors who have sufficient 
uncertainty regarding the pcritKl when the su|)cr game will end (see, c.g., Luce and 
Raifla [1957], pp. 97-102, or, for a more recent solution to the problem that is similar 
in emphasis to our approach, Telser [1980]). 
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higher quality rather than the minintum quairproSuct. ff ^ontum^ 
ers can easily substitute increased quantity of the low quality produci 
for increased qu^uy, then the value of guaranteed high quality will be 
relatively low. Therefore, although a quality-guaranteeing price 
exists, a higher than minimum quality product may not be produced, 
for those goods where the substitution possibilities between quality 
and quantity are lower (e.g., drugs), consumer demand for confi¬ 
dence will be relatively high and the high quality guarantee worth the 
price premium. We assume throughout that we are dealing with 
products where some demand exists for the high quality good in the 
range of prices considered. 


in. Competitive Market Equilibrium; Firm-specific Capital 
Investments 

Our analysis has focused on the case where costless information 
(perfectly competitive) prices do not imply sufficient firm-specific 
rents to motivate high quality production. A price premium was 
therefore necessary to induce high quality supply. Thus, if price 
assures quality, the firms producing quality greater than appear 
to earn positive economic profits. However, this cannot describe a full 
market equilibrium. When the price is high enough to assure a par¬ 
ticular high level of quality, additional firms have an incentive to enter 
the industry. But il additional firms enter, the summation of the 
individual firms’ outputs will exceed the quantity demanded at the 
quality-assuring price. Yet this output surplus cannot result in price 
reductions since the quality-assuring price is. in effect, a minimum 
price constraint “enforced” by rational consumers. All price-taking 
firms supplying a particular promised quality q above the minimum 
face a horizontal demand curve at P* = P*iq), which is of an unusual 
nature in that prices above or below P* result in zero sales. Consumers 
know that any price below P* for its associated quality results in the 


'* As opposed to the Darby and Kami (1973) analysis, this particular mtxlcl implies 
an equilibrium quatilily ol ■■|raud’’ci|ual to zero, where b aud is the dilferenre between 
anticipated and actual quality. Given the syininetncal information assumptions re- 
Kai ding cost functions, parlies to a romraci know when and by how murlia toiliraU will 
be broken. An unanticipated broken quality contrail is therefore not possible, rtic 
implicit economic (as opposed to common usage) concept of "coiuraci" refers to 
anticipated outcomes and not to verbal promises or written agreements', thus there will 
be no broken quality "contracts." 
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supply of They therefore will not purchase from a firm promis¬ 
ing that quality at a price lower than P*. 

A. Brand Name Capital Itwestments 

Competition to dissipate the economic profits being earned by exist¬ 
ing firms must theretdre occur in nonprice dimensions. However, the 
zero-profit equilibrium is consistent with only a very particular form 
of profit-absorbing nonprice competition. The competition involves 
prm-specific capital expenditures. This firm-specific capital competition 
motivates firms to purchase assets with (nonsalvageable) costs equal to 
the capital value of the premium rental stream earned when high 
quality is supplied at the quality-assuring price. That is, H'P*(q) is not 
U) increase, the investment leading to zero profits must be highly firm 
specific and depreciate to zero if the firm cheats and supplies ^min 
rather than tlie anticipated Cjuality. Such firm-s|}ecific capital costs 
could, for example, take the form of sunk investments in the design of 
a firm logo or an expensive sign promoting the firm's name. Ex¬ 
penditures on these highly lirm-s|>ecifit: assets are then said to repre¬ 
sent brand name (or selling) capital investments. 

The competitive proce.ss afso forces the firm-specific capital invest¬ 
ments to take the form of as.sets which provide the greatest direct 
service value to consumers. Tfie consumers’ “ef fective’’ price of pur- 
(basing a qitality-assttred good, where the ef fective price is defined as 
the purchase price of a product, P*, le.ss the value of the services 
yielded tiy the joititly supplied lirand name assets, is thereby 
minimized. Competition among firms in seeking and making the most 
highly valued firm-specific brand name capital investments will occur 
until the expected wealth increase and, therefore, the incentive to 
enter the industry aie eliminated. 

If the firm decides tt) cheat it will experience a capital Itiss equal to 
its anticipated future profit stream. .Since P*(q) is derived so that the 
threat of loss of this future profit stream motivates guaranteed quality, 
the zero-profit etjuilibrium “brand-name capital,”^, which serves as 
an explicit hostage to prevent cheating, equals, in terms of figure 1, 
where P*{qh) = P 2 , 

^ ~ (^C)oJx2 

That is, the market value of the competitive firm’s brand name capital 
is equal to the value of total specific or “sunk" selling cfists made by the 
firm which, in turn, equals the present value of the anticipated pre¬ 
mium stream from high quality output. If we continue to assume that 
there are no capital (and therefore “sunk”) costs of production, the 
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zero-prolit equilibrium is shown in figure I where average “total” tost 
(which includes average production costs, plus average brand- 

name capital [i.e., nonsalvageable “selling”] costs, r[y3/;c]) just equals 
price, 1 ^ 2 - 

What assures high quality supply is the capital loss due to the loss ol 
future business if low quality is produced. .Since the imputed value of 
the firm s brand name capital is determined by the firm’s expected 
quasi rents on future sales, this capital loss from supplying quality 
lower than promised is represented by the depreciation of this firm- 
specific asset. The expenditures on brand name capital assets are 
therefore similar to collateral that the linn loses if it supplies output of 
less than anticipatetl quality and in equilibrium the premium stream 
provides only a normal rate of return on this collateral asset. 

Note that the “efiective" price paid by consumers, which equals the 
quality-assuring price less the value of the consumer services yielded 
by the brand name capital, may l>e minimized by the investment in 
specific selling assets with some positive salvage value. Kven though 
this results in an increased quality-guaranteeing price, assets with 
positive salvage values may yield differentially large direct consumer 
service (lows. All brand name capital a.ssets must, however, satisfy a 
necessary condition that the salvage value fter unit of output be less 
than the consumer service value. Firms competing to minimize the 
elfecttve price will choose specific assets by trading off increased 
consumer service value with decreased salvage value. This may ex¬ 
plain why stores which supply high quality products often have 
amenities (such as .i luxuiious car|x;t cut to fit the particulai .store) 
even though only stnall direct tonsutner services may he yielded 
relative to cost.*'' 

H. Nonsalvagfdble Productive As^eh 

The market equilibrium we have developed implies an ellcctive price 
for high quality output that is higher than what would exist in a zero 
information cost world. While the costless-inlormalion solution is 
meaningless as an achievable standard of “efficiency,’ alternative 
marketing arrangements may be usefully compared to this bench¬ 
mark, Viable, competitive firms will adopt the arrangements which, 
considering all transacting and contracting costs, minimize the devia- 

IIThc “sunk " asset yiekis absolulcly i><> uiiisiiiner services, then the hi m will iiol ti.se 
It. Even thouj^h prufii.s wiiuiil lx* ehininaiecf by puiehase tii such an a.ssei. tonstimeis 
wouut be indifterent between a In ni that invesied in the asset and a tirrn that did not. In 
a world where coiistiniers do not (Kcssess full knowledge of eosi conditions, howevei, 
use <it obvifvusly specific assets may l>e enipfoyed even it yielding no direr I consumer 
service flow becau.se they may elluienllv iiitorm consumers regarding the stink capital 
cost to the firm. This is discussed in greater detail in Sec. IIIC, 
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lions between the costJess-infbrmation price and the effective price. 
One potentially efficient alternative or supplement to the pure price- 
premium method of guaranteeing quality may be the use of nonsal- 
vageable productive assets rather than brand name (selling) assets. 

In order to simplify the analysis of price premiums in guaranteeing 
quality, we have assumed that all production costs, including fixed 
costs, were noncapital costs and therefore, by definition, salvageable. 
More realistically, firms can control both the capital intensity of pro¬ 
duction and the salvage value of any fixed assets employed in the 
production process. In particular, if the firm uses a production pro¬ 
cess that has a nonsalvageable capital element, the normal rate of 
return (cjuasi-rent stream) on this element of production capital ef¬ 
fectively serves as a quality-assuring premium. In terms of our model, 
the capital value of the quasi-rent stream when a firm cheats (eq. [3]) is 
now modified so that the net gain from cheating equals IVs minus this 
nonsalvageable capital cost. Alternatively, in the zero-profit equilib- 
l ium the total level of collateral must still equal the potential gross 
gaitis frotn cheating, but part of the collateral is now provided by the 
nonsalvageable production assets rather than the brand name capital 
assets. 

For example, if a machine is somewhat illiquid, buying it rather 
than renting it shoit term provides some of this collateral and lowers 
the quality-guaranteeing price. In fact, because of positive selling 
costs, capital assets generally have a salvage value less than cost. Thus 
cajiital inputs, especially those that have a high turnover cost, will 
have a value in terms of providing quality assurance in addition to 
their productive value. Even if the asset is not firm specific, if there is 
any time delay after the firm cheats and is terminated by consumers in 
selling the asset to another firm, the firm loses the real rale of interest 
for that time period on the capital. In addition to physical capital, 
human capital costs, especially entrepreneurial skills, are also often 
highly nonsalvageable in the face of consumer termination and there¬ 
fore also provide significant quality assurance. 

rhe general theoretical point is that tlie presence of positive 
quality-information costs favors an increase in the capital intensity of 
production, including the extent of long-term, illiquid contractual 
arrangements with suppliers of productive inputs. In particular, the 
minimum-cost production technique is no longer necessarily that 
which minimizes solely the average cost of production. “Sunk” pro¬ 
duction capital now accomplishes two functions—the supply of pro¬ 
duction services and the supply of quality-assuring services. Therefore, 
increases in average production costs accompanied by larger increases 
in sunk production assets may minimize the effective consumer prod¬ 
uct price. Profit maximization requires firms to trade off “inefficient” 
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production technologies and the quality-assurance cost savings im¬ 
plied by the presence oi lirm-sjiecilic (sunk) capital assets in the 
productive process and hence the reduced necessity for the firm H) 
make sunk selling cost (brand name capital) investments. Although 
the more capital intensive production technology may increase the 
perfectly competitive price of high quality output, Po< h reduces the 
price premium, P 2 - /“i, necessary to assure the supply of that high 
quality. In fact, even a very slight modification of the minimum 
production cost technology, such as an alteration in some contractual 
terms, may imply the existence of large enough nonsalvagcable assets 
so that the need for a quality-guaranteeing price premium is elimi¬ 
nated entirely.'"* 


C. Consumer Cost Ufuerlainty: A Role for Advertising 

The discussion to this point has assumed complete consumer knowl¬ 
edge of firms’ costs of producing alternative quality outputs and 
knowledge of the extent to which any capital production costs or 
brand name capiul selling costs are salvageable. This knowledge is 
necessary and sufficient to accurately calculate both the (juality- 
guaranteeing premium and price. However, consumers are geneia y 
uncertain about cost conditions and therefore do not know the 
minimum quality-guaranteeing price with perfect accuracy. In lact, 
consumers cannot even make perfect anticipated quality rankings 
across firms on the basis of price. I'hat one hrtn has a higher price 
than another may indicate a larger price premium or. alternatively, 
more inefficient production. In this section, we examine how the 
more realistic assumption of consumer cost uncertainty influences 
market responses to prepurchase quality uncertainty. 

We have shown that increa.ses in the price premium ovei aveiage 
recoverable cost generally increase the relative returns from pioduc- 

quality. The existence of a high price premium also ^ 

penditures on brand name capital investments 
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the magnitude of the price premium. When a consumer is uncertain 
about the cost of’ producing a particular high quality level of output 
and therefore the required quality-assuring premium, information 
about the actual level of the price premium will provide information 
about the probability of receiving high quality. If consumers are risk, 
averse, this uncertainty about receiving anticipated high or decep¬ 
tively low quality output will increase the premium that will be paid, 
fhe premium will include both a (presumably unbiased) estimate of 
the quality-assuring premium and an extra payment to reduce the 
risk of being deceived. 

Thus, when consumers do not know the minimum quality¬ 
guaranteeing price, the larger is a firm’s brand name capital inve.st- 
ment relative to sales, the more likely its price premium is sufficient to 
motivate high quality production. Competitive investment in brand 
name capital is now no longer constrained to assets which yield direct 
consumer service flows with a present discounted value greater than 
the salvage value of the assets. Implicit information about the suffi¬ 
ciency of price as a guarantee can be supplied by “conspicuous” 
specific asset expenditures. Luxurious storefronts and ornate displays 
ot signs may be supplied by a firm even if yielding no direct consumer 
service Hows. Such hrm-specific assets inform consumers of the mag¬ 
nitude ot sunk capital costs and thereby supply information about the 
c|uasi-rent price-premium stream being earned by the firm and hence 
the opixutunity co.st to the firm if it cheats. Both the informational 
services and the direct utility producing services of assets are now 
relevant considerations fora firm in deciding upon the most valuable 
form the brand name capital investment should take. 

The value of information about the magnitude of a firm's specific 
or “sunk” capital cost, and tfierefore the magnitude of the price 
premium, is one retuin from advertising. Indeed, the role of pre¬ 
miums as quality guarantors provides foundation for Nelson’s (1974) 
argument that advertising, by definition, supplies valuable informa¬ 
tion to ct)nsumers—namely, information that the firm is advertising. 
A sufficient investment in advertising implies that a firm will n(.)t 
engage in short-run quality deception since the advertising indicates a 
nonsalvageable co.st gap between price and production costs, that is, 
the existence of a price premium. 'I'his argument essentially reverses 
Nelson’s logic, ft is not that it pays a him with a “best buy” to advertise 
more, but rather that advertising implies the supply of “best buys,” or 
more correctly, the supply of promised high cjuality products. Adver¬ 
tising does not directly “signal" the presence of a “best buy,” but 
“signals” the presence of firm-specific selling costs and therefore the 
magnitude of the price premium. We would therefore expect, ceteris 
paribus, a positive correlation not between advertising intensity and 
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“best buys," as Nelson claims, but between advertising intensity and 
the extent of quality that is costly to determine prepurchase. 

Conspicuous sunk costs such as advertising are, like all sunk costs, 
irrelevant in determining luture firm behavior regarding output 
quality. However, consumers know that such sunk costs can be 
profitable only if the future quasi rents are large. In particular, if the 
consumer estimate of the initial sunk ex|)enditure made by the firm is 
greater than the consumer estimate of the firm’s (jossible short-run 
cheating gain, then a price premium on future sales sufficient to 
prevent cheating is estimated to exist. Our analysis therefore implies 
that independent of excludability or collection costs, advertising that 
guarantees quality will be sold at a 7ero price and “tied in” with the 
inarked-up product being advertised.'® 

Our theory also suggc.sls why endorsements by celebrities and other 
seemingly “noninformative" advertising such as elaborate (obviously 
costly to produce) commercials, s|x>nsorships of telethons, athletic 
events, and charities are valuable to consumers. In addition to draw¬ 
ing attention to the piocluct, such advertising indicates the pre.sence 
of a large sunk “selling" cost and the existence of a price premium. 
And because the ciucial variable is the consumers’ estimate of the 
stock of advertising capital (and not the flow), it also explains why 
firms advertise that iliey liave advertised in the past (e.g., "as seen on 
' fhe Tonight Show”’). Rather than serving a direct certifying func¬ 
tion (e.g., as recommended by Good Howekeeping magazine), informa¬ 
tion about past advertising informs consumers about the magnitude 
of the total brand name capital investment.'^ 

Firms may alst) provide valuable information by publicizing the 
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large Ices paid to eelebrities for commercials. Information about tarije 
eiidoi semeiit fees would lie tloselj guaided if the purpose were to 
simulate an ■•unsolicited endorsement" of the products particular 
quality characterisiics rather than to indicate the existence of a prkc 
premium. Viewed in this context, it is obviously unnecessary for the 
celebrity to actually use the particular brand advertised. This is con¬ 
trary to a recent FTC ruling (see Federal Trade Commission 1980). 

litis analysis of advertising implies that consumers necessarily re¬ 
serve something when they pay a higher price for an advertised 
brand. An expensive name brand aspirin, for example, is likely to be 
better than unadvertised aspirin because it is expensive. The adver¬ 
tising of the name brand product indicates tbe presence of a current 
and (utiire price premium. This premium on future sales is the firm’s 
brand name capital which will be lost if the firm supplies lower than 
anticijjated quality. Therefore, firms selling more highly advertised, 
higher priced products will necessarily take more precautions in 
production.'** 

We have emphasized the informational value of advertising as a 
sunk cost. Other marketing activities can serve a similar informational 
role in indu ating the presence of a price premium. For example, free 
samples, in addition to letting consumers try the product, provide 
information regarding future premiums and therefore anticipated 
totality. Such free or low-price samples thus provide information not 
solely to those consumers that receive the .samples but also to anyone 
aware of the existence and magnitude of the free or low-price sample 
|jrogtam. Mote generally, the supply by a firm of quality gieater than 
anticipated and paid for by consumers is a similar type of brand name 
capital investment by the firm. By forgoing revenue, the firm provides 
inlormation to consumers that it has made a nonsalvageable invest¬ 
ment of a particular magnitude and that a particidar future premium 
stream is anticipated tf) cover the initial sunk alternative cost.'" 


The gu'Hlci' IS ihc cost to consuinc-rs of olxdiiiitig tlcteptively low quality, the 
gicMlc't will lx- ihe (IriiiuiKl for (|ualiiy assiiranco 1 he very low market .slurc of 
“generic” children's aspirin (1 percent) vis-a-vis gencnc’s share of the regular aspirin 
market (7 percent) is consistent with this implication (see IMS America, Ltd. 1978) 
Many individuals who claim "all aspirin is alike" apparently pay the extra price lor their 
childien where the < osts ol lower quality aic gieatei and ihercfoie quality assurance is 
(onsidered mote nnporlaiil. 

Oiir analysis of adveitising also illuminates the monopolism competition debate. 
(ihanilK'rlin’s (196.5) distinction between production costs, dehried as what is included 
in the “paikagc” lhal passes from seller to buyer, and selling costs (e.g,, advertising), 
which are not part ol the package transferred, suggests that selling costs uselully may 
l>e consideied as a privately supplied collective factor. For example, a linn which holds 
selling costs, siu h as expenditures on a store sign, constant as his sales increase dt>es not 
appear to be decieasing the average “quality" of his product. Dcinsetz (1959, 1968) 
made the contrary as.sumplion lhal average quality does fall as sales increase, holding 
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^cncj'iilly uiivcco^niy-cd impi)i'i^occ of tnctcast^d rnarlict piiccs 3iid 
nonsij/vageaWe capilul as possible tneth(xis of making quality prorn- 
isfs credible. We obviously do not want to claim that consumers 
“know ” iliis theory in ihe sense that they can vcrbali/e it but only that 
they behave in such a way as if they recognise the forces at work. They 
may, lor example, know from past experience that when a particular 
type of investment is present such as advertising they are much less 
likely to he deceived. Therefore, survivorship of crude decision rules 
over little may produce consumer behavior very similar to what would 
lx; predicted by this model without the existence of explicit “knowl¬ 
edge" of the forces we have examined. 

Our analysis implies that consumers can successfully u.se price as an 
indicator ol quality. We are not referring to the phenomenon of an 
ignorant consumer free riding on the information contained in the 
market price paid by other more informed buyets but rather to the 
fact that consumer knowledge of a gap between firm price and sal¬ 
vageable costs, that IS, the knowledge of the existence of a price 
premium, supplies quality assurance. I he former argument, lliat a 
naive buyer in a market dominated by knowledgeable buyers can use 
pi ice as a quality signal because the relative market price of dif f erent 
products reflects differences in production costs and therefore dif fer¬ 
ences in cjuality, crucially depends upon a “majority" of the buyers in 
tlie market being knowledgeable. 

As Scitovsky (194.5, p. 101) correctly notes, ". . . the situation 
(lecomes paradoxical when price is the index by which the average 
buyer judges (piality. In a market where this happens price teases to 
be governetl by tompetition and becomes instead an instiument 
wlierewith the .seller can iiiHuence his customer's opinions of the 
cjuality of his wares.” However, even when the “aveiage" buyer uses 
price as an index of quality, we need not fear, as Scittivsky does, the 
havoc this supjxi.sedly wreaks on the economic theory of choice. All 
consumers in a market may consistently use price, given their esti¬ 
mates of salvageable production costs, as an indicator of the firm’s 
price-premium stream and therefore as an indicator of the antici¬ 
pated (juality of the output to be supplied by the firm. Scitovsky did 
not consider that price not only influences buyers’ expectations but 
also influences producers’ incentives. 

We do not wish to suggest that use ol implicit (price premium- 
specific investment) contracts is always the cheapest way to assure 
quality supply. When quality characteristics can be specified cheaply 
and measured by a third party, and hence contract enforcement cfists 
are anticipated to be low, explicit contractual solutions with gov- 
ernmentally enforced penalties (including warranties) may be a less 
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costly solution. W he n expl ic i i «mt r x .-,. 

short-run prohl from decepiivdv In of 

r, • occcpiively low quality supulv and hence the 

quahty-assunng price premium is also high. Kovir'nniental speuht 
non and enforcement of minimum <,ual.ty standards may Ite an alter- 
native method ol reducing ihe costs of assuring the supply ot Itigh 
quality pioducls.^ And, finally, vertical integration, which in this 
consumer-product context may consist ol home production or con¬ 
sumer cooperatives, may be a possible alternative arrangement (see 
Klein, (irawt'ord, and Alchiaii 1978). 

I he three major methods in which to organize transactions can be 
uselully considered within this framework as (n) explicit contractual 
or regulatory specilicatioii with third-party enlorcemcnr, (/>) direct 
(two-party) enlorcemem of implicit contracts, and (r) one-party or¬ 
ganization or vertical integration. This paper has analyzed the brand 
name repeat-purchase mechanism reptesented hy the second alter¬ 
native. More geneially, however, all market transactions, including 
those “within" the hrm such as employer-employee agreements, con¬ 
sist oi a combination of the two basic forms of contractual arrange¬ 
ments. Sotne elements of performance will be specilietl and enforced 
by third-party saiicliotts and other elements enforced without invok¬ 
ing the power of some outside party to the transaction but merely by 
the threat of tetmination ol the iransaciional relatiotiship. 

Our analysis implies tliat, given a particular level ol explicit contract 
costs, we are more likely to observe an increased reliance on the brand 
name cotitract-eiiforcemeiu mechanism the lower the rate oi interest 
and the lower the level oi prepurchase quality-determination costs. 
The lower the interest rate the greater the capital cost to a firm Irom 
the loss of future sales and therefore the lower the eqmlihriuin price 
premium. Hence we can exfiect llie termination ol tuiuie exchange 
method of enforcing contracts to he more ellective. Mote generally, 
since the interest rate in our model refers to the period ol product 
vepurchasc, the quality as,suiance will he less costly lor less durable 
goods that have greater repurchase frequency. Kranchismg chains, 
lor example, take advantage of this el led by making it possible lor 
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consumers to pool information from sales of seemingly disparate 
sellers, thereby decreasing the period of repurchase and the quality- 
assuring price. 

Similarly, purchase f rom a diversified firm increases the frequency 
of repeat purchase and lowers the necessary price premium. As long 
as consumers react to receiving unexpectedly low quality from a 
diversified firm by reducing purchases of the firm’s entire product 
line, all the firm’s nonsalvageable capital serves to assure the quality of 
each product it produces. This economy of scale in communicating 
quality-assurance information to consumers may be one motivation 
for conglomerate mergers. If a firm sells a set of products, each of 
which is produced by capital with salvage value less than costs, the 
(]uality-guaranteeirig price premium on each product will be lower 
than it production were done by separate firms. 

finally, we can expect greater reliance on the non-third-party 
method of contract enforcement the lower the direct costs to the 
consumer of determining quality of the product prepurchase. The 
highei the costs ol pioducing the minimum quality output that can¬ 
not he distinguished prepurchase from a given promised high quality 
output atid the faster these minimum quality production costs rise 
with increased out|iut, the lower the potential short-run cheating gain 
and therefore the lower the price premium. When the low quality cost 
(unction is such that a cheating firm can expand output a substantial 
amount with little increase in cost, use of the brand name enforce¬ 
ment mechanism is unlikely. 

When the low quality cost function liectmies so Hat that the pre¬ 
mium solution does not exist, the implicit contract-enforcement 
mechanism we have analy/etl will not be used. When this condition is 
combined with an extretnely high cost of quality assurance via explicit 
contractual guarantees, governmental supply may be the cheapest 
alteinative. Ati obvious example is the good “money,” where the 
marginal cost of production is essentially zero, the short-run cheating 
potential extremely large, and where the cost of a commodity money 
or the necessary bullion reserves to assure performance via converti¬ 
bility is also extremely high. Governmental supply is the generally 
adopted but far Irom co.stless solution (see Klein 1974). Other prod¬ 
ucts where the “hold-up ” potential is very large and where explicit 
contract costs arc high (such as police or fire protection services) are 
also generally supplied by non-profit-maximizing government agen¬ 
cies rather than by unregulated profit-maximizing firms earning large 
quasi rents on unsalvageable (firm-specific) capital assets. In general, 
minimization of the cost of a.ssuriiig performance will imply an opti¬ 
mal combination of governmental regulation and/or supply, explicii 
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contractual enforcement, vertical integration, and the implicit (brand 
name) contractual enforcement mechanism we have analyzed. 


Appendix 

Simulation of the Quality-assuring Price 

if we a,ssume that output of high and low (minimum) quality is produced by 
constant-elasticity cost functions of the form: 

+ (Al) 

C, = F, + (A2) 

the quality-assuring price premium. P, will be given by:“ 


1 + r 


I'hi.s expre.ssion indicates that as the ratio of low to high quality fixed costs, 
(f,/f»), decreases the quality-guaranteeing price-premium increa.scs (because 
the short-run profit from cheating increa.scs). But, as long as Fi ran be 
assumed to be less than or equal tof'*, fixed costs cannot affect the existence 
of the quality-guaranteeing price premium. Similarly, as the interest rate, r, 
increases, the quality-guaranteeing price pieinium increases hut will always 
exist. It is the mat ginal cost elasticity. (l/(a - 1)1, and the ratio of the marginal 
cost slopes, ()3(/^i,), that determine the existence ol a quality-assuring price 
premium. An increa.se in the elasticity of marginal cost or a decrease in the 
ratio of the low to high quality marginal cost slows, by inci easing the possible 
expansion of the low quality output at the high quality-guaianteeing price, 
increases the quality-guaranteeing price premium and the likelihood ih.it it 
may not exist. Simulation results as a function of lhe.se paiamelers are pre¬ 
sented in table Al below. The ratio of the quality-assuring price lo ihe 
minimum average cost of high quality production. P-.IP, m terms of figure 1, 
along with the ratio of low quality oiilpui at the quality-assuring price relative 
to the minimum high quality average cost output. x,lx, in terms ol figure 1, is 
presented. When the quality-assuring price does not exist, the ratio ol low 
quality output at a P,iPr ratio of 2 to the output at the mimmum aveiage cost 
of low quality output, x„, is presented in brackets to indicate the shape o 1 e 
low qu^ity cost function. The lesulls indicaie that these cases of nonexastence 
generally (Kcur where the low quality co.st curve is so Hat relative 10 '^e high 
(luality cost curve that cheating output can lie expanded drainalically lelati e 
.0 the nonchealing output. For example, when the marginal cost claslu.l^s 
assumed to be 10.0 and the ratio of marginal cost slopes is assumed to 
.1 implies that low quality output can be profitably expanded by 
billion times beyond its minimum average cost rate when the market puce 
double the perfectly competitive high quality price. 


” The derivation i.s available to readers upon request. 
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Specifically, if the bias works to the lienefit of some identifiable group, 
then why do political at tors not insist on etticicncy in combinaiion 
with a compensation scheme? I'his would leave the idcniiried group at 
least as well off, all others at least as well off, and then leave the 
efficiency gain at the political distretion of representatives, thereby 
enhancing their role. From the anomaly of sustained political inelfi- 
ciency, it appears that there is a divergence between normative eco¬ 
nomic principles, on the one hand, and the preferences of political 
actors on the other. Prominence is given in the following analysis to 
the political mechanisms that create and maintain this divergence. 

In this paper, we develop a model of the public choice mechanisms 
comprising a repiesentative legislature in order to show the political 
sources that systematically bias public decisions toward larger than 
efficient projects in the area of distributive policymaking. By distribu¬ 
tive policies we mean those projects, programs, and grants that con¬ 
centrate the benefits in geographically specific constituencies, while 
spreading their costs across all constituencies through generalized 
taxation,^ This collection of public decisions includes the traditional 
iRirk barrel of public works and rivers and harbors projects as well as 
the more recent examples of highway construction, caiegonca 
grants-in-aid, nrljan renewal, mass transit, and sewage ireaimeiil 
plants. The model identifies the political sources of efficiency bias— 
and there are several—by unpacking the democratic institutions into 
their components, thereby fix using on the influence of each. This 
takes the form of a progression of models beginning with an effi¬ 
ciency benchmark. I’hen. one by one, political features arc added 
until the final form models a representative legislature divided into n 
districts. The approach shows how pslitical insututions ^ 

economic basis of costs and benefits into political costs and benefits. 
The latter, and not their economic counterparts, define rational deci¬ 
sions for political actors. 

The model reveals three important sources ol bias, ^he brsl is 
consequence of the political definition ol benefits and J 
divergence, in important respects, from the economic de iniiion. 1 he 
second source stems f rom the districting mechanism which ^s 'he 

economy into n disjoint political units called districts. The '"^hod of 
project financing through generalized taxation 

source of bias. Moreover, we show that the mechanism of j^opular 

election of legislative representatives ion 

bias so that these three sources, in cc.njuncuon with the reelcction 

mechanism, explain the inefficiency of political choice. 

> See l.>wi (1964, fur a d,son.,.on of d.flerem k.nd, of ^ based on ,he cha.ac- 
wn,tic, of bcnefinanc, as well as ihe mechamsms ol finam nR. 
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The General Approach 

Our model is one of policymaking in the realm of distributive policy, 
so we begin by clarifying that concept. A distributive policy is a 
political decision that concentrates benefits in a specific geographic 
constituency and finances expenditures through generalized taxation, 
rhese policies—sewage treatment plants, land reclamation, the rivers 
and harbors omnibus, urban renewal projects—authorize collections 
of projects, each targeted to a geographic location and each generat¬ 
ing benefits in that geographic location unrelated to projects in other 
locations. This latter characteristic is crucial since the omnibus-like 
quality of distributive programs allows decisions to be made on a 
project-by-prqject, locality-by-locality basis. Each project of the om¬ 
nibus may be fashioned independent of others in the omnibus. 

While it is clear that all policies have a geographic incidence of 
benefits and costs, what distinguishes a distributive policy is that 
benefits are geographically targeted. In contrast, a nondistributive 
program, say an entitlement program, though having geographic 
incidences, is fashioned with nongeographic constituencies in mind, 
for example, .socioeconomic groups. Subsidies to beekeepers, for 
example, generate a distribution of benefits that depends on the 
geographic distribution of Iteekcepers. No geographic area has a 
claim on program benefits except as it contains residents in the enti¬ 
tled category. Thus, by our definition, programs targeted to the 
malnourished (food stamps), the unhealthy (Medicare), the poor (wel¬ 
fare), the retired (social security), the injured worker (workmen’s 
compensation), or the automobile driver (automotive product safety) 
are not distributive policies becau.se any citizen may obtain program 
benefits if he falls in the specific category. Thus, an entitlement 
program confers fjenefits on all individuals in the designated cate¬ 
gory, benefits which may not be varied without similarly varying them 
for others in the category. Although the motivation to create nondis¬ 
tributive programs may have a geographic basis (as when a politician 
is moved to suppoi t a policy Irecause many of his constituents fall into 
the beneficiary group), the fact remains that the beneficiary group is 
not geographically defined or determined. In contrast, geography is 
the hallmark of distributive politics; Programs and projects are geo¬ 
graphically targeted, geographically fashioned, and may be inde¬ 
pendently varied. Importantly, geography is also the basis for political 
organization and representation. 

A distributive policy for the; th district, Pj (x), is a project located in 
that district, where x is a decision parameter. Although x may be 
treated as a vector of project characteristics, we assume that x simply 
describes the scale or size of the project. Associated with the project | 
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Pj(x) are both benefits and costs. Let b[x) represent the present value 
of the economic benefits which How from the project to the particular 
political constituency. T his includes consumption benefits, say cleaner 
water from a sewage treatment plant, and potential pecuniary gains to 
producers, for example, increased profits to project input owners 
from price rises in factor markets.^ 

Assumption 1; b'(x) > 0, b''(x) < 0. 

Let c(x) represent the total resource cost involved in producing the 
project. It decomposes into three components, c(x) = C|(x) + Cjj*) + 
C 3 (x). The first component, r.,(x), is the real resource expenditures for 
project inputs spent in the constituency in which the project is located; 
C 2 (x) is the real resource expenditures for project inputs spent outside 
the district; Calx) is nonexpenditure real resource costs imposed on 
the district (e.g., nonpecuniary externalities, such as the destruction 
of the natural environment, and pecuniary externalities in the form 
of price rises to consumers in factor markets).^ 

Assumption 2: c,'(x) > 0, cl'fx) » 0, i = 1, 2, 3. 

The expenditures are financed iltrougli taxes st> that the lax bill tor 
Pj(x) is 

T(x) - c,{x) + c^ix). (1) 

We assume a tax system that covers all expenditures, assigning non¬ 
negative tax share tj to the ith district, where lUi li ~ ' 
number of districts. The tax bill for the ith district lor the project 
Pi{x), therefore, is t,[c,(x) + CjfxlJ. 

As the above development suggests, there are several mechanisms 
at work in the realm of distributive policy which our model captures. 
First, economic benefits are geographically concentrated in a politi¬ 
cally relevant way. Second, production costs may be unpacked, again 
in a politically relevant way. Some costs are extracted irom the econ¬ 
omy and returned as geographically earmarked expenditures—cd-r) 
and rdx); other costs are nonexpendiuiie in nature, on the 

local economy in which the project is loiatcd—T hird, the tax 
bill, T(x), is paid for by cacli political suixlivision according to the ux 
.shares t,,t = 1. n. Ihe relevant mechanisms which we examine 


" Some sludeius ol tosl-benefil aiialyis. e « . McKean (1958, c!i.ap. 8). ai Ruc ag-jmsi 
in<luding pecuniary cxiernal elfecis m ihe calculations claiming 
. on.stitute Sributional elfecis that arc not germane lor elhcicncy 
l.ealmenl of pecuniary cxiernal ettcus rcnains controversial in ^ 

Our resulu reined in ih.s papr-rdo no. depend on ho« 

> 1 here is a fiiurth component ot costs, namely, nonexpenditurc real lesoiu.c lo.t 

that spill over into other poliucal tonstiluencies. Since P^x) is a ‘^”f'7,77'*s«tion*Vor 
ise Ignore this for the pr^seiu. We return to this point m the 
.iiialysis of other kinds of policies not possessing the paiiitular piope 
projects, see Shepsle and Wcingasi tl980). 
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below are (1) the political cost-accounting mechanism, (2) the dis¬ 
tricting mechanism, and (3) the taxation mechanism. After examining 
these mechanisms and their effects on the characteristics of distribu¬ 
tive policy, we explore the complementarity of legislators’ reelection 
objective. First, however, we develop the familiar efficiency criterion. 


Model E: Maximizing Economic Efficiency 

The benchmark for the entire set of political institutions developed 
below is the efficiency criterion. This requires the maximization of 
economic net benefits. This is given simply by 

max E{x) = b{x) — c(x). (2) 

The familiar first- and second-order conditions are 

b' -c' = 0, 

and (3) 

h" - c" < 0. 

The second-order condition follows directly from assumption 1 and 
assumption 2 so that the solution to (2), x''- in figure 1, is a unique 
global maximum. 


The Politicization of Expenditures 

I'he first transformation of the .standard approach is the (xiliticization 
of economic costs. This transformation rests on a crucial political 
property. Project costs, paid from general revenues, become geo¬ 
graphically earmarked expenditures. Fhe political process distributes 
those expenditures in the Ibrm ol c,-type and r 2 -typc costs. Thus, for 
F’j(x) in district ;, production inputs are purchasec] from firms and 
individuals in the district (C|) and from their counterparts outside the 
district (c.^). More important is the political evaluation of the distribu¬ 
tion of geographically earmarked expenditures. While these expen¬ 
ditures are not lump-sum transfers of wealth to factor owners, they do 
entail pecuniary gains since they represent increases in demand for 
project input factors, thereby driving up their prices."’ Not only does 


' This statement presumes that factors of prodtiction are geographically hxed in the 
short run. This does not preclude the bidding away of pecuniary gains as the long-run 
supply of faclots adjust.s to this change in demand. For .some projects, which are oik* 
shot and nonrefietitive in nature, the short-run analysis holds since the long-run 
adjustment pnxess is attenuated. For other programs, in which a permanent increase 
in demand has occurred (e.g., the continuing flow of military prfK uremenl projects to a 
district), long-run market forces adjust with the concomitant bidding away of pecuniary 
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the public project entail new business for input owners; it allows them 
to receive a higher price lor the sale of inf ramarginal units as well.® As 
this discussion suggests, geographic expenditures arc important be¬ 
cause they distribute pecuniary effects. Consequently, the distribu¬ 
tional effects of local expenditures combine with consumption 
benefits in the district’s valuation of a project. Put simply, a dollar’s 
valuation of a project may come in either of two forms: a pecuniary 
gain to a factor owner or a benclii to a project consumer. Partly as a 
consequence of the.se distributional effects, and partly lor additional 
reasons enumerated laelow, the political evaluation of pecuniary ef- 
lc(’is diverges fn)in their economic lre<itinent. 

We may distinguish several clas.ses of agents who are dilfercntially 
advantaged or harmed (in addition to their tax obligations) by the 
provision of Pj(x): (1) m-dv,tnct comumen, who receive beneiits 
thioLigh consumption of the public project but are unattected by 
pecuniary effects; (2) m-duslnft factor owners, who obtain pecuniary 
gains in production of, as well as benefits in consumption from, P/.x); 
(3) out-of-district jewtor owners, who obtain pecuniary gams in produc¬ 
tion (but no consumption Ijenefits since they do not reside in the loca 
constituency); (4) in-district consumers who tnakejactor market purchases, 


inaiiift'stccl in an 


KMins. Ncvcrihei^ss. |KiUticM rff'cti is lUc same. ihoiiRh now i 
aveision lo pecuniary losses sutlcred it deniaiul were lo coniiat 

'’'"'rtr^are pecunory a, well-nanwly. mher asei, ol pmieci taeiors .ho 

experience risinj^ pnccs. 
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who obtain consumption benefits from the project but suffer 
pecuniary losses in the form of higher prices for factors; and (5) 
out-of-district purchasers of factors, who suffer pecuniary losses through 
higher prices in factor markets (and who, moreover, obtain no con¬ 
sumption benefits since they do not reside in the district). Public good 
theorists like Samuelson, Bator, and Head, in their market-failure 
models of public good provision, typically focus only on type 1 agents 
and the nonappropriable product of public projects. Cost-benefit 
studies and welfare analyses (Prest and Turvey 1965; Mishan 1976) 
also focus on type 1 agents, limiting the role played by the other four 
types. These studies compute costs and benefits in consumption fol¬ 
lowed by an appropriate adjustment for pecuniary effects. But these 
studies may have mis.sed the point. As Aranson and Ordeshook (1978) 
have emphasized generally, and Stigler (1971) and Peltzman (1976) 
for the case of regulation, it may well be the appropriable pecuniary 
gains and losses of factor owners and competing factor users, respec¬ 
tively, that di ive the political production process. In this latter view, 
the consumption benefits of the public project—clean water, mass 
transit, or whatever—are a by-product of factor owners and factor 
users seeking pecuniary gains and the avoidance of pecuniary losses, 
respectively. 

How these pecuniary effects are distributed geographically, and 
whether they are gains or losses, have different kinds of political 
impacts. Since political representation is geographic, legislators care 
about who gains and who loses in proportion to their geographic 
locations. We assume that local gains and losses are politically more 
significant to the legislator’s objectives (reelection and constituency 
service) than nonlocal effects. Hence representatives use whatever 
legislative influence they can exercise on behalf of those affected 
locally by pecuniary effects. 

Additionally, there are .several reasons to believe that pecuniary 
gains are exaggerated and pecuniary losses diminished in the repre¬ 
sentative’s political calculus. They relate to the concentration of 
pecuniary gains and the dispersion of pecuniary losses. First, in what 
might be termed the "Robert Moses ef fect” (named after that famous 
New Yorker who appreciated and exploited it so effectively), is the 
observation that pecuniary gains in the form of increased jobs, profits, 
and local tax revenues go to named individuals, firms, and localities 
from whom the legislator may claim credit and exact tribute. 
Pecuniary gains may be targeted to constituents; pecuniary losses, on 
the other hand, are often more widely dispersed, falling on con¬ 
stituents and nonconstituents alike. Second, pecuniary losses, princi¬ 
pally through higher prices in factor markets, are not always fully 
linked to the effects of increased factor demand from the project in 
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question. Indeed, the illusion may l)e such that pecuniary losers are 
unable to distinguish the source of their losses f rom general price 
inflation. Hence there is a perceptual asymmetry between pecuniary 
gains and losses. Accompanying this asymmetry in perception is an 
asymmetry in capacity to convert perceptions of gains and losses into 
political influence. Third, then, as Peluman (1976) has noted in 
another context, gainers typically are smaller in number, more cohe¬ 
sive in political interest, and, consequently, better organized jjoliti- 
cally. They are capable of rewarding the local legislator for delivering 
the bacon in a fashion in which pecuniary losers are unable to punish. 
The combined impact of the Moses, the illusion, and the Peltzman 
effects is an exaggerated political importance accorded local 
pecuniary gainers. The local legislator, then, is strongly encouraged 
to generate projects with large c,-type components (vs. Cj-type) and 
tends to be less concerned with associated pecuniary losses (vs. 
pecuniary gains). 

Since, on the arguments above, there is political value in securing 
local expenditures for their own sake, the representative, in assessing 
the project Pj(x), incorporates, on the benefit side of his political 
calculus, both the consumption benefits his constituents obtain, b(x), 
and the politically distorted pecuniaiy effects. Since the latter depend 
on local expenditures (c,fx]), we write it as/(c,(x)]. For the pro|>osi- 
tions below, we stipulate7[c,(.v)} = C|(x) to enable the clear presenta¬ 
tion of results unencumbered by mathematical detail; our results are 
qualitatively unaffected so long as /' and /' are positive.’ Thus, the 
representative of the single political constituency j seeks a piojec t scale 
that maximizes his political maximand which, in turn, depends on his 
constituency’s benefits minus its share of the ctrsts: [h(x) + C|(x)J - 
[qr(x) -f- c,(x)]. 


Model P: A Single Politiral Comtituency 

Consider the case of a single national constituency. Then b = 1 and 
expenditures earmarked for the constituency now consist ol all ex- 

' A slraiKhtforward comparative suaics analy>.,s, which we do not pm.sne here, would 
examine how alternative s|xxihcation.s of / aticet optimal choices. Such an analysis 
would conhrm that as long as/ is positive, the qualiutive nature ol our '7"'^ 
noted, tt IS not too misleading to ec,uate /(r,(xll with r,(x) or some In ea. lum t.on 
thereof namelv /[c.(*)I = "> 'fie relevant range. But only up to a pomt. n 

expenditures yield not only consumption benefits but P dollars per •*’ 
addiuonal ■'Jnefit." then, subject only to n 

moLvaied to expend the entire GNP, clearly an absurdity. The * 

meant to represent the political (read; electoral) advantage ' 

who delivers ed*) dollars of public expc.^.ture 
must tail off so that, in a general mathematical analysis./f d )] 
diminishing returns. 
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penditures, that is, fi(3c) + C 2 (x). With these assumptions and the 
identity in (1), the political objective function reduces to 

inaxP(x) = [A(x) + cdjc) + c^ix)] -[c^x) + c^ix) + Cslx)] 

(4) 

= 6(x) - Ci(x). 

The first-order condition is 


y - t;'. = 0. (5) 

and tfie second-order condition requires that 

y - cj < 0. 

'I'he solution to equation (5) yields point x'’ on (igure 1. The following 
theorem shows that x'' > x'"'. 
riiKORF.M 1: v'’ > x''-. 

Prooi: From assumptions I and 2, x''" is the unique global 
maximum of £(x) = /»(x) — C|(x) — C 2 (x) — C;,(x). Consequently, h'{x''‘) 
— rKx'' ) - Cafx'') — oKx'’-) = 0. From assumption 2, cl(x'' ) + c^ix''') > 
0; therefore, P\x'') = h'(x'') ~ Cs(x*) > 0. But this violates the 
first-order condition fora maximum of£(x). Indeed, from the prop¬ 
erties of strictly concave functions, since P'(x^-) > 0, then either (i) 
P‘{x) — 0 for X = x^ > x^ , or (ii) P'(x) = 0 for no hnite x. In the first 
( ase,x'’ is a global interior maximum which exceedsIn the second 
case, there is no interior maximum since P(x) increases without 
bound. A foi tioii, a piojcct stale larger ihanx'^^ is preferred. Q.F,.D. 


Model N: Non K.sl e PlunbtLS Unum, or Every District for Itself 

The next stage in oui' examination of political mechanisms is to 
partition the single national constituency into multiple, disjoint politi¬ 
cal units called “districts” with representation in a legislature. Each 
district, through its representative, is presumed to maximize its net 
(private) benefits without regard to the costs imposed on other dis¬ 
tricts. Publicly supported projecLs are funded through taxes which fall 
primarily on other districts. Hence, the benefits are concentrated 
while the costs are diffused. 

Any political choice institution consisting of representatives of mul¬ 
tiple, disjoint constituencies is characterized by a principle of 
aggregation—that is, a voting or decision rule—and by the substan¬ 
tive choices made under that decision rule. There are several litera¬ 
tures which address these issues.* The theoretical thrust of these 

" These include research on logiolling, constitutional choice, and distributive 
policymaking. There is a large literature on each of these topics, but Buchanan and 
Tullock (1962) reiiiain.s the best introduction and overview. 
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literatures is twofold; (1) Why is one institutional decision rule em¬ 
ployed rather than another? and (2) What are the policy consequences 
of a given decision rule? Regarding the first, with special emphasis on 
majority rule, theorists have focused on the instability of decisive 
coalitions, the ex ante uncertainty surrounding the composition of 
winning coalitions, and the cost of organizing and monitoring coali¬ 
tion partners. Their express purpose is to address how institutional 
actors seek, through suitably arranged institutional practices, to im¬ 
prove their circumstances by evolving coping strategies, formal rules, 
and other mechanisms (called norms in the sociological literature, 
rules or binding commitments in the game theory literature, and 
contracts in the property rights literature). The second question takes 
institutional practices as fixed and examines the policy decisions im¬ 
plied by those practices. The historical origins of and rationales for 
institutional rules are of little concern in these latter analyses. 

Our chief concern is the second question in which we assess the 
project choices of a legislature in the distributive policy area. In those 
policy areas characterized by a project-by-project orientation, the 
geographic concentration of benefits, and the diffusion of costs, there 
is abundant evidence that universalisin and reciprocity are prevailing 
derision rules in the U.S. Congre.ss. The foimer practice assures any 
interested distiict a project: the latter, in recognition of the fact that 
district differences translate into different policy priorities, facilitates 
a process of mutual support and logrolling. These two practices 
comi)ino to permit packages of distinct projects earmarked for in¬ 
terested districts to obtain the support even of those without a stake in 
the package in exchange lor reciprocal treatment. Empirical studies, 
moreover, repeatedly observe the operation of universalistic criteria. 
Exam|jle.s inchide the pork bai rel of rivers and harbors (Maass 1951; 
Ferejohn 1974); model cities and urban renewal (Plott 1968); tax 
loopholes (Manley 1970): the traditional tariff (Srhattschneider 
193.5); private member bills (Fioman 1967); military procurement 
(Rundquist 1973); and categorical grants-in-aid (Mayhew 1974). In¬ 
deed, some scholars observe a tendency to infuse controversial 
policies with distiibutive elements in order to build a more inclusive 

coalition.'* . , . • 

Elsewhere (Fiorina 1978; Wdngast 1979; Shepsle and Wemgast 

1981) the conditions under which institutional actors preler univer- 
salislic criteria to pure majority rule are derived. These to¬ 

gether with the preponderance of empirical evidence cited above, 
provide the basis for our focus on universalism and its policy conse¬ 
quences. First we explore the con.sequences of universalism in the 


"See Stockman (1975) and Fiorina (1978) lor illusiiatums. 
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extreme in which each legislator independently determines whether 
to propose a project and, if so, at what scale. The following section 
then examines the conditions under which there are incentives for 
legislators to coordinate project selection. Consider legislator j’s 
maximand regarding his district’s project: 

Nj(x) = b{x) + c,(x) - t}T(x) - c:,(x). (6) 

I'he legislator is presumed to maximize the district’s private benefits, 
b{x) + Ci(*). minus its share of the taxes, tjT(x) = (j[c,(x) + C 2 {x)], 
minus the externalities of the project which fall on the district, C 3 (x). 
This simply reflects the proposition that legislators are reelection 
oriented and that their prospects are positively associated with the net 
benefits they deliver to their constituents. According to this reelection 
mechanism, voter decisions correspond to a “what have you done for 
me lately?” evaluation. 

Maximization of (6) yields the first-order condition 
/>' 4- d - tjT' - d = 0, 


or 

b' + c't = t,(c[ + c-i) + ci (7) 

1 he second-order condition requires 

b" -t- d - 4- d) - 4 < 0. 

Equation (7) has a .solution yielding a maximum point x'’' for (6) 
dcpitted in figure I. The next theorem establishes that, in a wide set 
of circumstances, x '^ exceeds x''. 

I’liEOKEM 2: If Cl' > tj(c,' 4- cj), then x'^ > x'‘. 

I’rooi; Note that N{x) defined in (6) may be recast as 

N{x) = P{x) 4- c,(x) - h[c,(x) 4- Czlx)], 

where P{x) = b{x) - c,i(x). At the maximum of P(x), x — x^, 

P'(x'‘) = A'fx'') - d(x'’) = 0. 

1 his implies that at x x'" 

N'(x'') = P'(x'’) + c,'(x'') - lj[c[(x*’) 4- C 2 (x'’)J 

= c,'(x'’) - fifc.'fx'') 4- Czfx'')]. 

Therefore, the premise of the theorem implies thatiV'(x) > 0 for all x 

tq. (6) does not contain district;’* total benefits and costs, only those falling within 
Its control fhcorein 4 below establishes the innocuousness of this omission. 

'' This iiKxlel of voting is known as retrospective voting in contrast to the prospective 
voting model, initially (lopularized by Dtiwns (1957), in which voters respond to prom¬ 
ises for future policy. 
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Consequently, ifA^(x) possesses a maximum—call itx*^—thenx'^ 
> x'’. Moreover, if N{x) does not possess a maximum, then our 
theorem obtains a fortiori since N(x) then increases without bound. 
Q.E.D. 

The condition in the premise of theorem 2, cl > /^(cj + ci), can be 
presumed to hold in most circumstances. This condition requires that, 
in district-initiated projects, locally earmarked expenditures grow 
more rapidly with project scale than local taxes. Since tj is ordinarily 
quite small, only a modicum of imagination by legislators is required 
to find projects with this local expenditure characteristic. For exam¬ 
ple, if taxes are shared evenly by districts, then 1) = 1 /n, where n is the 
number of districts, and the condition becomes 

Cl > —(cj + Cj). 

n 

That is, so long as district-targeted expenditures grow with project 
scale at a rate at least I/nth as fast as total exfienditures, the premise 
(and hence the conclusion) of theorem 2 holds; in even moderate¬ 
sized legislatures this is a fairly weak condition. 

More generally, suppose t) = tj(z) where z might be any characteris¬ 
tic (examples of which are given below). Then the following com¬ 
parative statics result may be established. 

Theorem 3: Let x"' be the optimum for ( 6 ). d'hen dx^ldz > 0 if 
and only if t', < 0 . 

Proof; Rewriting ( 6 ), we have 

N(x) - b(x) + [1 - (j(z))c,(x) ~ (j(z)cz(x) - C 3 (x), 
and first-order condition 

JV'(x) = d'(x) -t- [1 - b(z)]c,'(x) - <j(z)c 2 (x) - C 3 (x) = 0. 

The second-order requirement is 

iV"(x) = 6 "(x) + [1 - tj(z)]c”(x) - tj(z)c 2 (x) - csjx) < 0. 

Totally differentiating the first-order condition yields 

{(»"(x) + [1 - /j(z)]c"(x) - tj(z)c 2 (x) - C;"(x)}dx 
= [cK-*) + ci(x)]i;(z)dz. 


Thus, 

^ ^_ [C|'(x) -h ri(x)] _ , , 

dz {b"(x) + [1 - fi(z)]c','(x) - Ij(z)f 2 (x) - cj(x)} ■' 

The numerator of the coefficient on the right-hand side is positive 
(from assumption 2) and the denominator is negative (from the 
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second-order condition). The coefficient, therefore, is negative, es¬ 
tablishing that the sign of dxfdz is opposite that of fj(z): 

/ j ( 2)<0 —^> 0 , 

t'(z) > 0-^-^ < 0. 
az 

Q.E.D. 

CoRoi.i.ARY (The Law of 1/n): If district tax share is a declining 
function of the number of districts (w), then the degree of inefficiency 
in project scale (x"* — x^-) is an increasing function of the number of 
districts. 

Proof; l.et/j = tj(n) with tj{n) < 0. Then, from theorem S,dx‘^/dn > 
0. Q.K.D. 

The corollary indicates that when taxes are apportioned as a de¬ 
creasing function of the niimber of political units—for example, tj{n) 
= 1/w for all^—then the optimum project .scale for any disti'ict grows 
as the polity is more finely partitioned into districts.Theorem 3, 
however, is more general, for it applies to tax mechanisms that may be 
the funcdon of any politically relevant characteristic. H a district’s tax 
share is a decreasing function of certain of its legislator’s institutional 
characteristics (membership or influence on tax-writing committee), 
political char acteristics of its representative (is he a committee chair¬ 
man? is he associated with the majority party?), or economic or demo¬ 
graphic characteristics of the district (pioportion of families below 
poverty level, proportion of population above age 65), then we can 
associate increasingly inefficient projects with particular kinds of dis¬ 
tricts as defined by these lax-relevant characteristics. In all these cases, 
theorem 3 indicates that the equilibrium scale of a district’s project 
(and, given the assumptions, its degree of inefficiency) changes with 
respect to some tax cr iterion in precisely the opposite way the tax 
share changes with res[>ect to that criterion. 


Institutional Incentives to Restrict Project Size 

We have just shown that decentrali/ed choice by a representative 
legislature characterized by a univeisalism mechanism, a tax-sharing 


This analysis piesuiiu-s thal the only ihangc m /Vfx) lollowing a ihange in the 
niimher ol districrs is ihe lax rale, tj. It, however, ri(x), Cjfx), and fa(x) depend on the 
eonhguraiion of disincts, then two countervailing tendencies may lie observed. In- 
treasing die number of districts (1) tiansforms some portion of c,(x) into r,(x) and (2) 
decreases the tax sfiare of the district. Since the hrsl cflecr reduces and the second 
increases the optimal project scale, the net ellect is ambiguous without further 
specification. 
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rule, and a political objective function yields a vector of projects x'^ = 

(*7.). Theorems 1 and 2 provide the conditions in which xj' > 

Acf, t = I, .... n. However, x?', the project scale for district i that 
maximizes its legislator’s political objective function (6), is computed 
in isolation of computations by other legislators; therefore, it does not 
take the expenditure and tax externalities (positive and negative) 
generated by those other projects into account. 

In this section, we seek to determine whether or not there is an 
institutional basis for restraining unbridled universalism.'^ In par¬ 
ticular, we turn to an examination of packages of projects and seek to 
discover whether there exists a package of constitutionally restricted 
projects,x' = (x5,. . . with the property thatx*^ >ix''' (where >i is 
the preference order of district i over packages of projects). To 
accomplish this, consider the complete political maximancl of legis¬ 
lator;, B^(x.. x„), which rewrites (6) to incorporate the effects of 

projects in other districts. The net benefits to dislrictj consist of 6-type 
benefits f rom its own project, 6(Xj); c,-type expenditures from its own 
project, Ci(-*j); C 2 -type expenditures spent in district^ from other 
projects, 2(,.j Cadxj); nonexpenditure costs from its own project r.j(xj); 
and its tax share of the total expenditures, tj S"=i [ci(x,) -t C 2 (Xf)J. Thus 


fl'(x,, . . . , x„) = 


6(x,) + f,(Xj) + ^C 2 j(X() 




tl 

Ol(Xj) + X + Cjix, )] 


(S) 


Bef ore considering the possibility of restrictions on project size, we 
first characterize choice under the complete political maximand, 

B^{x .x„). C.allingx* legislator j’s solution to the maximization of 

(8), we have: 

Theorem 4; x” = x'. 

Prook; Ecpiation (8) may be rewritten 


= {6{Xj) + c,(Xj) - C.dXj) - f,lc,(x,) + f 2 (Vj)]} 

-t- {X X If'(’f') + '■■d’Cf)) . 

Wi-j (,-j 

which, from (6), becomes 


= (V,(xd + 


If*) iftj 


(9) 


( 10 ) 


In .Shepsic and Wciiiga.st (1981), wc dcscritie rcsulis that dcmonstralc tlie ex ante 
superiority of univcrsalism lo pure majority lule in ihe eyes of each legislaior. Here we 
ask whether a restricted form of uiuversalism, m turn, is superior lo pure uiiiiersalism. 
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Since the terms in braces are not functions of Xj, the first- and 
second-order conditions far are the same as those for thus, 

their respective maxima are the same. Q.E.D. 

Theorem 4 establishes that our initial focus on Nj (instead of B^) 
involved no loss of generality for questions about decentralized proj¬ 
ect choice. Moreover, theorem 4 establishes that is a Nash strategy 
for legislator ; and that the project vector, is a Nash equilibrium.'^ 

Turning to the role of cooperatively imposed restrictions on project 
size, we state a characterization theorem that establishes the circum¬ 
stances under which districts (or their representatives) prefer a re¬ 
stricted collection of projects, x*^, to the collection under unrestrained 

universalism, x''. For the vector of prtyectsx* = (x^.xJ), let£'j(x*) 

represent the tax and expenditure effects of other projects on district 
;. Specifically, 

Ej{x^) = ^ + r 2 (xf)] - 

In effect, Ej(x'') is the tax bill for districtj for all other projects net of 
local expenditures to district; from these other projects—that is, net 
negative external costs. 

Fheorkm 5: x' >, if and only if Nfix-"^) — A^j(x'’) < Ej(x^) 

- 

I heorem 5 conveys the following idea: Legislatorj has an interest 
in sLibsiitutingacollectionof projectsx' = (x{,... ,x^’) forx''' = {xf ,..., 
xrj;) it and only if his district’s reduction in (xilitical benefits from the 
reduced scale of its own project is compensated for by a concomitant 
reduction in its burden of net negative external costs. There are 
several ways in which this may fail to hold for a given district 7 . For 
example, (i) district j has a sufficiently small tax share, Ij; (ii) districtj 
is, disproportionately, a source of project inputs for other districts, in 
which case the terms are large; and (iii) the political benefits for 

district 7 with project scale x?, b{x'j) + c, (x?'), are inordinately large. In 
each of these cases, the condition may fail, implying the absence of a 
unanimous preference for x' over x'''. Moreover, there do not appear 
to be any interesting properties as.sociated with the f amily of f unctions 
hix), c(x), and <(x) that satisfy the condition in theorem 5; nor is there 
any ex ante basis for supposing that the condition in theorem .'i will be 
encountered in empirical .settings. 

Despite the lack of unanimous preference for restriction under all 


I’hib view interprets our model of unrestricted universahsm as a j^arne. The choice 
set lor player ; is the size of his pro|ect, xj, when project choice is governed by a 
universalism mechanism The Nash equilibrium follows from the separability of the 
positive and negative externahiics of other projects m (JO). As a result, each district 
makes its own decisions without attending to the externalities it produces or consumes. 
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circumstances, legislators have something to gain by properly ac¬ 
counting for the pecuniary externalities of project scale selection. The 
next theorem provides some insight into the optimal set of projects 
which internalizes these effects. 

Theorem 6; maximizes B\x), given in (8). 

Proof: 

n n 

L = ^ 6(xj) + c^(x,) +^C 2 j(xt) - c.,(xj) 

J»1 I 

(=•1 J 

n 

J-1 

n 

The first-order conditions are 

~~ = 6'(x,) - C 3 (xj) = 0, j = 1, . . . , n. 

Noting that (dy.)/(dx,dxj) = 0 for i ^ j, the second-order conditions 
are 


= b'\x,) - cS(xj) <0, ;■ = 1.n. 

These equations and inequalities are the same as (5), which identifies 
the vector x'' in ligure 1. Q.E.D. 

Theorem 6 shows that total political net benefits are maximized 
with the vector x'’ = (x{’, . . . ,xj). Potential gains may be captured if 
the system of universalistic project selection is amended so that, while 
each district is assured a project, the project scale is determined as if 
there were but a single district (as in model P above). Notice that 

theorem 6 does not assert that x'’ >j x^,j = 1. n (substituting x'' for 

x', theorem 5 shows the restricted circumstances in which this will 
hold). What theorem 6 does assert is that a compensation scheme 
which redistributes net Itenefits is feasible so thaix'’ together with this 
compensation is preferred by all districts not only to x' but to any 
other omnibus of projects. 

It is occtisionally asserted that the distributive politics game is a 
prisoner’s dilemma in the economic sense—that unrestrained univer- 
salism produces a project package that is an economically inefficient 
Nash equilibrium, on the one hand, and is unanimously regarded as 
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less preferable than (x'‘ >j x^,j = 1,.... re) on the other. Theorems 
1,2, and 4 establish the first part of this assertion. However, retaining 
the political conceptualization of net benefits given in (8), theorem 5 
shows that the second part of the assertion does not always follow. 
The collection x’’- is not always unanimously regarded as preferable to 
X'''. More importantly, theorem 6 shows that even when all represen¬ 
tatives favor restrictions, ihepolitically optimal set of projects isx'’, not 
x''-. 

This di.scussion underscores the basic point of the paper. The 
efficient collection, though normatively attractive in welfare analy¬ 
sis, is not always behaviorally relevant. The implication of the political 
maximand (8) and theorems 1, 2. 5, and 6 is that legislators hold no 
brief for efficiency, per sc, either with regard to their own project 
selection or a package of such projects. 

Extensions and Discussion 

4 he model developed in the previous .sections roots the ineificiency of 
distributive |X)litics in democratic mechanisms and especially in the 
geographic basis of political constituencies. This latter feature pro¬ 
duces two independent sources of bias. First, locally targeted expen¬ 
ditures are counted by the local constituency as benefits. Second, the 
districting mechanism in conjunction with the taxation system pro¬ 
vides incentives to increase project size beyond the efficient point by 
attenuating the relationship between beneficiaries and revenue 
sources. A cooperative legi.slature has no incentive to remove entirely 
these .sources of inefficiency (beyond that described in the discussion 
following theorem 6). 

In this section, we examine several related themes and applications. 
Each of these is either an extension of our model to domains beyond 
that of traditional distributive (>olicy or a specialization of our model 
to substantively relevant cases of distributive policy. 

Congressional Limitation on Project Size 

A well-known behavioral mechanism has operated in Omgress since 
the 1880s that restrains the attempts of legislators to fund their pet 
projecLs. At the authorization stage, a universalism mechanism is at 
work—the annual omnibus public works bill, for example, typically 
contains authorizations for projects in most congressional districts. 
Following the passage of authorizing legislation comes the separate 
stage of actually appropriating monies. Here, the Committee on Ap¬ 
propriations systematically scales down each project (for a description 
of this process, see Fenno I1966J and Ferejohn [1974]). 
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Our model sheds some light on this well-established congressional 
practice. First, suppose the premise of theorem 5 holds, where x'' = 

, Hence, x'' >jx^,j = 1.n. Second, assume iegislalorj and only 

legislator j knows Xj* and Xj'. Theorem 5 implies that legislators 
unanimously favor a mechanism reducing project sizes from the un¬ 
coordinated choice, x'^. But there does not appear to be a straightfor¬ 
ward “demand-revealing” mechanism inducing legislators to an¬ 
nounce anything other than their maximizing Nash strategy, x'J. 
Thus, in the face of imperfect information, the simpler mechanism of 
scaling down all projects—the current practice of the appropriations 
committee—may yield a vector of project sizes less than x^ and pre¬ 
ferred by all legislators.*'’ 


Examination of Interesting Subclasses 

We mention briefly, as an indication of the utility of this approach, 
some interesting special cases of the political maxirnand (6); 

N(xj) = b(Xj) -i- - Ci(Xj) - /,[f,(x>) + CalXj)]. 

The details may be provided by the reader. 

1, Benefit tax. —This tax scheme requires the district to pay the 
entire cost of a strictly local public project {t, - 1). The maxirnand 
becomes 

N,{x,) = b{x,) - CafXj) - c,{x;), 

and assumptions 1 and 2 imply a project scale x”; with the property Xj 
^ xJ** ^ xj' < xj . Note that ifcj(Xj) = 0, thenx/ = xj' whereas, it “ 
0, then x*j = xj . This last fait suggests that a benelit tax in t onjuuttiou 
with no local expenditures is a sufficient condition tor public sector 
efficiency. 

2. Free-ride lax. — In this ca.se, tj = 0 and the ma.ximand is 

N(xj) = b{xfi + f,(Xj) - f,(x,), 

which implies a scale x'j > xj*. This case approximates local public 
goods, financed by user fees, where the Icx'al residents rarely number 
among the users. For example, if Yellowstone National Park were 
financed by user fees assessed states in proportion to their respective 

A complete analysis must resolve two issues. Fiim. as a consequence ol the waling 
down practice by ihc Appropnaiiom (auiuniuee, will the legislator stralegirally seek a 
project wale in excess ol .so that, when it is waled down, it will eventually reach si/e 
xJ? Altetnatively, are there sanction-v discouraging artihcial inllaiioii o! project .scale at 
the amhunzing stage? Second, are there sanctions available to be applied against the 
Appropriations Committee to ensure it does not scale down tcx> much? Sec Fenno 
(19^) for some empirical details. Fhese issues concerning demand revelation metha- 
nisms are theoretically intriguing but take us too far afteld u> lie dealt with here. 
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share of the population using the park, then Wyoming residents 
receive a cheap, if' not free, ride on taxes. 

3. Foreign aid versus military assistance. —We can interpret the politi¬ 
cal popularity of' the latter and unpopularity of the former via the 
following calculation. Let x'' and x*' describe levels of the two forms of 
aid; assume i(x^ ) = h{x^j), ceteris paribus, for any political constit¬ 
uency (assumed, in any event, to be small unless the constituency 
contains partisans of or emigrants from the benefiting country); for 
some constituencies (producers of military hardware), c,(x^) > 0 ; for 
all constituencies, on the other hand, c,(x j 0 ; for all constituencies, 
cMl) = cAx'',) = 0. Thus, 

A/(x-';') = 6(xY) + (1 - ti)c,(x^) - <jC 2 (x^), 

jV(xi ) = />(.vi ) - IjC 2 (xI’). 

Assumptions 1 and 2 imply (x")* > (xj )*. an observation consistent 
with their respective popularities. It also makes sense of the recent 
policy innovation of attaching strings to foreign aid requiring recip¬ 
ients to make purchases in the United States (Ci-type expenditures). 

4. Ruhe Goldberg machines and military bases. —Suppose b(Xj) = 0 for 
a ptoject iti a given constittiency, j. Then 

M(x,) = (I - (j)c,(xj) - t,C 2 (Xj) - C 3 (Xj). 

Even though b(Xj) - 0, the (politically) optimal project scale may be 
greater than zero. Specifically, 

x'j > 0 if A"(0) > 0, or (1 - tj)c,'(0) - tjC 2 ( 0 ) - f.j(O) > 0. 

Consider the case of many military bases (and other Rube (ioldberg 
machines) which, by the Department of Defense’s own admission, 
provide virtually no contribution to defense (filx] = 0). These never¬ 
theless remain attractive to local constituencies (hence Xj > 0 ) because 
of the ovet riding importance of these projects to the local economy in 
the form of c,-type benefits. Throughout this paper, we have em¬ 
phasized the political inappropriateness of economic net benefits as a 
relevant decision criterion. In this special case, it appears that even the 
absence of economic benefits altogether is not a di.squalifying charac¬ 
teristic in political choice. 


Generalization to Non-Pork Barrel Policies'^ 

One of the central features of our models is the unpacking of costs in 
politically relevant ways in which we distinguish project costs returned 

These themes are develojted in more detail in Shepsle and Weingast (1980). 
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to the district as expenditures, costs returned to other districts, and 
nonexpenditure costs borne within the district. We focused, however, 
on distributive or pork barrel projects, defined as projects whose 
benefits are geographically concentrated and whose costs are spread 
through general taxation. Two extensions offer further insight intcj 
nondistributive policies. The first distinguishes an additional cost, 
c^fx), or external, nonexpenditure costs borne by other districts. The 
second defines another source of economic benefits, fcjfx), that ac¬ 
crues to other constituencies as a consequence of a project in a par¬ 
ticular district. 

1. Additional external costs .—If additional external costs of the jlh 
project spill over into other districts, it can easily be shown that the 
degree of inefficiency increases when governed by decentralized 
project choice under the political maximand, N{x). Since the model 
now divides externalities into their politically relevant components, 
Cjfx) and c<(x), we can make further observations about the degree to 
which the political system can be relied upon to internalize exter¬ 
nalities associated with public activities. If the externalities are not too 
large (in the sense that they are local and do not extend into 
neighboring districts), the public sector action governed by a repre¬ 
sentative legislature internalizes them. However, if they are large, 
public sector action may not. 

The politically relevant (though economically arbitrary) distinction 
implied by district boundaries suggests that the public sector can only 
provide certain categories of public goods which are not available 
through private market arrangements. Hence, jurisdictional ques¬ 
tions become of paramount importance when producing a local pub¬ 
lic good like flood walls along a river which divides two political 
jurisdictions. Some of the most infamous cases of ptork barrel politics 
illustrate this point; The Hood walls along the lower part of the 
Mississippi River, which divides Louisiana from Mississippi, are 3 feet 
higher on the Mississippi side (see Ferejohn 1974, pp. 56-58). Simi¬ 
larly, the levees on the Indiana side of the Wabash River are higher 
than those on the Illinois side. Thus, a universalistic representative 
legislature is biased toward projects with low c^-type costs while failing 
to consider C 4 -type costs, 

2. Additional external benefits. —Leti(xj) = fri(x;) + b^lxj) where frd.Xj) 
are the benefits of the^ th project concentrated in district j and bi{Xj) 
are the benefits consumed by residents of other districts (presumed 
zero throughout the body of the paper). That is, b 2 {xj) is the positive 
consumption externalities (as compared with Cjlxj], which are the 
positive production externalities). 

Because the benefits outside the district are not readily internalized 
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under a distributive policy mechanism, large multidistrict (multistate) 
projects are likely to be rare relative to projects with concentrated 
local benehts. Consequently, multidistrict public goods (in the eco¬ 
nomic sense) are not only likely to be underproduced by a market 
mechanism but by a representative legislature as well (see Aranson 
and Ordeshook 1978). A universalistic representative legislature is 
biased toward projects with high A,-type benehts, while failing to 
internalize l> 2 -type benehts. Thus, both positive and negative exter¬ 
nalities adversely affect public as well as private provision. 


Conclusion 

Throughout this paper we have focused on the sources of inefhciency 
in public decision making. Our model demonstrates that democratic 
institutions play an important role. Three mechanisms were shown to 
inHuence the politically optimal project choice: the political cost¬ 
accounting mechanism, the districting mechanism, and the taxation 
mechanism. These features of the political economy systematically 
transform the economic beneHts and costs into political counterparts. 
Since it is the latter that determine the maximands for political actors 
and not their economic counterparts, these govern political choice. 

While our modeling of the districting and taxation mechanisms is 
straightforward and uncontroversial, there are circumstances in 
which our treatment of the incidence of gains and losses from local 
expenditures is implausible. We have presumed that pecuniary gain¬ 
ers figure more prominently than pecuniary losers in a legislator’s 
reelection constituency. However, under some circumstances this may 
not be true. A legislator's reelection constituency, for example, may be 
dominated not by factor owners of a public project but by those who 
would bear the brunt of the pecuniary losses and the nonpecuniary 
external costs of the project. We would not, therefore, expect the 
legislator to seek such projects. Indeed, since the menu of distributive 
programs is sufficiently diverse, there normally is something available 
for everyone. Thus we tend to find reclamation projects in the West, 
locks and dams in river districts with an active construction industry, 
and wildlife refuges in Sierra Club districts. Because of this diversity 
in policy preferences and program categories, the logic supporting 
the political distortion of pecuniary incidences continues to hold. 

Our principal conclusion is that since political institutions funda¬ 
mentally alter the perceptions and incidence of benehts and costs, 
they systematically bias project choices away from the efficient out¬ 
comes. In the context of distributive politics, this was shown to imply 
larger projects and programs than are economically warranted. 
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under a distributive policy mechanism, large multidistrict (mullistate) 
projects are likely to be rare relative to projects with concentrated 
local benefits. Consequently, multidistrict public goods (in the eco 
noinic sense) are not only likely to be underproduced by a market 
mechanism but by a represenutivc legislature as well (see Aranson 
and Ordeshook 1978). A universaJistic representative legislature is 
biased toward projects with high ^i-type benefits, while failing to 
internalize ^ 2 -type benefits. Thus, both positive and negative exter¬ 
nalities adversely affect public as well as private provision. 


Conclusion 

Throughout this papei we have focused on the sources of inefficiency 
in public decision making. Our model demonstrates that democratic 
institutions play an important role. Three mechanisms were shown to 
influence the politically optimal project choice: the political cost- 
accounting mechanism, the districting mechanism, and the taxation 
mechanism. These features of the political economy systematically 
transform the economic benefits and costs into political counterparts. 
Since it is the latter that determine the maximands for political actors 
and not their economic counterparts, these govern political choice. 

While our modeling of the districting and taxation mechanisms is 
straightforward and uncontroversial, there are circumstances in 
which our treatment of the incidence of gains and losses from local 
expenditures is implausible. We have presumed that pecuniary gain¬ 
ers figure more prominently than pecuniary losers in a legislator's 
reelection constituency. However, under some circumstances this may 
not be true. A legislator’s reelection constituency, for example, may be 
dominated not by factor owners of a public project but by those who 
would bear the brunt of the pecuniary losses and the nonpecuniary 
external costs of the project. We would not, therefore, expect the 
legislator to seek such projects. Indeed, since the menu of distributive 
programs is sufficiently diverse, there normally is something available 
for everyone. Thus we tend to find reclamation projects in the West, 
locks and dams in river districts with an active construction industry, 
and wildlife refuges in Sierra Club districts. Because of this diversity 
in policy preferences and program categories, the logic supporting 
the political distortion of pecuniary incidences continues to hold. 

Our principal conclusion is that since political institutions funda¬ 
mentally alter the perceptions and incidence of benefits and costs, 
they systematically bias project choices away from the efficient out¬ 
comes. In the context of distributive politics, this was shown to imply 
larger projects and programs than are economically warranted. 
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Flexible Exchange Rates, Prices, and the Role of 
"News": Lessons from the 1970s 


Jacob A. Frenkel 

University of Chicago and National Bureau oj Economic Research 


This paper analyzes the key issues and lessons Irom the experience 
with flexible exchange rates during the 1970s. It analyzes the efli- 
ciency of the foreign-exchange market and the volatility ol exchange 
rates, as well as the relationships between exchange rates and interest 
rates. A key distinction is made between anticipated and unantici¬ 
pated events, and it is shown that the key factor affecting exchange 
rates has been "news.” The analysis then proceeds to analyze the 
relationship between exchange rates and prices. The deviations from 
purchasing power parities are being interpreted in terms of the 
modern asset-market approach to the exchange rate. 


I. Introduction 

Recent experience with flexible exchange-rate systems has led to 
renewed interest in the operation of foreign-exchange markets as 
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leHected in many recent studies of the principal determinants of 
exchange rates. The 1970s witnessed the dramatic alteration of the 
international monetary system from a regime of pegged exchange 
rates which prevailed for about a quarter of a century (since the 
Bretton Woods Conference) into a regime of flexible (though man¬ 
aged) rates. As a consequence of the emergence of the new legal and 
economic system, traders, national governments, and international 
organizations were confronted with new economic problems, choices, 
and instruments. During the 1970s exchange rates fluctuated widely 
and inflation rates accelerated. The international monetary system 
had to accommodate extraordinarily large oil-related shocks which 
af fected trade flows in goods and assets. Huge oil payments had to be 
recycled. Uncertainties concerning future developments in interna¬ 
tional politics reached new heights, and the prospects for the world 
economy got gloomier. These developments placed unprecedented 
pressures on the markets for foreign exchange as well as on other 
asset markets. They were associated with a large slide in the value of 
the U.S. dollar and resulted in speeding up the creation of new 
institutions like the Kuropean Monetary System which provided the 
formal framework for the management of exchange rates among 
members. The increased interdependence among countries and the 
realization that exchange-rate policies by one national government 
exert influence on other economies have also induced legal responses 
from international organizations. For example, in late April 1977, the 
executive board of the International Monetary Fund approved the 
details of the second amendment to article 1V of the amended Articles 
of Agreement dealing with the principles and firotedures lor surveil¬ 
lance of exchange-rate policies of member countries. 

These developments provide the background for this paper which 
is intended to sum up the relevant evidence bearing on a set of related 
questions and to present a brief survey of key issues and lesstzns from 
the experience with floating rates during the 1970s. The main orien¬ 
tation of the paper is empirical and the analysis is based on the 
experience of three exchange rates involving the dollar/pound, the 
dollar/French franc and the dollar/deutsche mark (DM). Section II 
provides an analysis of the efficiency of foreign-exchange markets by 
examining the relationship between spot and forward exchange rates. 
The extent of exchange-rate volatility is also examined. T his analysis 
of the foreign-exchange markets sheds light on several questions 
including (i) whether exchange rates fluctuate “excessively”; (ii) 
whether speculation in the foreign-exchange markets is destabilizing; 
(iii) whether there is “insufficient” speculation in the foreign- 
exchange markets; and (iv) whether there is evidence for market 
failure in the sense that there are unexploited profit opportunities. 
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These issues are significant in assessing the performance of floating 
rates as well as in evaluating the need for government intervention in 
the foreign-exchange markets. The analytical framework that is used 
for interpreting the volatility of exchange rales and the association 
between spot and forward rates is the modern theory of exchange- 
rale determination. Within this framework, exchange rates are 
viewed as the prices of assets that are traded in organized markets 
and, like the prices of other assets, are strongly influenced by expec¬ 
tations about future events. 

The relationship between exchange rates and interest rates is 
analyzed in Section Ill. One of the key issues that is raised in this 
section is the distinction between anticipated and unanticipated 
changes in rates of interest. As an analytical matter this distinction is 
important, because the modern approach to exchange-rate determi¬ 
nation implies that exchange rates are strongly influenced by “news,” 
which by definition is unpredictable. Therefore, it is unanticipated 
rather than anticipated changes in interest rates that should be closely 
associated with changes in exchange rates. This prediction is tested 
empirically. 

Section IV analyzes the relationship between exchange rates and 
prices by examining the patterns of deviations from purchasing 
power parities. The main point that is being emphasized is that there 
is an important intrinsic difference between exchange rates and na¬ 
tional price levels. Exchange rates are more sensitive to expectations 
concerning future events than national price levels. As a result, in 
periods which are dominated by “news” which alters expectations, 
exchange rates are likely to be more volatile, and departures from 
purchasing power parities are likely to be the rule rather than the 
exception. The analysis of the relationship between exchange rates 
and prices is relevant for assessing whether the flexible exchange-rate 
system was successful in providing national economies with an added 
degree of insulation from foreign shocks and whether it provided 
policy makers with an added instrument for the conduct of mac¬ 
roeconomic policy. The evidence regarding deviations from pur¬ 
chasing power parities is also relevant for determining whether there 
is a case for managed float. Section V contains concluding remarks. 

II. The Efficiency of the Foreign-Exchange Market and the 
Movement of Exchange Rates 

In this section I analyze the principal characteristics of the relation¬ 
ship between spot and forward exchange rates which seem to emerge 
from the experience of the 1970s. Following an analysis of the effi¬ 
ciency of the foreign-exchange market, 1 discuss the more general 
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issues underlying the relationships between spot and forward rates 
and their volatility. 


The EJjiciency of the Foreign-Exchange Market 

One of the central insights of the monetary (or the asset-market) 
approach to the exchange rate is the notion that the exchange rate, 
being a relative price of two assets, is determined in a manner similar 
to the determination of other asset prices and that expectations con¬ 
cerning the future course of events play a central role in affecting 
current exchange rates.' 

If the foreign-exchange market is efficient and if the exchange rate 
is determined in a fashion similar to the determination of other asset 
prices, we should expect current prices to reflect all currently avail¬ 
able information. Expectations concerning future exchange rates 
should be incorporated and reflected in forward exchange rates. To 
examine the efficiency of the market, I first regress the logarithm of 
the current spot exchange rate, InS,, on the logarithm of the 1-month 
forward exchange rate prevailing at the previous month, In F(_i, as in 
equation (1):* 

In S, = a + b \n F,_| + «,. (1) 

If the market for foreign exchange is efficient, so that prices reflect 
all r elevant available information, then the residuals in equation (1), 
should contain no information and therefore should be serially 
uruorrelated. Further, if the forward exchange rate is an unbiased 
forecast of the future spot exchange rate (as should be the rase under 
an assumption of risk neutrality), then the constant term in equation 
(1) should not differ significantly from zero,* and the .slope coefficient 
should not differ significantly from unity. I examine three exchange 
rates: the dollar/pound, the dotlar/franc, and the dollar/DM. Equa- 


' For roller (ions of articles .suriiiiMiizing thi.s appioiich. sec ihc Scandinavian Journal of 
Erunnimc\, no. 2 (1976) aiirl Fienkcl and Johnson (1978). 

“ For an application of the same methodology in analyzing the cfhcienty properties 
of the foreign-exchange market during the German hyperinflation of 1921-2.8 .see 
Fienkel (1976, 1977, 1979). For an application to othei exchange rates during the 
1920s see Fienkel and Clements (1981); lor an application to the 1920s and the 1970s 
see Krugrnan (1977); fot an interesting analysrs u.sing time-serie.s and cro.s.s-.sectional 
data see Bitson (in prcs.s); lor an analysis ot market clficieniy using novel econometric 
technique.s see Hakkio (1979) and Hansen and Hodrick (1980), and for surveys sec 
Levich (1978, 1979). 

■’ Mol e pi erisely, if (assuming risk neutrality) the forward rate measures the expected 
value of the luture spot rate, then the constant term in the logarithmic ecjualion (1) 
should lie sec Frenkel 1979. The statement that under risk neutrality the 

forward rate etjuals the expected future spot rate neglects the effects of the stochastic 
elements m prices. As an empirical matter this neglect does not seem to be consequen¬ 
tial; sec Frenkel and Razin 1980. 
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tion (1) was estimated using monthly data for the period June 1973- 
July 1979 (data sources are listed in the Appendix). The beginning of 
the period was set so as to concentrate on the experience of the 
current exchange-rate regime (following the initial post-Bretton 
Woods transition period). The resulting ordinary-least-squares (Ol.S) 
estimates are reported in table 1. Also reported in table 1 are addi¬ 
tional regressions which will be analyzed shortly. 

As may be seen for the dollar/DM exchange rate, the hypotheses 
that (at the 95 percent confidence level) the constant term does not 
differ significantly from zero and that the slope coefficient does not 
differ significantly from unity cannot be rejected. These hypotheses 
are rejected for the dollar/franc exchange rate and are rejected (mar¬ 
ginally) for the dollar/pound exchange rate. The joint hypotheses, 
however, that the constant is zero and that the slope coefficient is 
unity cannot be rejected at the 95 percent level for the dollar/pound 
and the dollar/DM exchange rates and at the 99 percent level for the 
dollar/franc exchange rate. The test statistics for testing the joint 
hypotheses are reported in the column headed by F in table 1. These 
results are relevant for assessing whether the forward rate is an 
unbiased forecast of the future spot rate. We turn next to the question 
of efficiency. 

It was argued above that in an efficient market, expectations con¬ 
cerning future exchange rates are reflected in forward rates and that 
spot exchange rates reflect all currently available information. If 
forward exchange rates prevailing at period t — 1 summarize all 
relevant information available at that period, they should also contain 
the information that is summarized in data corresponding to period 
f — 2. It thus follows that including additional lagged values of the 
forward rates in equation (1) should not greatly affect the coefficients 
of determination and should not yield coefficients that differ 
significantly from zero. The results reported in table 1 are consistent 
with this hypothesis; in all cases the coefficients of In F,^., do not differ 
significantly from zero, and the inclusion of the additional lagged 
variables does not improve the fit. Most important, in all cases the 
Durbin-Watson statistics arc consi.stent with the hypothesis of the 
absence of first-order auux'orrelated residuals, and an examination of 
higher-order correlations (up to 12 lags) shows that no correlation of 
any order is significant.^ 

To examine further the relationship l>etween the various exchange 
rates, we note that one of the assumptions underlying equation (1) 

* Since In F,_, is highly coi rclaied wi«h In .S,^, the Durbin-Waison stausiir may not be 
appropriate since equaiioii (1) is very similar to a regression of In S, on its own lagged 
value. Durbin’s A-statistic leveals, however, that the lesiduals are scnally untorrelaied 
at conventional confidence levels. 
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Nort.—SE IS ih< standard error of the equation and R* is (he coefhaem of deterrrunauon. in the case of instrumental variables esumauon, the /?* was computed as 1 - var (ii,)/var (In S,). The 
/-siaiistK: tests the joint resincuon that the consum equals zero, and the slope equals uiutv. The test suusuc is distributed 35^(2,71) Criucal values forf t2,7l) are 3 IS (95%) and 4.92 (99%). The 
instrumental variable ^IVAR) estimauun method is used m order to allots ior the possibilii> of errors m variables ansing from using In as a proxy for the expected future spot rate, 
the instruments are a constant, umc time squared, and lagged values of the dependent and the mdependcni vanabJes. The w-stansuc which tests for the absence of errors in variables is dutribuied 
X* with 2 degrees of freedom The cnucal value for x*(2) is 5 99 (95%) 
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was that the forward exchange rate measures the unobservable value 
of the expected future spot exchange rate. This assumption provided 
the justification for using equation (I) instead of the explicit specifica¬ 
tion of the rational expectations hypothesis that is embodied in 
equation (2): 

In S, = In S, + e,, (2) 

where In S, denotes the expected (logarithm of the) spot ex¬ 
change rate for period I based on the information available at period 
t — 1. If, however, the forward exchange rate at / — 1 is a “noisy” proxy 
for the expected future value of the spot rate (i.e., it measures it with a 
random error), then we would obtain 

a + h In F,_| = In .S', -I- v,-,; E(v,) = 0, (3) 

and substituting cfiuation (3) into equation (2) yields: 

In .S, = a + b In T,_, + (c, — v,..,). (4) 

In this case the error term in equation (1) would be u, = e, — c,-,; the 
assumption that the covariance between In F,-, and u, is zero would 
entail a specificatitni ei rot , and the application of the OLS procedure 
would yield biased estimates due to the dassical errors in variables 
bias. 

In order to examine the possibility that the OLS estimates might be 
subject to the ci tot s in variables bias, one needs to test the hypothesis 
that cov (u,. In F,-i) = 0. This test follows the specification test outlined 
by Hausinan (1978).'' To perfoim the test, etjuatiiiii (1) was estimated 
by applying the OLS prtKedure as well as by using an instrumenuil 
variables (I VAR) estimation method. Under the null hypothesis of no 
misspecificalion the (4L.S coelficients vector ha is an efficient and 
unbiased estimate of the li ue coefficient vector. Under the alternative 
hypothesis of misspecification, the vector (>„ is biased and an unbiased 
coefficient vector can be obtained by applying an instrumental 
variables estimation procedure. The test statistic relevant for testing 
the null hypothesis can be written as 

m = (^1 - /io)'(var fi, - var£o)"'(^i “ ^ 0 ). (5) 

where var (/;,) and var (i,,) denote the variance-covariance matrices of 
hi and h,,, respectively. Under the null hypothesis, m is distributed (in 
large samples) as with 2 degrees of freedom. Table 1 reports the 
results oi estimating ecpiation (1) by applying the instrumental \ari- 
ables estimation methorl. As may be seen for all exchange rates the 

’ This test was recently applied by Obstleld (1978) l<i the analysis oC the foieinn- 
exchangc market during the 1970s and by Frenkel (1980,t, 19801i) to the analysts of the 
foreign-exchange markets during the 1920s. 
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two vectors of coefficients ^1 and ^0 are very close to each other. For 
example, for the dollar/pound exchange rate, the constants are .033 
and .030, and the slopes are .956 and .961—consequently, the re¬ 
sulting m-statistic is 2.01, which is well below 5.99—the critical value of 
X^(2) at the 95 percent confidence level. The wi-stadstics correspond¬ 
ing to the other exchange rates are also below this critical value. It is 
concluded, therefore, that the use of the forward exchange rate as a 
proxy for expectations does not introduce a .significant errors in 
variables bias and, thus, the use of the OLS estimation procedure 
seems appropriate.® 

The principal conclusion that may be drawn from the previous 
discussion is that the behavior of the foreign-exchange market 
during the 1970s has been broadly consistent with the general impli¬ 
cations of the efficient tnarket hypothesis. 


Exchange-Rate Movement: Volatility and Predictability 

In this section I analyze the volatility of exchange rates and the extent 
to which this volatility is predictable. As is well known, during the 
same period exchange rates have been very volatile. The standard 
errors ot the tnonthly percentage changes of the three exchange rates 
have been about 3 percent per month. Further, the standard errors of 
the regtessions in table 1 indicate that the forecasts of future spot 
ext hiitigc rates based on the forward rates are very imprecise: The 
stattdard ertots of the equations are about 3 percent per month. 


® The efh( leniy of tlic loreign-exthange maikct can also be analyzed IVom a differ¬ 
ent angle as in Ftenkel (I980A) Consider the equation 

H 

X, = a„ + a, I + + yn,-, + w,. 

i-i 

where .v, denotes the pertenlagc change of the spot exchange rale (In S, - In .S,-,): rr,^, 
denotes the forward premium on foreign exchange (In F, , ~ In S',_,): < denotes time; n 
denotes the numbei of lags; and w denotes an error term. If 7r,_, summarizes all 
available information c ontcrning the future evolution of the exchange rate, then given 
the value of the forward pieniium ir,.the past hi.story of the percentage change of the 
ext flange late should not "help” the predation (i.e., the past history should not be 
s-iewed as (ii anger-causing future changes), and the joint hypotheses that Oq, a„ and gf 
are zero and that y is unity should not be rejected. The results of applying these tests to 
the three exchange rates for various numbers <»f lags as well as to the pcKiled data base 
ol the three exchange rates show that the null hypothesis cannot be rejected at the 95 
percent confidence level, since the values of the various F-statistics fall well below the 
corresponding critical values. It is noteworthy, however, that the power of this lest is 
low and that the joint hypothesis that cxo. Oi./Si. and y are zero could also not be rejected. 
The difficulties in "explaining” the percentage change of the exchange rale in terms of 
past values of various variables reflect the fact that like the prices of other a.ssets which 
are traded m organized markets, changes in exchange rates are dominated by “news” 
which by dehnitiori could not have been incorporated in past changes or in the lagged 
foiward premium. For a further elaboration see the following section. 
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These characteristics of price changes (volatility and unpredictabil¬ 
ity) are typical of auction and of organized asset markets. In such 
markets current prices reflect expectations concerning the future 
course of events, and new information which induces changes in 
expectadons is immediately reflected in corresponding changes in 
prices, thus precluding unexploited profit opportunities from arbi¬ 
trage. The strong dependence of current prices on expectations about 
the future is unique to the determination of durable asset prices; it is 
not a characteristic of price determination of nondurable com¬ 
modities. The strong dependence of asset prices on expectations also 
implies that periods which are dominated by uncertainties, new in¬ 
formation, rumors, announcements, and “news," which induce fre¬ 
quent changes in expectations, are likely to be periods in which 
changes in expectations are the prime cause of fluctuations in asset 
prices. Further, since the information which alters expectations must 
be new, the resulting fluctuations in price cannot be predicted by 
lagged forward exchange rates which are based on past information.^ 
Therefore, during such periods, one should expect exchange rates 
(and other asset prices) to exhibit large fluctuations. When the prime 
cause of fluctuations is new information, one may expect that lagged 
forward exchange rates (which are based on past information) are 
imprecise (even though possibly the best unbiased) forecasts of future 
spot rates. 

To gain further insights into the implications of this perspective on 
the relationship between predicted and realized changes in exchange 
rates, figures 1-3 present plots of predicted and realized percentage 
changes in exchange rates for the three pairs of currencies where the 
predicted change is measured by the lagged forward premium. Also 
presented in these figures are the differentials in national inflauon 
rates which are discussed in Section IV. The key fact which emerges 
from these figures is that predicted changes in exchange rates account 
for a very small fraction of actual changes.” This phenomenon is also 
reflected in the comparison between the variances of actual and pre¬ 
dicted changes in exchange rates: In all cases the variances of monthly 
percentage changes in exchange rates exceed the variances of 
monthly forward premia by a factor that is larger than 20.* This fact 
suggests that the bulk of exchange-rate changes seem to be due to 

’ The analysis of the role of "news" in determining current exchange rales and in 
explaining forecast errors from the forward rale has been made forcefully by Miissa 
(1976a, 19766, 1977, 1979a) and Dornbusch (1978). The large degree ol volatility is 
also analyzed by McKinnon (1976), who atiribuies it to in.sufficient speculation, 

'These and the following empirical regularities are analyzed in detail in Mussa 
(1979a). See also Frenkel and Mussa 1980. 

* For an analysis of the relationship between the variances of senes of predicuons and 
scries of realizations see Shillcr (1979) and Singleton (1980). 
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Fic; I. —Moiichly peicentagc changes of the U.S./U.K. consumer price indices [A(ln 
of the $/C exc hange rale (A In.V,); and the monllilv forward premium 
|ln(F, July 197;i-July 1979 


"new infonnation” which, by definition, could not have been antici¬ 
pated and reflet ted in the forward premium or discount which pre¬ 
vailed in the previtnis perkxl. 

This view of the foreign-exchange market can be exposited in terms 
of the following simple model.'® Let the logarithm of the spot ex¬ 
change rate on day I be determined by 

In S, = z, + hE,{In S,+, - In S,), (6) 

where /t.',(ln “ In .V,) denotes the expected percentage change in 

the exchange rate between I and I + 1, based on the information 
available at t, and where z, represents the ordinary factors of supply 
and demand that affect the exchange rate on day t. These factors may 
include domestic and foreign money supplies, incomes, levels of out¬ 
put, etc. Equation (6) represents a sufficiently general relationship 
which may be viewed as a “reduced form” that can be derived from a 
variety of models of exchange-rate determinatifin. These models may 

‘“The following paiagraph draws on Frenkel and Mussa (1980) 
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Fui. 2.—Monthly pcrceiilage iha!ig«-s ol the U.S./French tiniMimer priie indues 
[A(1 m COI-,;s/C.C)l.r)|; of the $/Fr exchange rale (A In .S’,); and the monthly forward 
premium (ln(f, [ulv 1973-July 1979. 


differ in their emphasis of the determinants of z(t), but they all are 
likely to share a similar reduced form." A.ssuminjr that expecuuions 
are rational in that equation (6) applies to expectations of future 
exchange rates, it follows, by forward iteration, that 


E, In ,S',+j 


I 

I + b 





(7) 


d’hus, the current exchange rate (j = 0) and current expectations of 
future exchange rates (j > 0) are linked, because both depend on 
expectations concerning the future z’s. The strength of the link de¬ 
pends on the magnitude of h which characterizes the dependence of 
the current exchange rate on the expected percentage change 
thereof.'^ The presumption is that, due to profit opportunities from 


“ See, c.g., the cornprehciisive wonornetric model of Kair (1979). 

A result of this j^eneral form is derived in Mussa (1976<i). I'he unique role of 
expectations is also emphasized by Black (1973), Doriibusch (1976c, 1978), Kouri 
(1976), and Bilson (1978). In general, the value of b may be viewed as the relevant 
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Fu. :L — Monilily percentage changes of the U.S./Germaii consumer price indices 
|A(ln (X)l^,.,/C'.OL,;)|: of the $/DM exchange rate (A In .S',); and the monthly forward 
piemiuMi |lt.(f,_,/,S,-,)l, July 197a-|iil> 1979 


jibiiiage, this link is strong at least lot the exchange rates expected in 
the near future. Heiite, the current exchange rate. In St = E, In S,, 
should be closely linked to the curient expectation of the next pe¬ 
riod’s exchange rate, E, In 5,+,, which in turn should be cltisely linked 
to the ex( hange rate expet ted for the following period, E, In S,+ 2 , itnd 
so on. 

In order to examine this hypothesis I present in hgures 4-6 plots of 
the .spot and the contemporaneous forward exchange rates for the 
three pairs of currencies. Also presented are the ratios of national 
price levels which are discussed in Section IV. If the dominant factor 
underlying changes in rates is new information which alters views 
about current and expected future exchange rates by approximately 
the same amount, then one should expect a high correlation between 
movements of spot and forward rates. This fact is clearly demon¬ 


parameter for determining whether a specihc tommodity (whose pricing rule is de¬ 
scribed in terms of cqq. like [6]-[7]) may be viewed as an asset. The higher the value o{b 
IS for a given commodity, the larger is its asset attribute. 
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Fig 4.—Monthly observations of the $/£ spot (In .S',) and forward (In F ,) ext hange 
rates and the ratio of the U.S./U K. cost of living indices [InfCOLis/CO^n)] (staled 10 
equal the spot exchange rale at the initial month): June 1973-July 1979. 


.Strafed by figures 4-6 where it is seen that spot and forward exchange 
rates tend to move together and by approximately the same amount 
(the vertical difference between the two rates corresponds to the 
percentage forward premium or discount on foreign exchange). I he 
correlations between the spot and the forward exchange rates lor 
the three pairs of currencies exceed .99, and the correlations between 
the corresponding percentage changes of the spot and forward rates 
exceed .96. The high correlation betw’een movements in spot and 
forward rates is expected, since the two rates respond at the same time 
to the .same flow of new information (which is presumed to affect the 
rate for more than one period). In general, the details of the relation¬ 
ship between spot and forward exchange rates depend on the ume- 
.series properties of the z’s in equation (7) and in particular on whether 
the new information is viewed as piermanent or transitory.'® 

'* New information might be “permanent" when the relevant hon/.on is I month, 
while it might be transitory when the relevant horizon is a vear. In that ca.se, the 
correlation between the spot exchange rate and the coniemptvraneous 1-month for- 
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Fi<;. 5.—Monihly observations €)t the $/Fr spot (In S,) and forward (In f,) 
(•x( h.tngc rates and the ratio of the U.S./Kreiit h tost-of-living indices (ln((;OLiis/CC)LF)i 
(scalerl to equal the spot exchange rate at the initial month): Jurte 1973-July 1979. 


I he toinovernent of .spot and forward rates is evidence of the close 
link between current and expected future exchange rales which is 
illustrated by ecjuation (7). This characteristic is typical of the 
foreign-exchange market and is also shared by prices of many assets 
and commodities traded in organized markets. The recent pattern of 
gold price.s provides a u.seful example of this general principle. I'able 
2 reports the spot and the futures prices of gold as recorded recently 
in the International Money Market (IMM) at the Chicago Mercantile 
Exchange on 5 consecutive days. The two key facts which are illus¬ 
trated by this table are (i) the extent of day-to-day volatility in gold 
prices and (ii) the general uniformity by which these changes are 
reflected in the price of gold for immediate delivery as well as in the 
prices for the eight future delivery dates. 

ward rate is likely to be high, while the correlation between the spot rate and the 
12-inonth forward rate is likely to be low. The perceived permanence of the new 
information can be inferred from the correlation between the spot and the various 
maturities i>f the forward rate. As expected, it i.s generally found that this correlation 
diminrshes with the matunty of the forward contract. 
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Ku.. 6,—Monthly observations of the $/l)M spot (In S,} and forward (In >',) 
exchange rales and the ratio of the U.S./Gciman cost-of-living indues lln(COL.|,,7 
C()l.,i)] (scaled to ecjual the spot exchange latc at the initial month): June 1973-july 
1979. 


Another fealure which is revealed by figures 4-6 is that the con¬ 
temporaneous spot and forward exchange rates are approximately 
equal, thus indicating that the market’s best forecast of the future spot 
rate is (approximately) the current spot rate. This phenomenon re¬ 
flects the fact that, as an empirical matter, exchange rates have fol¬ 
lowed (approximately) a random-walk process. For such a process, 
current prices are, indeed, the best forecasts of future prices (and to 
the extent that the exchange rate had some drift, the statement above 
should l>e interpreted in reference to that drift). It is relevant to note, 
however, that while the random-walk phenomenon seems to corre¬ 
spond to the actual paths of exchange rates, it does not reflect a 
theoretical necessity. 

The final characteristic of the foreign-exchange market is described 
by figures 7-9, which plot for the three pairs of currencies the spot 
exchange rate and the forward premium on forward exchange. Since 
the spot rate and the forward premium are expres.sed in terms of 
different units (where the latter is expressed as percentages per 
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Kici, 7.—Monthly obstTvations of the normalized $/£ sf)ol exchanRe rate (In S') 
and the normalized forward premium (ln(f,/S,)''l. Both senes are normalized by sub- 
tracUng from each set ics its mean and by dividing by the corresponding standard ei ror. 
June 1973-july 1979. 


month), the two series were normalized for the purpose of the plots 
by subtracting from each series its mean and by dividing by the 
corresponding standard error.''* The fact which emerges from these 
figures is that generally (though not always) there is a positive corre¬ 
lation between the expected depreciation of the currency (as mea¬ 
sured by the forward premium on foreign exchange) and the spot 
exchange rate. This positive correlation may be rationalized by noting 
that currencies which are expected to depreciate are traded at a 
discount in the forward market and, on average, these currencies also 
command a lower foreign-exchange value in the spot market.'* This 


The normalized values of the spot rate. In S', and of the for ward piemiuin. 
ln(F/S’,)*, which are plotted in fig.s. 7-9. arc defined as In S' = (In S, — In .S)/(ri„ and 
]n(F,ISi)' ™ [ln(T,/5,) — In(T/S)]/<ri,,f,.,T where a bar over a vaiiable indicates its 
sample mean, and where tr denotes the sample standard deviation of the subscripted 
variable. 

It is noteworthy that since the forward premium (like the rale of interest) and the 
exchange rate are dimensionally incommensurate, their association raises questions 
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Ki(.. 8.—Muiuhlv ohsei vaiKiiis of thf iionn.ili/cd $/Fi spol exchange rale (In .S^) 
and the not iT\ali/e<l toiwaid pitMniuin lln(F,/.S',y''). Bnlh series are norinali/ed by sub- 
Iran iii^' 1 11)111 ea< li series ils mean and by dividing by (he (.orresponding slandaiil ei loi ,■ 
(line 1973-[ill) 1979 


relationship is embodied in the speeifitaiion ol erpiation (6) and is 
interpreted further in the next section. 


Ill. Exchange Rates, Interest Rates, and Innovations 

This set lion contains an analysis of the relationship between exchange 
rates and interest rates from the perspective of the mtaneiary (tir the 
asset-market) approach to the exchange rale. Following a discussion 
f>f the broad facts, the analysis proceeds with an empirical examina¬ 
tion of the role of “news.” 

Exchange Rates and Interest Rates: The Broad Facts 

To set the stage for this section, it is useful to recall the analysis which 
predicts a negative association between the rate of interest and the 

that are familiar from the diseus.sions ol the (libsoii paradox. In a separate papet 1 
intend to examine the ielation.ship between exebange tales and the forward premium 
(or the interest differential) in light of the various explanations of the Gibson paradox. 
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Ki(.. 9.—Monthly ohscrvalioiiN of the normalized J/DM spot (In V,') and the 
nonnalizetl foiwaid premium |In{fV-5'f)'l- Both series aic normalized by subtraiiing 
( 10 m ea<h senes Ms mean and by dividiniz by the corresponding standard eiroi lunc 
1973-Jnly 1979. 

exchange rate. According to that analysis, a higher rate of interest 
aitracts foreign capital which induces a surplus in the capital account 
of the balance of payments and thereby induces an appreciation of 
the domestic currency (i.e., a lower spot exchange rate). Another 
variant of this approach states that the higher l atc of interest lowers 
spending and thus induces a surplus in the current account of the 
balance of payments whit h results in a iow'er spot exchange rate. A 
third variant claims that the higher rate ttf interest implies (via the 
interest parity theory) a higher forward premium on ftireign ex¬ 
change; and to the extent that at a given point in time the forwaid 
exchange rate which repre.scnts the expected future spot rate is pre¬ 
determined by past history—as would be the case under the adaptive 
expectations hypothesis (which is clearly rejected by the evidence on 
the ettmovements of spot and forward rates)—the required rise in the 
forward premium will be brought about by a lower spot rate (i.e., bv 
an appi eciation of the domestic currency). Whatever the route, by 
ignoring the distinction between nominal and real rates of interest, 
this approach predicts a negative relationship between the rate of 



JOVHnal or rouricAL *:co^oMv 

iiucrcM aiul ilH' spci fxthangc rale (or. alternauvely. a positive ,e 
l„i,>„sh.plK-twcen .he laieol interesi and the foreign-exchange 

^ -i/^propru/e for nownt],t,o,,,ry 

Mnniriu-nts (like the one prevmhng in the United Sutes in recent 
t In such perutds v.in.'ifitMi.v in t'Htes oi inletest are most likely to 

he dorniruited by y^irhitions in infiationary expectations rather than by 
iiquidity effects asscKiated with changes in the ratio of money to 


l)onds. In such an environment, the rate of interest is expected to he 
ponhnely correlated with the exchange rate. The broad facts are con¬ 
sistent with this hypothesis. Over the recent period, the rise in the rate 
of interest in the United State.s (relative to the foreign rate of interest) 
lias fleet! as.sociated with a rise in the spot exchange rate (i.e., with a 
depretiation of the dollar) rather than with a fall in the spot rate. 
Figuie 10 illustrates the point by plotting the foreign-exchange value 
of the U.S. dollar against the interest rate differential. As is evident, 
the higher (lelative) rate of interest in the United States has been 
associated with a higher exchange rate (i.e., with a lower foreign- 
exchange value of the dollar). 

I'he positive association between the rate of interest and the ex¬ 
change tate in the context of the U.S. dollar and the inHationary 
enviionment can be accounted for by the monetary (or the asset- 
market) approach to the exchange rate which puts a special emphasis 
on (he influence of the expectations on the current values of exchange 
rates.'* For example, according to the monetary approach, a ri.se in 
the domeslic (relative) rate of interest which is primarily dominated 
by a ii.se in the expected (relative) rate of inflation induces a decline in 
the demand for real cash balances; for a given path of the nominal 
money supply, a.sset-market equilibrium requires a price level which is 
higher than the price which would have prevailed otherwise. When 
the domestic price level is linked to the foreign price through some 
form of purchasing power parity, and when the path of the foreign 
price is assumed to be given, the higher domestic price can only be 
achieved through a rise in the spot exchange rate (i.e., through a 
depreciation of the currency).'^ 


'“tor iht'oietiial dcvdopincnls and application.s of the appioach .set', e.g., 
Dotnbusth (1976n. 1976A), Frenkel (1976), Kouri (1976), Mu.s.sa (1976a), Bilson (1978), 
Frenkel and Johnvrn (1978), Hodrick (1978). Frankel (1979), Clements and Frenkel 
(1980), and Frenkel and Clements (1981). 

” It should t)e emphasised that this explanation of the positive association between 
the rate of interest and the exchange rate does not rely on a rigid form of the 
pu 1 chasing power parity theory. It only requires that domestic and foreign price levels, 
when expressed in term.s of the same currency, are positively correlated. The evidence 
from the 1970,s is consistent with this requirement; sec Frenkel 1981. 
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Fk'.. 10.—Foieigil-fxchan({<; value of the U.S. dollar and interest rate ditferenlials 
•Sotirce: Mudd 1979. 


I'his explanation of the positive association between interest rates 
and exchange rates has an intuitive appeal in that it implies that, in an 
inflationary environment, a relatively rapid rise in prices is asscxiated 
with high nominal rates of interest as well as with a depreciation of the 
currency in terms of foreign exchange. This relationship is embodied 
in equation (6), which stales that an expected depreciation of the 
currency (which in our case is associated with inflationary expecta¬ 
tions and high nominal rates of interest) results in an immediate 
depreciation.*® 

The foregoing analysis also provides the explanation for the obser- 

The traditional prediction of a negaiivc relationship between iiueresi rales and the 
exchange rate could, in principle, be rationalized under the a.ssumpuon that u lonien- 
trates on the short-run liquidity effects ot monetary changes. It should he emphasized, 
however, that in an inflationary environment, like the one prevailing in tlie I'nited 
States, the applicability of this rationalization is very limited. The short-run liquidity 
effect is emphasized in Dornbusch (1976A). The role of inflationary expect.aiions in 
dominating exchange-r.ile developments is emphasized m Frenkel (1976). Edward.s 
(1979) and Frankel (1979) attempt to integrate these two factors. 
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vation (which was noted in Sec. 11) that, generally, there is a positive 
correlation between the forward premium on foreign exchange and 
the level of the spot rate. Since duiing inflationary periods the spot 
rate is expected to be positively correlated with the interest rate 
differential, and since according to the interest parity theory that 
differential must equal the forward premium on foreign exchange, it 
follows that the forward premium is al.so expected to be positively 
correlated with the level of the spot rate.'* 


Exchatifre Rates and “News" 

One t)f the central implications of the rational expectations hypothesis 
is that unanticipated events, “news,” play a predominant role in af¬ 
fecting real variables and asset yields. This implication is embodied in 
many expositions of modern macroeconomics, and its empirical con¬ 
tent has been the subject of numerous recent studies.“ In the context 
of exchange-iate determination the discussion in Section 11 and in 
particular the contributions by Mussa (1977, 1979rt) and Dornbusch 
(1978) emphasized that the predominant cause of exchange-rate 
movements is “news” which could not have been anticipated.'** Ex¬ 
pressing the spot exchange rate at period I as the sum of factors which 
were anticipated from the past, as well as factors which repiesent 
“news,” Dornbusch (1978) decomposes the effects of “news" into 
those which alter the expected future spot rate between the last 
period and the present and those which lead to a r eassessment of the 
1-pcriod interest rate differential starting at the present, that is, 
"news” about the term structure. Both of these white-noise, serially 
uncorrelated components play a role in determining the spot ex¬ 
change rate in Doi rrbu.sc h’s analysis. 

The evidence presented in Section 11 suggests that the forward rate 
summarizes the information that is available to the market when the 
forward rate is being set, and in equation (.8) it was assumed that the 
expected exchange rale can be written as a + b In /*',_) plus a serially 
unconelalecl error. We may, therefore, express the spot rate at pe¬ 
riod t as a function of factors which have been known in advance and 
are summarized by the lagged forward rale, as well as a function of 
the "news”; 

In 5, = a + A In -I- “news” + w,. (8) 

In what follows this notion is applied to an empirical analysis of the 

'* For evidence on the lobustncss ot the interest parity relationship see Frenkel and 
Lcvich (1977). 

.See, e.g., Barro 1977 and Kischci 1980. 

See also Bilson 1978; F renkel and Mussa 1980; Isard 1980; and longworth 1980. 
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role of “news” as a determinant of the exchange rate. The key 
difficulty lies in identifying the variable which measures the “news.” 
Since, quite frequently, it is difficult to observe and quantify the 
“news,” it is convenient to examine the relationship between the 
exchange rate and a variable whose time series is likely to manifest 
the “news” promptly. Assuming that asset markets clear fast and that 
the “news” is immediately reflected in (unexpected) changes in the 
rates of interest, equation (8) may be written as 


= a -1- /» In 

-f- a 

(i - t*), - - t*), 

expected 


-“news”- 


exchange rate 


(9) 


where the bracketed term denotes the innovation in the (1-rnonth) 
interest differential and where — i*), denotes the interest differ¬ 

ential which was expected to prevail in period t based t)n the informa¬ 
tion available at t - 1. fhe exjietted interest rate differential was 
computed from a regression of the interest differential on a constant, 
on two lagged values of the differential, and on the lagged forward 
exchange rate In /'Vi- 

As was argued above, the association between exchange rates and 
interest rates is likely t<} be positive during peritxis in whic h most of 
the variations in nominal rates of interest arc dominated by variations 
in inflationary expectations—a chaiacteristic which seems to fit the 
inflationaty environment of the 1970s. Under such circumstances, 
when the unexpected interest differential reflects “news" concerning 
inflationary expectations, the ccKd'ficient a in equation (9) is likely to 
be positive.I'able 3 reports the two-stage least-squares estimates of 
equation (9) for the three exchange tales over the period June 1973- 
July 1979.*^ As may be seen in all cases, the coefficients of the unex¬ 
pected interest dif ferential are positive and, in the case of the dollar/ 
|K>und exchange rate, the coefficient is statistically significant. In 


In general, of tonr.se. the Mgii of the ttH'fheieni a depends on the viunc of the 
v.inatioii in the intere.sl i.ite. 

In till cases, the lagged lorwavd exchange rate was included as an instunnent in 
Older to olitain consistent estiinates; see ,Nelson 1975. .Adding lagged rallies ol the 
percentage chattges ol the domestic and the loteign money supplies as deterininants of 
the exjHxted interest dilleienlial and .idding the lurient values of the |x*itenlage 
change of the money supplies as instruments lor the nnexiM-cted interest differential 
did not affect the results in any material way. In order to ohtaiti consistent estimates, 
tire assumption made in the regressions reported in tables 3-f> is that, m forming 
expectations concerning the interest differential. indivitlu.ils Icxik onlv at the l.igged 
forward premium and al past v.ahies ol the clifreiential. .An alleriiafive way to compute 
the expected diffeiential would icse data on the term structure of interest rales Since 
data on the differential of 2-monlh rales are not leadily availahle, this compulation 
would require interpolations. 
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order to verify the importance of using the series of innovations in the 
interest differential, table 3 also reports estimates of regressions 
which replace the innovations by the actual series of the interest 
differential. In ail cases the coefficients of the actual interest differen¬ 
tial do not differ significantly from zero. Table 4 describes analogous 
two-stage least-squares estimates of regressions which include both 
the actual interest rate differential and the innovation in the differ¬ 
ential.^’ Again, in all cases the coefficients on the actual differential do 
not differ significantly from zero, while the coefficients on the inno¬ 
vations are all positive and are significant for the dollar/pound and 
the dollar/franc exchange rates. 

One possible interpretation of the positive coefficient on the (unex¬ 
pected) interest differential may be given in terms of the prediction of 
the monetary approach to the exchange rate, in which case the esti¬ 
mate of the coefficient a in equation (9) might be interpreted as an 
estimate of a structural parameter. Under an alternative interpreta¬ 
tion the innovations in the interest differential belong in equation (9) 
only as far as they manifest “news” which is relevant for exchange- 
rale determination. If, for example, the dominant element of “news” 
was variation in inflationary expectations, then one could also use the 
innovations in other time series as long as they reflect the relevant 
“news." To examine this po.ssibility the same regressions as in tables 
.3-4 were estimated using the 12-month interest differential, and the 
results are leported in table.s 5-6. As before, in all cases the 
coef ficients on the actual differential do not differ significantly from 
zero, while in all cases the coefficients on the innovations in the 
differential are positive and significant for the dollar/pound and the 
df)llar/franc exchange rates.“ 

On the whole the record shows that during the 1970s exchange 


I'tic dilficuities in obuining inslrumenls lor data that are innovations ate obvious 
suite, by virtue of being “news," u is uiilikelv that variables which characterize the 
history can serve a,s gixxl instruments. Lhe difficulties are acute in cases where a 
variable and us expected value appear in the same regression, in which case consistent 
estimates iec|Uiie ihe use of an instrument that is contemporaneous with the innovation 
and is exogenous (see MtCallum 1979). Thus, in addition to a constant and the lagged 
forwaid exchange rale. Uurliin's rank variable was used as an instrument for estimating 
the innovations in the interest rate differential. 

fo allow lot the possibility that the exchange-rate eejuation intludes both the short- 
and the long-term interest rales where, as suggested by Frankel (1979), the former 
captuies liquidity effects and the latter captures expectations effects, eq. (9) was also 
estimated using the innovations m Ixith the I-monlh and the 12-month interest differ¬ 
ential on the right-hand side. Sinre the two sets of innovations arc highly collinear, 
none ol the coclfiaents differed significantly from zero. In this context it is also 
ntiteworlhy that the 12-month interest rale contains elements of the I-monlh rales, due 
to the characteristics of the term structure of interest rales. As a result, coefficient 
e.stimales from regressions which use both rates, like those in Frankel (1979). must be 
interpreted with great tare. 
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rates and the interest rate differential have been associated piositively, 
thus indicating that during that inflationary period the same factors 
which induced a rise in the interest differential also induced a rise in 
the spot exchange rates. Furthermore, consistent with the hypothesis 
that current changes in exchange rates are primarily a response to 
new information, the evidence shows the imjjortance of the innova¬ 
tions in the interest differential. 

The principle that current exchange rates already reflect expecta¬ 
tions concerning the future course of events implies that unantici¬ 
pated changes in exchange rates are primarily due to innovations. 
Since the empirical work suggests that most of the actual changes in 
exchange rates are unanticipated, it follows that most of the actual 
changes in exchange rates are due to “news.” In the present section 
this principle was applied to the analysis of the relationship between 
exchange rates and interest rate differentials. I'he principle, however, 
is general. For example, it implies that the relationship between a 
deficit in the balance of trade and the exchange rate deptends crucially 
on whether the deficit was expected or not. A deficit that was expected 
may have no effect on the exchange rate, since the latter already 
reflected these expectations. In contrast, an unexpected deficit in the 
balance of trade may contain significant new information that is likely 
to be accompanied by large changes in the exchange rate.^* This 
distinction might be useful in interpreting the weak and unstable 
relationship between the balance of trade and the exchange rate 
without having to rely on explanations like the J curve or on variable 
import and export elasticities. 


IV. Exchange Rates and Prices 

One of the striking facts concerning the relationship l>etween prices 
and exchange rates during the 1970s has been the poor performance 
of the predictions of the simple versions of the purchasing power 
parity doctrine. The originators and propr^nents of the purchasing 


“ For d further elabordlion on the relationship between exchanj^e r.iles and the 
current actount sec Koiiri (1976); Branstm (1977): Jiran.son. Hahtunen. and .Mavson 
(1977); Dornbusch and Fischer {1980); and Rodriguez (1980). For a special emphasis on 
the role of innovations in the trade balance see .Miissa (1979f), and for empiiical 
evidence see Dornbusch (1980) and Hakkio (1980), It should l>e noted that the empin- 
cal work on the association lietween exchange rates and cuiient account innovations 
faces some difhciilues since, in contrast wiih data on financijl variables like mteiest 
rates, data on the current account arc not available at short miervals Furihermoie. 
hndings on the as.stxialioii between exchange rates and current account mnosauons 
should be interpreted with care since, rather than reflecting the seiisitivily of exchange 
laies to "news," they might Just reflect invoicing prartiics according to which U.S 
expons are invoiced in terms of U.S. dollars, while imporis are invoiced in terms of 
loreign currencies. 
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power parity doctrine (Wheatley and Ricardo during the first part of 
the nineteenth century and Cassel during the 1920s) have viewed the 
doctrine as an extension of the quantity theory of money to the open 
economy. By now the consensus seems to be that, when applied to 
aggregate national price levels, purchasing power parities can be 
expected to hold in the long run if most of the shocks to the system 
are of a monetary origin which do not require changes in relative 
prices. To the extent that most of the shocks reflect “real” changes 
(like differential growth rates among sectors), the required changes in 
sectoral relative prices may result in a relatively loose connection 
between exchange rates and aggregate price levels. The experience 
during the 1970s illustrates the extent to which real shocks (oil em¬ 
bargo, supply shocks, commodity booms and shortages, shifts in the 
demand for money, differential productivity growth) result in sys¬ 
tematic deviations from purchasing power parities. As illustrated in 
figures 1-3, short-run changes in exchange rates have not been 
iloscly linked to short-run diffeientials in the corresponding national 
inflation rates as measured by consumer price indices. Furthermore, 
this loose link seems to be cumulative. As illustrated in figures 4-6 
divergences from purchasing power parities, measured in terms of 
the relationship between exchange rates and the ratio of consumer 
price indices, seem to persist. 

rhe link between prices and exchange rates is illustrated in table 7, 
which reports the results of regressions of the various exchange rales 
on the corresponding ratios of wholesale anti of cost-of-living price 
indices. As may be seen, the results of the regressions which involve 
the U.S. dollar are extremely poor. For the dollar/pound and the 
dollar/franc exchange rates, the estimates of the toefficieius of the price 
ratios are insignificant and, for the dollar/DM exchange rale, the e.s- 
timates differ significantly from unity. In contrast the results of the 
regressions of exchange rates that do not involve the U.S dollar or the 
U.S. price level (the pound/DM and the franc/DM exchange rates) are 
superior: F.xcept for the wholesale price indices in the franc/DM 
regression all the coefficients are highly significant and the elasticities 
of the exchange rates with respect to the various price indices do not 
differ significantly from unity. 

The vast difference in the performance of the regressions for the 
various currencies can be explained by noting that first, due to trans¬ 
port cost, purchasing power parities arc expected to hold better 
among the neighboring Fairopcan countries than among each of 
these countries and the United States; second, changes in commercial 
policies and nontariff barriers to trade seem to have been more stable 
within Europe than between Europe and the United States; third, 
within Europe the snake agreement and later on the European Mon- 
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PuRCHASiNc; Power Parities: Instrumental Variables Monihly Data: June 

1973-July 1979 

(Standard Errors in Parentheses) 


Def>endent 
Variable In S, 

Constant 

ln(PJP*) 

ln(PJP*) 

•SE 

D-W 

P 


r .712 

.165 


027 

1.63 

.963 

Dollar/pnund 

1 (.149) 

2.982 

( 507) 

1.070 

.029 

1.66 

,998 


1 (2.978) 
r -1..52I 

.184 

(.897) 

.029 

2.26 

863 

Dollar/franc 

1 (.027) 

1 -1.570 

(.374) 

-1.070 

.029 

2.30 

901 

1 

1 (.047) 


(817) 





r -.900 

1.786 


.034 

1.69 

739 

Dollai/DM 1 

1 (.018) 

1 -.908 

(230) 

2.217 

.031 

1.96 

.759 


1 ( 175) 

r -1 668 

.821 

(.263) 

027 

1.60 

.895 

Pound/DM 1 

(.041) 

- 1.666 

(.144) 

.965 

.027 

1.57 

.909 

1 

1 (.048) 


(.197) 





.863 

-.026 


.020 

1 61 

.981 

Franc/DM 

( 143) 
602 

(.487) 

1.180 

.019 

1.48 

929 


(.(H8) 


(.327) 





Noir — Itif o1 »hc ^ptrt rx<hji>Re b> S,. /P J> jml ImP, ;P*) liriHMr. rr^p«- 

iivoly. the logarithms of (he rauos of the wholesale pnee indues and (he co&i of living indKrs Cochrane-Ocuu 
iteiadve (echntque wuh a iwo-v(age leas('squares esiimation meihod was uved, the insinjmrnu are a (onvi.int, ume, 
lime u]uared. and lagge<l values oi the depemlcnt and indepeiKlrni varubles SF. is (he uaiidard error of the 
ei|uauon, and p deiuRes ihe fii'v(-<»rder auiocoirelation (orfh^nt 


clary System have resulted in a reduced degree of intra-European 
flexibility of exchange rates; and fourth, there seem to have been 
large changes in tfie ec|uilibrium real exchange rate between the U.S. 
dollar and the European cut rcncies.^^ It should be noted, however, 
that to some extent the overall poor performance of the purchasing 
power parities doctrine is specific to the 1970s. During the floating 
rales period of the 19‘20s, the doctrine seems 10 have l>ecn much more 
reliatile.^** 

d’he preceding discussion accounted lor the pcr.sisling deviations 
from purchasing power parities in terms of changes in real factors 
which affect equilibrium relative price structure. It should be noted, 
however, tliat even in the absence of sucli changes there is .i pre¬ 
sumption that, at least in the short lun, exchange-rate fluciuaiions 

” rhe (avluTC of the 1 cgi ossioii of ihc fraiu/DM cMh.ingc rale on the iam> of the 
whoic.sale price indices i.s explained in teiins ot the large thanges in ihe Ficnch 
intersectoral relative prices; for an elahrrr.ation see Frenkel (1981). 

For evidence see Frenkel (1976, 1978, I980fi) and Kivignian (19/8) 
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would not be matched by corresponding fluctuations of aggregate 
price levels. I'he discussion in Section II emphasized that in periods 
which are dominated by “news” which alters expectations, exchange 
rates (anil other asset prices which are traded in organized markets) 
are expected to be highly volatile. Aggregate price indices, on the 
other hand, are not expected to reveal such a degree of volatility, 
since they reflect the prices of goods and services which are less 
durable and, therefore, are likely to be less sensitive to the “news” 
whii h alters expectations concerning the future course of events. It 
follows, therefore, that in periods during which there is ample “news” 
which causes large fluctuations in exchange rates, there will also be 
large deviations from purchasing power parities.** 

I’he difference between the characteristics of exchange rates and 
national price levels is also reflected in their time-series properties and 
is fundamental for interpreting the deviations from purchasing 
powet paiities. The monthly changes in exchange rates exhibit little 
or no serial correlation, while national price levels do exhibit a degree 
of sei ial correlation. The “stickiness” exhibited by national price levels 
need not reflect any market imperfection but rather it may reflect the 
costs of price adjustment which result in the existence of nominal 
contrac ts of finite length. Likewise, it may reflect the results of confu¬ 
sion between relative and absolute prices and confusion between 
permanent and transitory changes. This difference between the 
time-series properties of exchange rates and prices is reflected in the 
low cotrelation between the practically random month-to-month 
exchange-rate changes and the serially correlated differences be¬ 
tween national rates of inflation. 

I'he different degrees of volatility of prices and exchange rates are 
illustrated in table 8 , which reports the average absolute monthly 
percentage changes in the various exchange rates and prices. As is 
evident, the mean absolute change in the various spot exchange rates 
has been about 2 percent per month (and even slightly higher for the 
changes in the forward rate). The magnitudes of these changes have 
been more than double the magnitudes of the changes in most of the 
various price indices, as well as in the ratios of national price levels. 
For example, the mean monthly change in the cost-of-living price 
index was 0.4 percent in Germany, 0.7 percent in the United States, 


On this see Mussa (1979a). It is noteworthy that the emphasis in the text has been 
on the words "large flufluaiions”; this .should be ronlrasled with periods during which 
there aic large secular changes in the exchange rate (like the changes which occurred 
during the f'rerman hypennflation). During such periods the .secular changes do not 
stem necessarily from "news" and need not be as.sociaied with deviations from pur¬ 
chasing power parities. 
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0.9 percent in France, and 1.2 percent in the United Kingdom. These 
differences are even more striking for the detrended series. 

The notion that exchange rates have been volatile is clearly illus¬ 
trated by figures 1-3 and by table 8. The comparison of the mag¬ 
nitudes of the changes in the exchange rates with the magnitudes of 
the changes in the price indices and in the ratios of national price 
levels may suggest, according to a narrow interpretation of the pur¬ 
chasing power parity doctrine, that exchange-rate fluctuations have 
been “excessive.” The previous discussion, however, has emphasized 
that exchange rates, being the relative prices of assets, are funda¬ 
mentally different from the price indices of goods and services and, 
therefore, are expected to exhibit a different degree of volatility in 
particular during periods that are dominated by “news.” An alterna¬ 
tive yardstick for measuring the degree of exchange-rate fluctuations 
would be a comparison with prices of other assets. Indeed, while 
exchange-rate changes have been large relative to changes in national 
price levels, they have been considerably smaller than changes in the 
prices of other assets like gold, silver, many other commodities that 
are traded in organized markets, and common stocks. For example, 
table 8 also reports the mean absolute monthly percentage change in 
stock-market indices. As may l>e seen, the mean monthly change in 
the.se indices tanged from over 3 percent in Germany to over 6 
percent in the United Kingdom. By these standards it is difficult to 
argue that exchange rates have been exces.sively volatile. 

Given the short-run deviations from purchasing power parities, it is 
relevant to explore whether these deviations tend to diminish with 
time or tend to persist or even grow in .size. In order to examine the 
patterns of the deviations, the autocorrelation functions and the par- 

TABI.F. 8 

Mf.an Absoiliie Pfrcfni ac.e Chan(.i-s in Prices ano Fxc.hanc.k Ratis 
Monihcy Data: Ji'NE 197^i-J\ iY 1979 


Vaktabi f 



FAchangc* R;Hc*s 
against the Dollar 

S«Hk 

— 


Country 

WPl 

COL 

Market 

Spot 

Foi ivai d 

COI./1 OL 

U.S. 

.009 

.007 

.087 




U.K. 

.014 

.012 

.066 

.021 

021 

007 

France 

Oil 

009 

.0,"»4 

.020 

.021 

003 

Germany 

.004 

.004 

.030 

.024 

024 

004 


NoTt,—All vjnabirs reprewm ihe atMoluic value* i»l monthly peTtcnUjtr chanf|c» in ihc daia WPl denote* ihc 
wholculr pnee index and COl denote* the index DaU on pntr* and exchanfje rate* are Irmn the IMF 

tape (May 1979 verMon) ITic snx-k, rnarkei indKes are from CaptiaJ intmtatumaJ (monthly i«»ue*> 
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tial autocorrelation functions of these deviations for the wholesale 
and the cost-ol-living price indices have been computed. The devia¬ 
tion from purchasing power parities during month t is denoted by A( 
and is defined as: 

A, = In 5, - lii(P/T*),. (10) 

Figures 11-13 illustrate the patterns of the deviations for the three 
exchange rates. As may be seen, the general pattern is very similar for 
the three exchange rates and for the two price indices. In all cases the 
autocorrelation function tails off at what seems to be an exponential 
rate, and in all cases the partial auttx-orrelation function shows a spike 
at the first lag. This pattern seems to indicate (as might have been 
expected on the basis of the time-series pi opcrties of exchange rates 
and price indices) that the deviations from purt hasing power parities 
follow a first-order autoregressive prtKess. It is noteworthy, however. 




(•ii »1 Ixmg litdiiit 

P* _ PI P* 



Fk.. 12 .— A, Fhe dollar/fram: deviarion.s frf)m PPP wiih wholesale price indices. /J, 
The doJiar/franc: deviaiions from PPP with indices. 
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MhoUtaU Prtc* lndi<ai 


C«it ftf liainf Indiiat 



Fk., 13.— A, The cloIlar/DM: deviauons from PPP wjlh wholesale price indices. B, 
The (lolUr/DM- deviations from PPP w'ith cosl-of-livinK indues 


that in all cases the value of’ the autoregression term is abfjut .9, 
indicating the possibility that the series may not satisfy the stationarity 
requirement. To allow for this possibility, the autocorrelation func¬ 
tions and the partial autocorrelation functions of A, - A,_,, that is, of 
the first difference of the deviations from purchasing power parities, 
have also heen examined. The results indicate that these differences 
are serially uncorrelated and thus imply that the deviations A, follow a 
random-walk process. In view of this possibility it is concluded that 
the deviations from purchasing power parities seem to follow a first- 
order, autoregiessive prcKess, but that the data do not provide 
sufficient evidence to reject the alternative hypothesis of a random 
walk,^® 


V. Concluding Remarks 

This paper examined some aspects of the operation of flexible ex¬ 
change rates. The analysis was based on the experience of the 1970s. 
The principal conclusions which may l>e drawn from the empirital 
work aie: 


It the dcvidtioii.s follow a raiuioin-walk protev,, then thev do not email (e\ ante) 
uncsploitcd pi'oht op(>oi luiiilics For a Mudy of the dcMations liom pun hasing p<iwcr 
paritie.s .see Roll (1979). Koi an an.dysis of equilibrium deviations fioni puithasing 
imwer pariiies sec .S.iidi (1977). It may be noted that the main dilfeienre heiwcen 
aeiepting the AR(1) rather than the random-walk hypothesis relates 10 the economu 
interpretation of the two alternative piiMesses. The tandom-walk pioeess implies that 
deviations from purrhasiiig power pannes do not tend to dimmish with the passage of 
time, while the stable .AR( 1) pi (x css implies that there are mechanisms which opera.le to 
ensure that, in the long run, purchasing power parities are satisfied. For the pur[x>se of 
forecasting the near f uture, however, there is very little tlifference lietween using the 
AR(I) prr)ce.ss with an autoregressive coefficient of 0.9 and using the landom-walk 
process. 
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i) 1 n spite ol' the extraordinary turbulence in the markets for 
foreign exchange, it seems that, to a large extent, the mar¬ 
kets have operated efficiently. It should be emphasized, 
however, that the concept of “efficiency" used in this context 
is .somewhat narrow in that it only refers to the notion that 
the markets do not seem to entail unexploited profit oppor¬ 
tunities. A broader perspective should deal with the social 
cost of volatility in terms of the interference with the effi- 
t iency of the price system in guiding resource allocation as 
well as with the cost of alternative outlets for the disturbances 
that are currently reflected in the volatility of exchange rates. 
As for the choice among alternative outlets for the distur¬ 
bances, one may argue that since the foreign-exchange mar¬ 
ket is a market in which risk can be bought and sold relatively 
easily, it may be reasonable to concentiate the disturbances 
in this market rather than transfer them to other markets, 
such as labor markets, where they cannot be dealt with in as 
efficient a manner. 

ii) The high volatility of exchange rates (spot and forward) 
reflects an intrinsic characteristic of the relative price of mon¬ 
ies and other assets that are traded in organized exchange, 
rhe pt ice of gold, the price of st<x.ks. as well as exchange rates 
between national monies depend critically on expectations 
concerning the future course of events and adjust rapidly in 
response to new information. In this perspective, the ex¬ 
change rate (in contrast with the relative price of national 
outputs) is being viewed as a financial variable which is de¬ 
termined in a macroeconomic selling. 

iii) During inflationary periods, variations in nominal rates 
of interest are dominated by changes in inflationary expec¬ 
tations; as a result, high nominal rates of interest are asso¬ 
ciated with high exchange rates (a depreciated currency). 
This relationship was supported by the empirical work, 

iv) The asset view of exchange-rate determination implies 
that "news” is among the major factors which influence 
changes in exchange rates. In this cotitext the key finding 
was the dependence of exchange-rale changes on the unex¬ 
pected changes in the rates of interest. 'I'his finding is in 
accord with the analytical prediedon that current exchange 
rates already reflect current expectations about the future, 
while changes in the current exchange rates reflect primarily 
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changes in these expectations which, by definition, arise 
from new information. 

v) The experience of the 1970s does not support the pre¬ 
dictions of the simple version of the purchasing power parity 
doctrine which relates the values of current measured prices 
to current exchange rates. The empirical work showed that 
deviations from purchasing power parities can be charac¬ 
terized by a first-order autoregressive process. 

vi) One of the key analytical insights that is provided by the 
monetary (or the asset-market) approach to the exchange 
rate is that exchange rates do not reflect only current circum¬ 
stances but also reflect those circumstances which are ex¬ 
pected to prevail in the future. This anticipatory feature of 
the exchange rate (which is emphasized by Mussa (1979i]) 
does not characterize (at least to such a degree) the prices of 
national outputs, which reflect to a large extent present and 
past circumstances as they are embedded in existing con¬ 
tracts. Consequently, periods which are dominated by large 
and frequent changes in expectations are likely to be peritxls 
in which the future is expected to differ greatly from the 
present and the past. Under such circumstances one may 
expect to find frequent deviations from purchasing power 
parities when the latter are computed using current prices. 
These deviations reflect the intrinsic difference between 
asset prices and national price indices.®' 

vii) Since commodity prices do not adjust fully in response 
to exogenous shocks, it seems that intervention in the 
foreign-exchange market, which ensures that exchange rates 
conform with purchasing power parities, would be a mis¬ 
taken course of policy. When commodity prices are slow to 
adjust to current and expected economic conditions, it may 
be desirable to allow exces,sive adjustment in some other 
prices. Further, changes in real economic conditions requir¬ 
ing adjustment in the equilibrium relative prices of different 
national outputs occur continuously. An intervention rule 
which links changes in exchange rates rigidly to changes in 

” It is interesting to note that this phenomenon was recognised by Gustav Cas.sel— 
he mo.st recognized pi uptinent of the purchasing power parity dex trine—aciording lo 
vhom: "The international valuation of the currency will, then generally show a len¬ 
iency to anticipate events, so to speak, and become mote an expiession of the internal 
aluc that the currency is expected lo possess in a few months, or perhaps in a year’s 
ime” (Cassel 1930, pp. 149-50). 
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domestic and foreign prices in accord with purchasing power 
parity ignores the txrcasional need for equilibrating changes 
in reJative prices. 


Appendix 

1. Exchange Rales 

The spot exchange rates are entl-of-moiiih rates obtained from the IMF tape 
(May 1979 version, updated to July 1979 using the November 1979 issue of 
iht Intenintwjwl fin/mdal Slatistifs) obmincA from ihe International Monetary 
Fund. 

T he forward exchange rates arc cnd-<»f-nii>nth rates for 1-month maturity. 
Ihe forwai'd rales for the U.K. pound and the DM for the period June 
i973-June 1978 are bid price.s obtained Iroin the International Money Mar ¬ 
ket (IMM). For the period July 1978-July 1979 they are sell prices obtained 
li'orn the Wall Street Jourruil. The forward rates for the Frenth franc for the 
period June 1973-July 1974 ar e bid prices calculated from the Weekly Rexhew 
publication of the Hatris Bank which reports the spot rate and the forwatd 
premium; in each case the closest Friday to the end of the month was chosen. 
For the peiiod August 1974-Junc 1978 the rates are bid rates obtained from 
the IMM, and ibr the period July 1978-July 1979 they are sell prices obtained 
Iroin the Wall Street Journal. 

2. Prices 

I he wholesale and cost-of-living price indices are period averages obtained 
fiom the IMF t.ipe, lines 63 and 64, 1 espectively. 

3. Rates oj Interest 

All interest tales are l-moiiih EurcMurrency tales obtained I'tom the Weekly 
Reiaeui of the Hai 1 is Bank. In all cases the bgures used cot respond to the last 
Friday of each month 

4. Slock Maikets 

The suick mat ket indices correspond to the List trading day ol the month. 
I he sources are rci/tita( Intmuitumal Perspertn’e ((leneva, Swit/.erland. monthly 
issues). 
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1 his paper uses historical U.S. data to directly estimate the contri¬ 
bution of intergenerational transfers to aggregate capital accuinula- 
noii, The cvirlence presented indkates that iiitcigencratioiial irans- 
feis auount foi the vast inajority of aggregate U.S. lapital lin ina- 
tioii; only a negligible Iraction ol actual capital accnmulaiioii can be 
liaicd to lilc-cycic or "Innnp” savings. .\ major difleienie belwcen 
this study ;ind previous investigations of this issue is the use of more 
accurate longitudinal age-earnings and age-consumption prohles. 
These profiles are simply too flat to generate substantial life-cycle 
savings. This paper stiggesis the importance of and need lor sub- 
siantially greater research and data collection on intergenerational 
tiansfers l.ife-cycic models of savings that emphasize savings for 
retirement as the doininam form of capital accumulation should 
give way to models that illuminate the determinants of intergenera- 
ticsnal transfers. 


During the past riuarter century, the life-cycle hypothesis (Modig¬ 
liani and Brurnberg 1954; Ando and Modigliani 1963) has emerged 
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as the principal theory of agjrregate saving in the American economy. 
It has stimulated a vast quantity of research testing and exploring its 
implications. Despite this effort, the importance of lifetime consump¬ 
tion smoothing to the process of capital accumulation remains unre¬ 
solved.* This paper pre.sents evidence that the pure life-cycle compo¬ 
nent of aggregate U.S. savings is very small. American capital 
accumulation results primarily from intergenerational transfers. 
Distinguishing the roles of life-cycle .savings and intergenerational 
transfers in the capital accumulation process is crucial to a number of 
economic issues. Economic models that incorporate intergenerational 
transfers can generate strikingly different results for a number of 
major economic questions, including the burden of the national 
debt, the impact of social security on savings, the incidence of tax¬ 
ation, the perpetuation of the inequality of wealth, and the optimal 
structuring of taxes to promote economic growth (see, e.g., Dia¬ 
mond 1970; Atkinson 1971; Feldstein 1974; Oulton 1976; Calvo, 
Kotlikoff, and Rodriguez 1979). 

This paper has two objectives. The hrst is simply to answer an 
accounting question, namely, can life-cycle savings alone account for 
the U.S. capital stock? The second goal is to answer an economic 
question: If, ceteris paribus, there were no intergenerational trans¬ 
fers, how large would the U.S. capital stock be? That is, if all such 
transfers were taxed in a confiscatory way, by how much would capital 
accumulation be reduced? We find that life-cycle savings cannot ex¬ 
plain the capital stock in an accounting sense and that in the absence 
of intergenerational transfers the U.S. capital stock would be substan¬ 
tially smaller. 

Our findings are based on a new methodological approach.* In 
particular, a variety of historical U.S. data detailing population, labor 
earnings, consumption, and government taxes and transfers are used 
to directly estimate the shapes of historic age-earnings and age- 
consumption profiles. The.se profiles are combined with data on rates 
of return to calculate a stock of life-cycle wealth. This stock of life- 
cycle wealth is compared with aggregate wealth holdings in the 
United States. If there were no intergenerational transfers, the stock 
of life-cycle wealth would exactly equal total U.S. wealth. When in¬ 
tergenerational transfers occur, these two slocks differ by an amount 
equal to the stock of net received transfers. Hence, comparing total 
wealth with life-cycle wealth indicates whether the life-cycle model 


' lobiii (1967) and Boskin (197H) argue that lilc-cvcie savings are ihc prcdominanl 
form of savings in the United Slates V^ile (1978) and Darby (1979) argue otherwise. 
Tobin and Dalde (1971) provide a somewhat interinediaic mcw. 

’ Our methtKlological approach was influenced bv and is similar to those of Briiuin 
(1978) and Darby (1979). 
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alone can explain aggregate U.S. capital formation. We find that 
lifetime consumption and earnings profiles do not exhibit the kind of 
shapes needed to generate substantial life-cycle savings. Log linear 
approximations to these profiles suggest that growth rates of real 
earnings slightly exceed growth rates of real consumption over the 
lifetime. Since the life-cycle theory strongly relies on a lifetime growth 
rate of consumption in excess of the growth rate of earnings, the 
life-cycle theory of savings with no intergenerational transfers is a 
very poor description of the process of capital accumulation in the 
U.S. economy. 

Our findings are in agreement with recent studies by White (1978) 
and Darby (1979) and rationalize other stylized facts about the U.S. 
economy at odds with the life-cycle theory. Mirer presents evidence 
from s(K ial security data that “the aged do not run down their wealth 
during their lifetime” (1979, p. 442). Indeed, after he adjusts for 
inteicohori differences in wealth at retirement, Mirer finds “that 
wealth clearly iticrea.ses with age” (1979, p. 442). Darby points out that, 
although the ratio of expected retirement years to expected life span 
increased by 67 percent from 1890 to 1930, aggregate savings rates 
showed no increase during this period as would be predicted by the 
life-cycle theory (1979, pp. 22-28). Atkinson (1971) and Oulton 
(1976) construct life-cycle models to determine how much of observed 
British inequality of wealth may be explained by this theory. The 
answer is, very little. After taking into account inequality in age- 
earnings profiles and realized rates of return, Oulton concludes, “The 
results indicate that none of these factors, either singly or in combina¬ 
tion, are capable of accounting for a substantial proportion of actual 
wealth inequality” (1976, p. 99). 

Section 1 presents a theoretical framework for considering the 
importance of intergenerational transfers to aggregate capital ac¬ 
cumulation. Section 11 discusses the procedure to estimate the stock of 
lifc-cytle wealth. The data are described in Scctitzn III. Section IV 
preseiiLs and intuitively motivates the findings. The sensitivity of the 
findings to reasonable possible errors in the data and estimation 
procedure is also considered. The fifth section attempts to account for 
the residual between total U.S. wealth and the stock of life-cycle 
wealth using the limited information available on actual intergenera¬ 
tional transfers. The sixth and final section of the paper presents the 
conclusions and discu.sses some of their implications. 


I. Life-Cycle and Transfer Wealth—a Theoretical Framework 

The division of the stock of wealth, W, into life-cycle and transfer 
components, L and T, respectively, is easily understood by consider- 
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ing a 3-period model of economic growth with identical individuals in 
each age cohort. At a point in time, aggregate wealth in the economy 
is the sum of individual wealth holdings. Let Wi stand for the wealth 
holdings of a representative individual of age i, and let P/ stand for 
the population of individuals age i; then 

W = WtP, + WiP^ + W 3 P 3 . (1) 

The fV(’s equal, by definition, accumulated flows of net received 
transfers plus accumulated flows of earnings net of government taxes 
less government transfers less accumulated flows of consumption, 
where accumulation takes place at the after-tax interest rate r. Let Tjj 
be the net transfer received at age i from individuals age / for indi¬ 
viduals who are currently age k. For example, represents the net 
transfer (which may be positive or negative) that a current 3-period- 
old individual received at the time he or she was 1 period old from 
individuals who were then 2 periods old. Let ef and elf represent the 
earnings and consumption of A:-period-o!d individuals when they 
were i periods old. Assuming that earnings, consumption, and trans¬ 
fers occur at the end of each period, IT,, the wealth of age 1 individu¬ 
als, equals zero. With this notation, and W 3 may be expressed as: 

W 2 = (r ?2 + r? 3 )(l + r) + {ej - c?)(l + r) 

W:, = (r ?2 + n.)(l + rf + + r^ 2 )(l + r) (2) 

+ (e^ -cl)(l + rf + (e^ -f?)(l + r). 

Combining (1) and (2) yields: 

W = T + L, (3) 

where T = + r)P, + ((7^2 + + rf + (Tl + Th) 

(1 + r)]P 3 and 1 . = (ej - c?)(l + r)P^ + [(e? - f?)(l -t r)^ -t- (1 -t- r) 

(ci - 

Equation (3) is the fundamental accounting relationship analyzed in 
this paper. Transfer wealth, T, corresponds to accumulated net re¬ 
ceived transfers, while life-cycle wealth, L, corresponds U) accumu¬ 
lated earnings less accumulated consumption. Clearly, in a world with 
no intergenerational transfers, T would equal z.ero and W would eqvial 
L. Our first goal is to establish the relative magnitudes of the two 
components T and L and thereby determine whether T.S. wealth 
holdings can be predominantly explained by life-cycle savings. Since 
substantially less information is available alxiut the \ allies ol the Cj, 
most of this paper is devoted to calculating the value of/.. Section V 
does, however, attempt to estimate T directly using fragmentary data 
and invoking steady-state assumptions. 

If the economy is in a steady state, net intergenerational transfers 
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received at a given age are consfanf through time, tio that; 

7-., = = ri, 


7',., = ri;, = n, 
7'., = n, 

T'j;, = n,. 


(4) 


Under the assumption that population grows at a constant rate n, 

Pi = + nf and P, = P^O + «)• (5) 

Since transfers received by age group i f rom age group ; equal the 
negative of group /’s transfers from age group i, 

T„P, = -T„P„ (6) 

and 


r,,p, = ~r„p.,. (7) 

From (4), (5), (6), and (7), T may be expressed in the steady state as:'' 


T = P, 


Ti,\ 


1 + r ^ (1 + rf 


1 + H 


... (1 + r'f j. 


(I 

(1 +r) 


(«) 


(1 +n)^ 


Under the assum[)tion that r = «, T equals the yearly net flow of 
transfers from old to young cohorts, I, multiplied by the transfer- 
weighted age gap, g, between donors and recipients; 


T = [P,(r,, + 7.,) + P,T,,] 


T,^P, + 2T„Pi + T.,,P, 

Pi(T,^ + T») + P;Tr, 


If r exceeds (is less than) n, accumulated transfer wealth will exceed 
(be less than) the annual flow of transfers times the weighted average 
age gap. Equations (8) and (9) show that the contribution of transfers 
to the total stock of wealth depends critically on both the volume of 
the annual flow of transfers and the age span of transfers. Equation 
(8) is invoked in Section V to directly estimate the size of T. 

The second goal of this paper is to ask the economic question, How 
would the U.S. capital stock, tV, change if, because of changes in taxes 
or tastes, intergcnerational transfer wealth, T, was reduced? Since 
consumption, leisure, and transfer decisions are jointly determined it 
is important to examine the possibility that changes in transfers might 


' By examining (8), one tan sec that T corresponds simply to the actuniulated 
holdings of transfers received by people who are currently alive from people who are 
currently dead. 



AGCREGATE GAHITAI. AC;CGMIJI,A I ION ^ j j 

induce or be associated with changes in consumption and earnings 
paths and thus alter the lite-cycle component of wealth. In addition to 
recognizing interdependencies between transfer wealth and life-cycle 
wealth, the effect of changes in transfer behavior on capital formation 
may be examined in fjoth partial and general equilibrium contexts 
The partial equilibrium change in fV resulting f rom a reduction in T 
holds wage rates and interest rates constant and corresponds to a shift 
in the household supply curve of capital. This paper considers only 
the steady-state partial equilibrium impact of changes in transfers on 
the capital stock. A calculation of the general equilibrium effect re¬ 
quires uking account of the responsiveness of the level and shapes of 
earnings, consumption, and lifetime transfer profiles to changes in 
capital accumulation.'* 


The Steady-State Partial Equilibrium Reduction in Capital Intensity Arising 
from a Reduction in Inter generational Transfers 

In order to analyze the partial equilibrium reduction in the stock of 
wealth, we hrsl note that life-cycle wealth, L, equals accumulated 
earnings minus accumulated consumption. Accumulated earnings, in 
turn, equals accumulated wages at full-time work minus the accumu¬ 
lated value of leisure. Letting C stand for accumulated consumption, 
S for accumulated full-time wages, and M for the accumulated value 
of leisure, equation (S') becomes; 

W = T + S- M-C. (3') 


The partial equilibrium percentage reduction in W tor a percentage 
change in transfer wealth, T, equals: 


aiV dM dC \ T 5 

dT W I dT dT I IL’ 


( 10 ) 


Equation (10) states that the proportionate reduction in the capital 
stock equals the share of transfer wealth in aggregate wealth, T/W, 
minus two additional terms indicating how accumulated earnings and 
accumulated consumption respond to changes in transfers. 

Our analysis of the response terms dA/ /d T and dCtdl assumes that 
the utility of consumption and leisure is separable from the utility 
derived from intergeneralional transfers. This assumption implies 
that the marginal rates of substitution Ix'tween consumption and 
leisure at different points in time are independent of the level of 


* K«tlik.olf(1979) deiTicmsii ales ihal general equilibrium ihangcs in capital intensity 
can Ije substantially smaller than partial equilibrium ihangcs. 

* Because wage rates are held eonsiani in this partial equilibrium analysis, dSIdT 
equals ?ero. 
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intergenet ational transfers. Two examples of utility functions exhib¬ 
iting this property are; 

f/o = log Co + log C| + log Cz + log /| 

+ log 4 + log 4 + « log r ,2 -I- a log Tiz (11) 

+ a log Tzi + a log Tz^ + a log -f- a log T^z 

and 


U„ = log Co + log C, + log Cz + log /, ^j 

+• log 4 + log 4 + n 17,. 

I'he term U„ stands for the utility of a representative individual of 
generation zero. The terms 4. and 4 correspond to leisure in 
different periods. In (11) the individual derives utility directly from 
the level of net transfers. In (IT) the individual derives utility from 
the utility, Ui, of his descendants. This is an example of an “overlap¬ 
ping utility f unctiftn." For both types of separable utility f unctions the 
marginal rates of substitution between consumption and leisure are 
independent of the level of a, the preference parameter influencing 
the size of transfers. In addition, the first-order conditions for the 
optimal choice of consumption and leisure involve the equality be¬ 
tween these marginal rates of substitution and the relative prices of 
consumption and leisure at different ages. Hence, neither changes in 
tiansfer preferences nor taxes on intergenerational transfers that 
affect only the price of transfers but not the price of consumption or 
leisure will alter the first-order conditions. 

Given the levels of transfers, the utility-maximizing levels of con¬ 
sumption and leisure can be separately derived from these first-order 
conditions and the lifetime budget constraint given below: 


Cl 


Cz 


C:, 


I -I- 


(1 


-r Af| + 


Mz 


1 4- r 


+ 


M, 


(1 -l-r)* 




1 + r 


(1 + rf 


+ Tyz + T,:t + 


Tz-. 


1 + r 


1 + r 


( 12 ) 


(1 +rf 


-f- 


Tzz 


(1 -t- r)* 


In equation (11), M,,Mz, and M 3 are the values of leisure in periods 
1,2, and 3 (by value of leisure we mean the number of units of leisure 
times the price [wage] per unit). The terms Si, ,V 2 , and is are full-time 
wages in periods 1, 2, and 3. Equation (12) indicates that lifetime 
consumption and leisure are financed by full-time lifetime earnings 
plus net lifetime received transfers. The separability assumption im- 
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plies that changes in preferences or taxes that alter the values of the 
T(j have only an income effect on the choices of consumption and 
leisure over the lifetime. 

We now demonstrate that this income effect of a change in trans¬ 
fers is identically zero for the case thatr equals n.® Using (5) and (6) to 
rewrite (12), we obtain; 


C, + 


1 -I- r 
•S2 


-f- 


(1 +rf 

■^3 


+ M, -I- 


1 -I- r 


Af, 


1 + r (1 + r)"* 


. - n) 

(1 +r) 


(1 +rf 

- n) 
(1 +Tf 


( 12 ') 


, T-.aUI +r)» - (1 + n)=‘l 
(1 + rf 

We can also write the budget constraint (12') in terms of aggregate 
transfer wealth per young person T' where T' = TIP,: 


C. + 

l + r 


(1 + 


+ M, + 


Mi 


+ - 


M, 


- s, + 


S.3 


1 + r 


(1 + rf 


1 + r (1 + r)'^ 

(r - Tt)(l + rxfT 
(1 -I- rf 


( 12 ") 


As is clear from (12') and (12"), when r = n, lifetime consumption 
and leisure are financed solely out of lifetime full earnings; all of the 
accrued interest on received transfers, as well as the principle itself, is 
used to maintain steady-state transfers per head at a constant level. 
When r = n, reducing transfers has no impact on the steady-state 
budget constraint, and the terms dM/dT and dCldT in (10) are zero 
(i.e., since lifetime consumption and earnings paths slay the same 
when r = n, steady-state life-cycle wealth, L, will be unaltered by 
changes in steady-state transfer wealth, T). I'hus, any decline in 
transfer wealth, T, will reduce total wealth, W, dollar lor dollar in 
partial equilibrium when r = n. 

In the case that r differs from n, steady-state changes in the level 
and pattern of transfers (the T,, terms) affect the steady-state budget 
constraint only insofar as they alter the stock of transfer wealth T. 
This is a general proposition that can easily be shown to hold inde¬ 
pendent of the number of periods in the model. When r exceeds n, 
part of lifetime consumption and leisure is financed by lifetime trans¬ 
fers. Again, under the separability assumption, the reduction in trans- 


' In [he case of a tax on iransters, the inctnne effect ot a ch.inge in transfers when r = 
n will be zero only for the rase of a compensaled tax on transfers. Compensated rather 
than uncompensated tax changes are the appropriate focus ol studies of government 
tax policies toward savings. See Diamond i970 and Kotlikoff and Summers 1979. 
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fers has simply an income effect on consumption and leisure and, 
assuming both are normal goods, will reduce lifetime consumption 
and raise lifetime earnings. In (10) both dMIdT and dCldT will be 
positive when r exceeds n. The reverse will be true when n exceeds r. 
It is easy to demonstrate that introducing labor-augmenting produc¬ 
tivity growth simply requires relabeling n everywhere as the popula¬ 
tion growth rate plus the productivity growth rate. In the ca.se of 
productivity growth, the steady state is characterized by a constant 
level of transfers per effective worker. 

The value of these terms depends on the particular preferences 
determining the levels and shapes of consumption and leisure paths. 
As one example, table 1 gives the value of the transfer wealth- 
adjustment factor [1 — (dMIdT) — (dC/dT)] for a particular spec¬ 
ification of preferences for consumption and leisure and for dif¬ 
ferent values of r and n. I'he example assumes a 55-year life span 
and uses the logarithmic utility function with time-preference 
parameter p and leisure-preference parameter a J 

J t) fi) 

log C,e~’“dt + «J log (1.3) 

The It terms in (13) are units of leisure at different points in time. The 
calt Illations underlying table I are presented in Kotlikolf and Sum¬ 
mers (1980, appendix A). 

The figures in table 1 indicate that the response of consumption 
and labor supply can be quite important in determining the final 
partial equilibrium effect of a reduction in transfers when the interest 
I ate dif fers from tlie economy’s growth late. For the specific 
logarithmic function chosen here, a real interest rate that is 1 percent 
higher than the rale of population plus productivity growth generates 
total wealth response numbers approximately equal to 0.7. If, on the 
other hand, the rate of jjopulation plus productivity growth exceeds 
the real interest rate hy 1 percent, reducing transfer wealth by $1.00 
reduces aggregate wealth by $1.40. 

For the United States the annual rate of population growth has 
averaged 1.40 percent from 1900 through 1974, the period of this 
study (see U.S. Department of Commerce 1975, pt. 1, p. 8). Produc¬ 
tivity growth as measured by the annual percentage change in real 
CiNP [>er man-hour has averaged 2.20 percent over this period." 


' The utility runctiiiii in (13) corresponds only to that part ot tiital utility pertaining to 
consumption anti leisure. Any set of prelercnces for transfers may lie added to (1.3) as 
long as utility of consumption and leisure is .separable from the utility ol iransters. The 
assumption of a steady stale with pnxltictiviiy growth also requires that the entire utility 
lunction be homotlietic. 

" U.S. Department ot Commerce 1973, series A167 and A168, p. 210, and U.S. 
Department of Labor 1978, p. 336. Kendrick’s scries of output per man-hour is used 
for the years 1900-1910, and the Bl-S series thereafter. 
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lABLE 1 

Transkek We^ai.iii Aojuvemeni Factor 


r 

71 

P 

dM dC 

dT dr 

.02 

.01 

02 

.73 

.02 

.01 

03 

71 

.02 

.02 

.02 

1.00 

.02 

.03 

.02 

1 40 

.02 

.04 

.02 

2.00 

.03 

.01 

.01 

.58 

.03 

01 

.05 

.48 

,03 

.02 

.03 

.71 

.03 

.03 

03 

1.00 

.03 

,04 

.03 

1.43 

04 

.01 

.01 

.45 

.04 

.01 

07 

.30 

.04 

.02 

.04 

49 

.04 

03 

.04 

70 

04 

.04 

04 

1.00 


Adding 1.4 to 2.2 yields 3.6 percent as the rate of population plus 
productivity growth. This paper calculates a portfolio-weighted net 
nominal rate of return for the U.S. economy from 1900 through 
1974. After subtrac ting inflation, the real annual net after-tax rate of 
return in the U.S. economy averaged 4.5 percent between 1900 and 
1974." 

The fact that real interest rates have on average exceeded growth 
rates suggests that part of U.S. consumption and leisure may well have 
been financed by interest earned on intergenerational transfers. 
Hence, a dollar reduction in transfer wealth can be expected to 
reduce aggregate wealth by less than $1.00. The calculated 1 percent 
gap between r and n suggests that eliminating transfer wealth in the 
U.S. economy would reduce total wealth by about 70 percent ol the 
amount of transfer wealth. While the 70 percent figure is meant to be 
suggestive rather than precise, it appears that almost any choice of 
preferences would yield off set factors not far from 70 f>ercent."’ That 


' t o obuiin j I eat net rale <it icuivn senes, we suirti aet the annual iTerccnlage rliange 
in the (T1 Iidiii our net nominal leliirn sciics indicated in U.S. DepaiTtnrnt ol 
Uommeric (197.'), pt. 1, pp. 210-11. table B2) and US. department of ('.omincrie 
(I97H. p. 490). 

riicse calculations were performed lor die moic general isoel.cstii uliliiv lunetion. 
A wide range of parameter values lor rates of risk aversion and time prelerence 
produced values of [ 1 - {dMIdT) - (dC/d'Dl ranging from 0.6 to 0 75 when r exceeds 
n by 1 percent. While it is unclear what iililily lunetion is most appropriate, thc^e 
parameter values cover a wide range of different slia[)es ol consumption and leisure 
profiles and, we presume, txnind the likely rr^pmsc. The assumption implieit in these 
utility functions that consumption and leisure rcspmd homolhciically to changes in 
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is, the life-tycle offset to a change in transfer wealth is likely to be 
small. 


II. The Estimation of Life-Cycle Wealth—Methodological 
Approach 


I’olal life-cycle wealth in the U.S. economy equals the sum over all 
living persons in the economy of life-cycle assets. Each person’s life- 
cycle assets correspond to his or her accumulated earnings less ac- 
cumtiiated consumption where accumulation occurs at the actually 
leali^ed rates of return. If data existed detailing each person’s earn¬ 
ings, consumption, and realized rate of return on assets at each point 
in lime in the past, it would be easy to check whether U.S. wealth 
holdings could be explained predominantly by life-cycle accumula¬ 
tion. 

Obviously, such detailed individual specific data are not available. 
However, historical data for the United States on aggregate earnings, 
aggregate consumption, rates of return, and age-earnings and age- 
consumption prohies may be u.sed to carry out this life-cycle asset 
computation on a cohort-by-cohort basis. 'I bis paper treats individu¬ 
als of each sex within an age cohort as identical and estimates the 
average excess of after-tax earnings plus government transfers over 
consumption experienced by members of that age-sex cohort during 
each of their adult years in the past. These differences are then 
act iimulated up to 1974 using historical net nominal interest rates. 
The total over all age-sex cohorts of these accumulated life-cycle 
assets is then ccjinpared with the 1974 value of total U.S. private net 
worth. 

Life-cycle wealth of the age-sex cohort that is age a and sex j in 
1974, Lj(a), is given by 


^ 0 + i) - Cj{a, 0] 


} [ [1 -f r(a, k)]}Pj(a). 


(14) 


transfers may be inappropriate. Even if the utility lunctum is homothelic, capital 
market lonstraints precluding fiorrowing when young against future earnings may 
make the consumption and leisure response to a reduction in transfers nonhoinothetir. 
These capita! market consiraincs may not be binding in the presence of transfers 
because the excess of consumption over earnings when young is financed by the 
received transfers. Eliminating or reducing transfers could make these capital market 
constraints binding and require proportionately less consumption and leisure when 
young than when old. 
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In equation (13), Pj(a) stands for the number of pieople alive in 
1974 who are age a and sex/ The terms£j(a, i), Gj(a, t), and Cj(a, t) 
are, respectively, the average after-tax earnings, government trans¬ 
fers, and consumption of the age a-sex j cohort at the time its 
members were age i. Age 18 is taken as the age of adulthood. Con¬ 
sumption expenditures by adults on children under the age of 18 are 
considered to be a part of adult consumption rather than intergener- 
ational transfers." The term r(a, k) is the economy-wide annual net 
nominal interest rate received by the age a cohort during the year the 
cohort’s meml>ers were k years old. 

Our estimate of total life-cycle assets in the economy in 1974 equals 
the sum over all age-sex cohorts of the estimated values of Lj (a)." The 
difference between life-cycle wealtli and total U.S. wealth in 1974 is 
our estimate of transfer wealth. 


Calculation of Longitudinal Profiles of Net Earnings, Government Transfers, 
and Consumption 

The essential idea involved in these calculations is to use cross- 
sectional distribution profiles to allocate aggregate flows of net earn¬ 
ings, government transfers, and consumption to different age-sex 
cohorts in a given year. By performing this computation for each year 
from_1900to 1974, we obtain the longitudinal profiles£j(o,i).Gj(a,i), 
and C,(a, i) as i varies. _ 

To illustrate the computation for £,(a, i), we define the following 
terms: e, is the average earnings of 40-year-old male workers in year t; 
g„ (a, t) is the ratio of average earnings of male workers at age a in year 
t to average earnings of 40-year-old male workers in year I; gf(a, t) is 
the ratio of average earnings of female workers at age a in year t to 
average earnings of 40-year-old female workers in year <; k, is the 
ratio of average earnings of 40-year-old female workers to average 
earnings of 40-year-old male workers in year t; a,„(a, I) is the percent¬ 
age of males age a with work exjjerience in year /; oif(a, t) is the 
percentage of females age a with work experience in year f; H, is total 
after-tax labor income in year/; P„{a,t) is the population of males age 
a in year /; and Pf(a, t) is the population of females age a in year t. 
Given information on all of the above variables except e,, equation 
(14) is used to solve for eg. 


" 4'reatiiig consumption of children under age 18 as their own lite-cvcle consump¬ 
tion would greatly reduce our estimate of life-cycle wealth and greatly increase our 
estimate ol transfer wealth. The age of adulthood ts somewhat arbilrarv, but 18 apjrcars 
to be a reasonable number and of general interest. 

Assets of children under age 18 are assumed to be completely inherited and are not 
included in life-cycle wealth. 
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100 

= ft X 0«m(a. t}Pm(a, t) 

« = 18 

+ ^igAa> 0«/(«. t)Pf(a, 01- 

Given e,, E,„{a, i) for males and £/(«, 1 ) for females satisfy; 

£m(«> 0 = E,„[a, a - (1974 - /)! = ftgm{a, t)a„{a, t), 
Ef{a, t) = Ef[a, a — (1974 — <)1 = «A(^/(a, 0- 


(15) 


(16) 

(17) 


The prticedure for computing the longitudinal profiles of con¬ 
sumption, C,(a, i), is identical to that just described for earnings; a 
cross-sectional profile of relative consumption by age and sex is used 
to distribute aggregate U.S. consumption in each year to different 
age-sex cohorts. 


Calculation of Net Nominal Interest Rates Series 

rhe interest rate term r(a, i) on (13) depends only on the year / since 
r(a,i) = r[1974 - (a — t)] = r(t). The computation is performed using 
two different interest rate lime series. Series 1 was calculated by 
dividing U.S. wealth holdings into six separate assets, estimating a rate 
of return series for each asset, and then weighting each asset’s rate of 
return by its share in the U.S. portfolio for the particular year in 
question. 

Seiies 2 is based on the wealth augmentation relation: 

IV,= IV,(1 + r,) + £', +G, - f:,. (18) 

Equation (18) indicates thal private U.S. net worth in year t + 1 
equals U.S. net worth in year / plus savings in year/. Savings in year/ 
is, in turn, equal to net income on assets, r,IV,, plus alter-tax labor 
income and government transfers, E, and C,, less aggregate con¬ 
sumption in year /, C,. Equation (18) is .solved for r, using time-series 
data on 1V„ E,, G,, and C,. 

III. Description of Data Used to Calculate Life-Cycle Wealth 

This section dcscrilx-s sources and procedures for several of our data 
series. Kotlikoff and Summers (1980) provide a detailed description 
of all the data and adjustments to the data used in calculations. 

Information on compensation of employees after 1929 is obtained 
from the Natioruil Income Accounts and, tor the years 1909-29, f rom 
Leven, Moulton, and Warburton (1934) and Kuznets (1941, 1946). 
Prior to 1909, employee compensation is imputed from Kendrick's 
estimates of net national product using the ratios of employee com- 
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pensation to national product from 1909 to 1918.'-’ The imputation of 
labor income to the self-employed follows Christensen (1971). The 
number of proprietors is multiplied by the average earnings of full¬ 
time equivalent employees. The chief bias here is simply that the 
self-employed may earn more or less on average than employees. To 
insure that the results do not reflect a substantial underestimate of 
entrepreneurial earnings, the estimate of entrepreneurial earnings 
for each year is increased by 20 piercent. 

Estimates of state and federal income taxes paid on labor income 
were obtained from IRS Statistics of Income, and t\\e National Income and 
Product Accounts. Combined employer and employee social security 
and health insurance taxes were obtained from the Social Security 
Annual Statistical Supplement for various years. 

Estimates of the age-earnings profiles for men and women were 
obtained by fitting separate regressions to social security estimates of 
median annual earnings of workers at different ages for the years 
19.50 through 1975.'"* The general shapes of the profiles predicted by 
the regressions are quite similar throughout the 1950-75 period. For 
the years prior to 1950, the predicted male and female age-earnings 
profiles for the year 1955 were used,'"’ 

Tbe value of X,, the ratio of average earnings of 40-year-old f emale 
workers to average earnings of 40-year-old male workers in year!, was 
taken to be .55 throughout the period. This appears to be an upper- 
bound value for X, (see Kotlikoffand Summers 1980, p. 46, n. .35). 

Values for work experience rates by age and sex, a„(a, t) and ocf(a, 
t), are available only after 1959. Substantially more information, espe¬ 
cially for the early 1900s, is available on labor force participation rates 
by age and sex. Regression analysis lor the post-1959 period indicates 
that work experience rates can be predicted quite well by functions of 
age and labor force participation rates. This post-1939 regression was 
used to estimate the « f unction for each year from 1900 to 1974 based 
on labor force participation rates for the appropriate year. The labor 
force participati<in rates equal the values predicted from regressions 
of lalxir force participation rates on fifth-order age polynomials for 
each sex and for different census years. 

While cross-sectional distribution functions by .sex and age were 
computed for social security and medicare transfers, we assumed that 


See Ic-veii e( .»!. ItlS-t, p. 1 5ii, Ku/uets 1941. pp, 322-2:1. aiitl Keiidiick's scnc!. .Xti 
and B61 in LI .S. Departnieni <»! ('.ninnierce (1973), pp. 182.222. lie<aiiseol limiLaiinns 
on population estimates on individuals atxive age 85, we treat all mdiv idiials who were 
above age 85 in 1974 as it they were age 85. Siiue 85-year-olds in 1974 did not reach the 
age 18 until 1907, we really do not use any data for the years piior to 1907 in this paper. 

T'hese and other regression results are reported in KotlikoH and Summers (1980). 

The year 1955 was chosen Ix'cause by 19.55 the social security ilata cover most of 
the private economy's work lorce. 
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Other transfers, inciiiding veterans’ benefits and welfare payments, 
were distributed in the cross-section over the years 1900-1974 ac¬ 
cording to that year’s age-earnings distributions. 

Total consumption expenditure, Z/, is taken from National Income 
Accounts after 1929 and from Kendrick prior to 1929. Ideally one 
should subtract expenditures on consumer durables and add imputed 
rent on consumer durables to obtain true economic consumption. 
The difficulties of implementing this for the pre-1929 years led us to 
simply use the consumer expenditure series. In any case, there ap¬ 
pears to be very little difference between the consumer expenditure 
series and the true economic consumption series.'® 

The cross-sectional consumption distribution functions were ob¬ 
tained from the 1960 and Consumer Expenditure Survey (CES) 

tapes. Unfortunately, similar data are not available to generate these 
profiles lor earlier years. Life-cycle wealth is computed using the 
1972-73 cross-sectional profiles throughout the period 1900-1974. 
The 1960 profiles generated essentially the .same level of life-cycle 
wealth. 

In distributing total hou.sehold consumption to household mem¬ 
bers, household heads and their spouses are assumed to consume 
equally; all other household members, including children, were allo¬ 
cated 50 percent of the household head’s consumption. The total 
consumption of children under the age of 18 was then reallocated to 
the household head and spouse, assigning each one half of children’s 
consumption in the case of two spouses, or giving all the children’s 
consumption to the household head if he or she was single. The 
general shape of the profiles was quite insensitive to the assumption 
that other household members and children consume only 50 percent 
of the consumption level of the household head. The 50 percent 
assumption generated slightly more life-cycle wealth and is the one 
used. 

Net nominal interest rate series 1 is generated from data on histori¬ 
cal rales of return and Goldsmith and Lipsey’s (1963) data on 
portfolio shares. From Goldsmith and Lipsey’s balance sheets we 
constructed seven asset categories plus liabilities. These are tangible, 
noncorporate business assets including land and structures, residen¬ 
tial land and structures, money, short-term claims (savings accounts 
and U.S. Treasury bills), corporate stock, long-term corporate bonds, 
and U.S. savings bonds. A rate of return series was calculated for each 
asset as well as the liabilities. A weighted rate of return was computed 

'* Chnstensen and Jorgenson’s <1973) "true” consumption series (p. 17) is very close 
to the NIA series for the years 1929-69. The Chnstensen and Jorgenson series is 
slightly higher for most years; using the Christensen and Jorgenson scries for the 
pfjsl-1929 years would, therclore, lead us to calculate less life-cycle wealth. 
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taking the share of each item in net worth during the period consid¬ 
ered as the weight. The nominal rales of return were converted to 
after-tax nominal rates of return using average income tax rates for 
each year. The series 2 net nominal interest rates were computed for 
the post-1929 period using equation (22). Reliable net worth data are 
not available until 1929, In the computation the series 2 rates from 
the pre-1929 years are filled in with the series 1 values. 


IV. The Size of Life-Cycle Wealth 

At the end of 1974 total net worth in the U.S. economy equaled 
$4,154 trillion. Of the $4,154 trillion, $134 billion represent the 
tangible assets of nonprofit institutions (see FRB-MIT balance sheets); 
in 1974 nonprofit institutions owned 4.4 percent of corporate equities 
(SEC 1977, p. 11). Under the assumption that this percentage applies 
to all other financial assets, total net worth of nonprofit institutions in 
1974 would equal $270 billion.” After subtracting $270 billion from 
$4,154 trillion, the 1974 net worth figure for the noninstitutional 
household sector is $3,884 trillion. This is the number to compare 
with the life-cycle wealth calculations. 

In addition to presenting total life-cycle wealth of individuals, a 
second life-cycle wealth concept, LCW2, is calculated that adds to 
LCWl an upper-bound estimate of accumulated inierspousal trans¬ 
fers. Many economists would include within their definition of life- 
cycle wealth the amount of accumulation that arises from inierspousal 
transfers. Interspou.sal transfers give rise to wealth accumulation, 
first, because wives are on average younger than husbands and, sec¬ 
ond, because females live longer than males. Since it is conceptually 
difficult, if not impossible, to exactly trace individual patterns of 
household formation that involve marriage, divorce, and death, an 
upper-bound estimate of the stock of interspousal accumulated trans¬ 
fers is added to LCW1. To be specific, at age 21 all males are assumed 
to marry 18-year-old females.'" If either spouse dies prior to reaching 


See SF.C 1977, p. 11. rhis assumes ihai nonprofu inslilulions ha\e no liabilities. 

The 1972-73 t;F..S data indicate that the average age gap between husband and 
wile IS 3 years. The number of deaths in an age-sex cohort in a particiilai year was 
obtained by multiplying age-, sex-, and year-sperific death rales from the L^S census 
bureau’s population estimates. HisUmccd Slatultcs (U.S. Department of Commerce 1975, 
vol. 1, death rates, series B 181-92, pp. 61-62) are smoothed. Even in the absence ot 
marriage, our LCW2 concept would be of interest because il cssciiuall> treats indit idu- 
als as it their wealth at each moment were fully annuitired: i.e , given the uncertainly of 
the date of death, individuals in a life<^ycle model with no betjuest motive would always 
purchase annuities. Hence, our procedure of essentially passing a cohort's bequest over 
to surviving members mimics the operauon of an annuities market. Thus 1.CW2 can be 
thought of a.s essentially the life-cycle wealth of the U.S. economy il individuals always 
purchased annuities. 
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when old. riie historical reality for the United States is simply that 
longitudinal earnings and consumption profiles have not exhibited 
the kinds of shapes required for substantial life-cycle accumulation. 
For males, earnings profiles greatly exceed consumption profiles over 
most of the life cycle. For females, however, the opposite is true. 
I.if e-cycle wealth starts out negative and remains negative over all 
ages. Figures I and 2 graph longitudinal profiles of the sum of male 
and female average earnings plus government transfers and average 
consumption for two different age cohorts. The profiles are pre¬ 
sented in real 1967 dollars. The diagrams clearly show that the male 
excess of earnings over consumption is es.sentially offset by the female 
excess of consumption over earnings. 

Contrary to the life-cycle simulation studies that have generated 
sufistaniial life-cycle wealth, the actual growth rate of lifetime con¬ 
sumption does not substantially exceed the actual growth rate in 
lifetime earnings.™ For example, the male age cohort that reached 


R(}g<^r Guidon lias suggested to us that observed age-earnings prohles may difler 
from true agc-eainings profile.s lo the extent that hrins and workers are engaged in 
implu It contractual arrangements. Firms could pay out to workers less (more) than they 
truly earn when young and more (less) than they truly earn when old. Some part of a 
worker's life'cycie saving or dissaving would then be accomplished within the firm and 
would correspond to a claim (oi liability) attached to the firm'.s assets. While we would 
strongly contest the empirical validity of this proposition, certainly a very upper-bound 
estimate ol life-cyr le savings with firms would be the diflerence between the market and 
replacement cosu of capital m the corporate sector. U.sing 0.819, the average value ol q 
over the period 1952-74 (von Furstenberg 1977) and the 1974 replacement value of 
corporate capital of $1,679 trillion gives an upper-bound estimate of $304 billion for 
this effect. 
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Fig. 2.—Sum of male and female longitudinal average earnings and average con¬ 
sumption profiles, age 18 in 1940-age 52 in 1974. 


age 18 in 1920 experienced a growth rate of 2.93 percent in real 
earnings between ages 18 and 65. In comparison, the male growth in 
real consumption between ages 18 and 72 was only 2.32 percent. For 
the corresponding female cohort, earnings growth was 2.23 percent 
and consumption growth only 1.72 percent. Real consumption pro¬ 
files are fairly flat, and real earnings profiles peak in late middle age, 
not at early ages. Both of these factors militate against the life-cycle 
theory of savings as an explanation of U.S. wealth holdings. 

The large transfer wealth value of $3,151 trillion cannot simply be 
explained as private transfers from old to young offsetting forced 
government social security and medicare transfers from young to old. 
Under the Barro (1974) view, introducing unfunded social security 
into an economy will not lead to a change in consumption but will 
increase private net transfers from old to young. If Barro’s view is 
correct, it is easy to determine how large private transfers would have 
been with no social security and medicare system by simply setting all 
historic social security and medicare benefits and taxes to zero in the 
calculation. This procedure led to a series 1 value of $775 billion and a 
series 2 value of $1,094 billion for LCW2. Hence, even if Barro is 
correct in his view about unfunded social security, transfer wealth 
would still have exceeded $2.79 trillion in 1974 if there had never 
been a social security program. 

One pmssible problem with the estimates is that the age- 
consumption profiles prior to 1960 may have looked substantially 
different from those after 1960. To explore this issue, relative con¬ 
sumption prior to age 40 was reduced by 10 percent and relative 
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consumption after age 40 was increased by 10 percent. Thus, 60- 
year-olds were effectively assigned an extra 20 percent consumption 
relative to 30-year-olds in every year from 1900 to 1974. Altering 
consumption profiles in this manner produced values of —193 for 
LCWl and 1,702 for LCW2 using series 1 and —169 for LCWl and 
2,178 for 1.CW2 using series 2. These numbers are still substantially 
smaller than the $3,884 billion of 1974 U.S. net worth. 

Another issue of concern is whether our data series on aggregate 
flows accurately describe U.S. experience in the 1900s. As mentioned, 
to insure against underestimation of the labor income of the self- 
employed, we have already increased our estimates by 20 percent. 
Raising this add-on factor to 30 percent increases the series 1 values of 
LCWl to —679 and LCW2 to 872. It is worth noting that eliminating 
any self-employment add-on factor lowers LCWl to —1,7>39 and 
LCW2 to -714. 

A final data issue concerns consumption expenditures of nonprofit 
institutions as well as contributions of individuals to such institutions. 
Ideally we should subtract institutional cttnsumption from aggregate 
consumption and add contributions to these institutions to arrive at 
total household consumption. While not all the data needed for this 
adjustment are available, what data are available suggest that the 
adjustment would be very small and would slightly lower our estimate 
of life-cycle wealth.^' 

The LCW2 wealth concept effectively deals with possible bias aris¬ 
ing from differential survival probabilities between the rich and the 
poor. This calculation assumes that at least one member of each 
household survives to age 75 and attributes all household life-cycle 
accumulation to the surviving spouse(s). Hence, this procedure as¬ 
sumes that rich and poor households have identical survival prob¬ 
abilities, which eliminates this issue of bias. 

Another type of aggregation error could arise if some households 
continually received higher rates of return on their assets than other 
households. In order to investigate this issue one would need detailed 
knowledge of the joint distribution of rates of return and household 
consumption and earnings patterns. Unfortunately, this information 
cannot be obtained f rom existing micro data, and this fact precludes a 
reasonable assessment of the magnitude of this source of error. 


III 1970, for example, tile level of philanthropic payments of individuals totaled 
$16.09 billion (U.S. Department ol Commerce 1975, vol. 1, series H 399-400, p. 359). 
In comparison, the N/a (U.S. Department of Commerce 1976, p. 90, lines 101 and 102) 
reported 1970 expenditures o( institutions for religious, educational, and welfare 
activities totaled only $11.32 billion. 
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V. Explaining the Residual—the Stock of Transfer Wealth 


This section investigates the extent to which U.S. data on intergener- 
ational transfers can explain the large residual between total net 
worth and the estimates of life-cycle wealth. Unfortunately, there are 
very few data detailing nontaxed intergenerational transfers. These 
nonenumerated transfers take many forms. Parents who lend money 
at below market rates to their children for a down payment on a house 
or a business are engaged in an intergenerational transfer. A father 
who makes his son a full partner in a lucrative business can effectively 
transfer large sums of money with no tax liability to the IRS. Parents 
who f ully or partially support their children through college or after 
college are making transfers. A grandmother’s gift of her expensive 
wedding china and rings to her granddaughter is an intergenera¬ 
tional transfer. Transfers in these forms, as well as outright monetary 
gifts, are very rarely reported. 

To obtain some idea abiiut how large aggregate intergenerational 
transfer flows need to be in order to explain a $3,151 trillion stock of 
transfer wealth, consider the multiperiod analogue to equation (8); 


T = 


t -£ - _[1 - 

(r - n) 


(19) 


Equation (19) is a simplified expression for the steady-state stock of 
transfers: the formula assumes that all transfers are given at age G 
and received at age /. The yearly flow of intergenerational transf ers is 
t, the age of death Z), and r and n are rates of interest and population 
plus productivity growth, respectively. In the case r = n, this expres¬ 
sion reduces to T = t{G - I), the analogue to equation (9). Table 3 
evaluates the age gap factor, that is, the terms multiplying I for 
various parameter values and taking D to be 55P 

Exactly which age gap is appropriate is unclear.^ An age gap of 30 
allows some significant transfers to grandchildren as well as children. 
The age gap factor in this case for a .01 excess of r over n is 45. Hence, 
to explain $3,151 billion in transfer wealth, the yearly flow of trans¬ 
fers would have to equal 3,151 divided by 45, or $70 billion. A 2 
percent differential between (r — n), again assuming a 30-year gap, 
would require only a $46 billion annual transfer flow. 

The 1962 Federal Reserve Survey of Consumer Finances was used 


11 wc take age IB a.s the age of adtilth<K><I. the value of 55 for O (orresponds to a 
leal-world age of death of 73. 

“ To prcfisely calculate a weighted age gap we would need data detailing age o( 
donors and recipients as well a.v the size of the tran,sfcrs. Such data are currently 
unavailable. 
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TABLE 3 
Ace Gap Factor 

T — n 


Ace Gap (G -/) -.02 -.01 0 .01 .02 .03 


20 . 8 13 20 31 49 78 

2b . 11 16 25 38 59 92 

30 . 14 20 30 45 68 103 

35 . 17 24 35 51 76 113 


(o estimaie the bequest portion of the yearly transfer flow. 
Specifically, the distribution of net worth holdings by age, sex, and 
marital status was calculated. Next, 1962 mortality probabilities were 
applied to this distribution to arrive at an estimated distribution of 
bequests by age, sex, and marital status. An additional adjustment to 
these figures was made in order to reconcile the U.S. aggregate net 
worth value estimated from the Federal Reserve tape with the FRB- 
MIT net worth data. 

Unpublished data collected by Paul Menchik and Martin David 
from the Washington, D.C., Inheritance Tax File for the year 1967 
leveal that males who were married at the time of their death left 10 
percent of their estate to their children. Married female decedents left 
19 percent of their estates to their children. The proportion left to 
grandchildren and other young relatives is unclear, but it probably 
does not exceed 2 percent for males and 4 percent for females, which 
represents half of the percentage contribution to other relatives. For 
single male decedents, 32 percent of the estate is left to children; the 
single female proportion is 37 percent. Again taking half of the 
contributions to other relatives as “distant in age” intergenerational 
transfers, another 22 percent can be ascribed to single males and 23 
percent to single females. Using these figures and the Federal Reserve 
simulated bequest distributions, the estimated “distant in age” in¬ 
tergenerational bequest transfer flow in 1962 was $11.9 billion. Mul¬ 
tiplying this figure by the 221.6 percent growth in the nominal value 
of total U.S. net worth between 1962 and 1974 gives us an estimate of 
the 1974 bequest transfer flow of $26.4 billion. 

In 1974 the total value of life insurance death benefits equaled 
$8,885 billion. In conjunction with the Menchik numbers, this raises 
the $26.4 billion bequest flow figure to $28.9 billion. 

We also estimate the flow of intergenerational transfers from par¬ 
ents to children that occur in the form of financial support during 
college. In 1974 college enrollment totaled 8.8 million students (U.S. 
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Department of Commerce 1978, p. 138). In 1976 parental contribu¬ 
tions to college-enrolled children who were taken as tax deductions 
averaged $1,738 (Froomkin 1977, p. 479). Assuming that non-tax 
dependent college-enrolled children received one-quarter of this level 
of support, average support from parents in 1976 was $1,270. Re¬ 
ducing this number by 15 percent, the growth in tuition between 1974 
and 1976 (U.S. Department of Commerce 1978, p. 165) suggests a 
level of college support payments of about $1,080 in 1974. This type 
of intergenerational transfer could then account for $10.3 billion of 
the total 1974 flow. Adding the 10.3 to the 28.9 leaves us with $39.2 
billion of explained transfer flows. 

Another component of the intergenerational transfer flow is trans¬ 
fers made in the form of trusts. While direct data on the value of new 
trusts formed in 1974 are not available, there are fiduciary income tax 
data for the years 1965 and 1970 that permit a rough calculation of 
this transfer flow component (U.S. Internal Revenue Service 1970, p. 
46). Between 1965 and 1970 the number of new trusts established 
each year averaged 35,098. The 1970 income of the 152,398 existing 
trusts in 1970 totaled $7,513 billion. The average 1970 trust had, 
therefore, an income of $9,985. Dividing this value by our 1970 series 
1 interest rate of 0.0787 gives an estimate of the average value of a 
1970 trust of $126,874. Multiplying this figure by 35,098 leads to an 
estimate of $4.44 billion as the value of new trusts established in 1974, 
After multiplying this number by 1.395 to allow for the growth in 
total wealth between 1970 and 1974, our estimate of the 1974 flow of 
intergenerational transfers in the form of trusts is $6.19 billion. This 
figure raises the explained transfer flow to $45.4 billion. 

This $45.4 billion figure seems small compared with the $70 billion 
total flow needed if the stock of transfer wealth is $3,151 trillion and 
the age gap factor is 45. On the other hand, it is not too far from the 
$46 billion total flow needed if r exceeds n by 2 percent and the age 
gap factor is 68. 

Unfortunately, before one can precisely determine tbe total iii- 
tergenerational flow, substantially more information will have to be 
collected about family gift giving and support payments to children 
and grandchildren. These data may prove particularly difficult to 
obtain since we would have to estimate the value of the cedar chest 
passed from grandmother to gfrandnephew, or the family car given to 
son John as a college graduation present, or the value to son Alex of 
making him a full partner in a lucrative family business. Since the 
distribution of wealth is very highly skewed, such surveys need to be 
aimed at the intergenerational transfer payments of the very wealthy. 
However, the very wealthy may be the least willing to disclose these 
types of transfers because of potential estate and gift tax liabilities. 
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VI. Summary and Conclusions 

The evidence presented in this paper rules out life-cycle bump saving 
as the major determinant ol capital accumulation in the U.S. econ¬ 
omy. Longitudinal age-earnings and age-consumption profiles do not 
exhibit the kinds of shapes needed to generate a large amount of 
life-cycle wealth accumulation. The view of U.S. capital formation as 
arising, in the main, from essentially homogeneous individuals or 
married spouses saving when young for their retirement is factually 
incorrect. 

Intergenerational transfers appear to be the major element deter¬ 
mining wealth accumulation in the United States. Our best estimates 
of the 1974 stock of transfer wealth after allowing for interspousal 
life-cycle accumulation is approximately $3 trillion. 

While these estimates of the stock of transfer wealth are quite large, 
totally eliminating transfers in the U.S. economy would not necessar¬ 
ily reduce total U.S. wealth by the full amount of transfer wealth. We 
have demonstrated within the context of a steady-state growth model 
that a $1.00 reduction in the stock of transfer wealth may reduce total 
wealth by less than $1.00 if the steady-state real interest rate exceeds 
the steady-state growth rale. Taking the U.S. historical real interest 
and growth rates as illustrative, eliminating a $3 trillion stock of 
transfer wealth would reduce total U.S. wealth by about $2.1 trillion 
in a steady-state context. This, however, is a partial equilibrium analy¬ 
sis. Substantially more research must be undertaken before we can 
begin to attach probable numbers to full general equilibrium re¬ 
sponses to changes in transfers. 

This paper suggests the importance of and need for substantially 
more re.search and data collection on intergenerational transfers. 
Economic models of savings that stress the homogeneity of agents and 
the importance of the demographic structure should give way to 
models that emphasize the rather massive intergenerational transfers 
in the U.S. economy and the apparent concentration of these trans¬ 
fers among the very wealthy. 
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This paper examines the coniroveisy coiueniing the relative desira¬ 
bility of fixed versus flexible exchange rales by examining whether a 
country can achieve a smaller variance of domestic output around its 
lull employment path operating under an exchange rate rule or 
under a money supply rule. The paper finds that neither [jolicy will 
always dominate the other. However, it does find that if shocks in 
one market of the economy are large relative to .shocks to other 
markets, then one type of rule can be shown to dominate the other. 


I. Introduction 

One of the major concerns of the international trade literature has 
been whether or not a country is better of f under a fixed or under a 
flexible exchange rate system. There is no consensus that one ty{)e of 
exchange rate scheme will always dominate the other. Rather, the 
optimal exchange rate scheme for a particular country has been 
found to depend upon the degree of openness of the country, the 
degree of international capital and labor mobility, and its rate of 
adjustment to disturbances.' Another consideration has been whether 
or not the disturbances experienced by the country are more likely to 
be external or internal in origin, although recently the emphasis has 


I am indebted to Francis Afiking, Ru.sscll Boyer, Rodney Faivcy, .\naioli Kuprianov, 
Leslie Lipschilz, Michael .Salemi, Wilson Schmidt, members of the Workshop on Mon¬ 
etary Theory at VPlUcSU, and two anonymous referees for useful cominencs I take 
responsibility for any remaining errors. 

' For a g<K>d survey ot ihe conditions under winch one exchange rate arraiigemeiu. 
will dominate the other, see Tower and Willeit (1976). 

[Jaurnai of Po^ihco/ Fron 0 m>. 1981, vtti 89. no 
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shifted to ilie market in which the disturbance occurs rather than its 
geographical oiigin* 

This paper examines the controversy concerning the relative de¬ 
sirability of lixed verstis flexible exchange rates from a different point 
of view. The question examined in this paper is whether a country can 
achieve a smaller variance of domestic output around its full employ¬ 
ment path operating under an exchange rate rule or under a money 
supply rule. The exchange rate rules we consider will be completely 
general, so that a country operating under an exchange rate rule can 
be considered to be engaged in managed floating and, as a conse¬ 
quence, will have an endogenous domestic money supply. On the 
other hand, when an economy operates under a money supply rule, 
its exchange rate becomes endogenous, so that an economy operating 
under a money rule can be thought of as having perfectly floating 
exchange rates. The money supply rules we consider will also be 
completely general. 

The model which underlies this analysis is an aggregate siipply- 
aggrcgate demand model (or an open economy with perfect capital 
mobility and rational expectations. Ihe model is a stochastic and 
dynamic version of that used by Parkin (1977). It also has the same 
basic fiamework as Dornbusch (1976) but departs from his actual 
model by assuming continuous market clearing and rational expecta¬ 
tions. The model is presented in .Section 11. The reduced-form so¬ 
lutions for real output under the alternative policy rules are lound in 
.Section 111. In .Section IV the variability ol output under the two 
alternative policy rules is compared and some implications are pre¬ 
sented. A summary is presented in Section V. 

II. The Model 

following Dornbusch (1976) and Parkin (1977) we model a country 
which is small enough in the world capilal market that it cannot affect 
the interest rate it faces, riitis, capilal mobility will insure theecpiality 
of foreign and domestic interest rates after adjustment for expected 
exchange rate appreciation or depreciation. We also assume that 
domestic goods aie imperfect substituies for foreign goods, so that 
the domestic price level can differ from the foreign price level by 
more than the current exchange rate. Further, we assume that the 
country is small enough that its demands for imports will not affect the 
price of imports, and we arbitrarily set the price of foreign goods 
equal to unity.’ 

* I'wt) iftcnt anulc^ which torus on clisiiirbatKes by maikct orij^iii railu'i ihaii 
gcogiaphual origin arc those l)y Boyer (1978) and hipsihit/ (1978). 

^ By assuming that domesUf output and foreign output arc nut [xjrfett jiub.siilutes, 
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The basic model consists ol the following four equations: 

Jt = a,(.P, - Et-,Pt) + a^y,-, + /t, (aggregate supply curve), (jj 

yt = *][*< - - f'l)] + - P,) + u, (IS curve), (2) 

wif = CoPt + c,y, + C2I1 + + Vi (LM curve), ( 3 ) 

I, = i + /’J,_,(f(+i ~ «/) (interest rate parity). (4) 

The variables are defined as follows: y, = the level of real output 
(income); m, = the domestic nominal money supply; P, = the price 
level; i, = one plus the nominal interest rate; e, = the exchange rate 
(the domestic currency price of foreign currency); i = one plus the 
world nominal interest rate; and (j.,, u,, 17 ,, e,, and o>, = white-noise 
stochastic disturbance terms. All variables are expressed in log¬ 
arithms. The operator denotes the rationally formed expecta¬ 
tion based upon information available at lime t - 1. All stwhastic 
error terms are assumed to be white noise and distributed inde¬ 
pendently of eacli other. Later in the analysis the operator cr'i will be 
used to denote the variance of the random variable x. 

The aggregate supply curve (I) is similar in form to that proposed 
by Lucas (1973) and has been used in a closed economy macro model 
by Sargent and Wallace (1975) and McCallum (1978) and in an open 
economy model by Flood (1979). It incorporates the natural rate 
hypothesis since real output is affected only by unanticipated changes 
in the price level. Following (iray (1976) and Fischer (1977/>), we 
justify (1) as follows. Assume that nominal wage contracts for periixl I 
are negotiated at the end of period / — 1 and equate expected labor 
supply and expected labor demand conditional on the information 
available when the contracts are negotiated. Actual employment in 
period I, however, tlepcnds upon the realized wage in the period, and 
for this rea.son innovations in the price level cause fluctuations in 
employment and output.'* Further, since employment varies inversely 


our from ihc iiiodcis ot Bilsoii (1978) ami Flood (1979) \\> will also wsc* 

monry nili’s in oui jiiidysis uhiih aii* more )(eneral iImii the r.indoni-wdlk mone\ 
growth rules wIikIi they use. 

* The .iggrcgaie bupply tui>e (I) is also obtained by (Kirdon (1978, thap 7) using 
assuiTiplions very Minilai lo those olXir«iy (1976) and Kim her (1977A) VVe hd\enoi used 
1 aj( ns's denvaiioii ol (t) a«i he assumes informalionally separated markets. As KUxkI has 
|X)inted out, ilie as.siiiiiption of iidoi matioiiaily separated markets on the supply side is 
not con.sistent with the (ontemporaneous trading across countries iinplini m (2) 
through (4). Such (oniempoianeous iiading als4> implies that the bc'haMoral param* 
eiers m the model tan be assumed to remain um hanged as the monetary authontv' 
switches from an exch.tnge rate rule to a money rule or vice versa. As Lucas (1973). 
Barro (1976), and I-eiderman (1979) have shown, such an assumption mav not he 
eorrett if agents ojxrate m mldrmaiionally se|x»raied markets and must make infer* 
ences about the price level or the exthange rate liased upon iransacnons prices olv 
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with the real wage, we restrict a, > 0. The omission of a constant term 
in ( 1 ) indicates thatji< should be interpreted as the deviation of output 
about the level which would obtain if there had been no past innova¬ 
tions in the price level. 

The IS curve (2) is that of conventional theory with two modifica¬ 
tions. The first is that the interest rate variable is the real rather than 
the nominal interest rate. This modification is required by the fact 
that when the price level is allowed to change in the analysis, invest¬ 
ment demand should depend upon real rather than nominal interest 
rates.'' The second is the presence of the relative price of foreign to 
domestic goods. (fhe logarithm of the foreign price level is omitted 
since the foreign price level was assumed to be unity.) Since higher 
real interest rates should depress the demand for domestic output 
and since an increase in the price of foreign goods relative to domestic 
goods should increa.se the demand for domestic goods, we expect 
0 and hi 5= 0 . 

I'hat the LM curve (3) is the conventional LM curve can be seen by 
rewriting (3) as 

m, - CqP, - fjc, = Ciy, + Cji, + 17 ,. (3') 

1 hus, (3') states that the demand for real balances depends ufx>n real 
income and nominal interest rates and that for the money market to 
be in equilibrium the demand for real balances must be equal to the 
supply of teal balances. The reason for including the exchange rate in 
the determmatioti of real balances is that individuals in the economy 
may deflate their holdings of nominal money by a combination of 
both domestic and foreign prices since they can purchase both 
domestic and foreign goods.® We expect that Cq, C 3 ^ 0 and Co + Cs — 1 • 
Further, the conventional demand-lor-rnoney view is thatc, ^ 0 and 

Ci « 0 . 

Fejuation (4) results from the assumption of the smallness of the 
country m world capital markets and from the assumption that 
(breigti and domestic assets are perfect substitutes for investors. As a 
consequence, capital mobility will force equality of expected nominal 
returns on domestic and foreign bonds expre.ssed in units of the .same 
currency. 

in equations (2) and (4) of the model, expectations concerning the 

sei ved in their mat kel. It would be interesting to examine how the conclusions of this 
paper would change it this informational assumption were modified. 

* In general, lor a random variable*, log (h*) > £(log*). Therefore, t, — — 

/*,) IS only approximately equal to the iogarilhm of one plus the real interest rate. 
Further, (4) is only an approximation to the interest rate parity condition for the same 
reason. However, log (Ex ) is equal to the expected value of the first-order Taylor series 
expansion of log * around Ex. 1 am indebted to a retcrcc for this point. 

• I'his same assumption is made by Boyer (1978, p. 1047). 
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rates of change of the price level and the exchange rate are assumed 
to be conditioned on information available in period / — 1. On the 
other hand, period t information on the price level, the nominal 
interest rate, and the exchange rate are assumed to be used by agents 
in their output and money demand decisions. This raises the possibil¬ 
ity that the model may be asymmetric with respect to its information 
assumptions, although such asymmetry could be reasonable if bond 
purchases were assumed to be made at the beginning ot period t 
before the output and money demand decisions. Nonetheless, we 
recognize that there is a degree of arbitrariness with regard to the 
assumptions concerning the conditioning of expectations. For this 
reason we treat the case where agents use period t information to 
form expectations in the Appiendix. Both assumptions about the 
conditioning of expectations have appeared previously in the litera¬ 
ture. The papers by Sargent and Wallace (1975), McCallum (1978), 
and Flood (1979) use expectations conditioned on information avail¬ 
able at time / — 1, whereas expectations conditioned on information 
available at time t are used by Bilson (1978) and by Floixl in his 
appendix. 


III. Implications 

We now solve the model given by (1) through (4) hrst under a money 
policy rule and then under an exchange rate rule. 


Money Rule 

In an economy in which the monetary authority is following a rule lor 
the determination of the money supply, the exchange rate is an 
endogenous variable: that is, the country operates under Ilexible 
exchange rates. When e, is endogenous, solving equations (1) through 
(4) for the price level yields 

Pi = A{(c3 + b-iC |)(a 

■I" rsA,!*' F,^_,(<■,+ , — C() — ht-i{P i+i — P f)] (5) 

- + FVilCj+i - f»)l + btm, + e,} 

where Ot — [-(Cs + c,b 2 )fii + c^v, — hjy],] and ft = (/>* + rtif.i + r 
Applying the operator to both sides of (5) and subtracting the 
result from (5) yields 

Pt - K,-,P, = k(0, + b^^i), (6) 

where = m, — is the innovation in the money supply in period 



73 H JOURNAL OF POLITICAL ECONOMY 

t. Substituting ( 6 ) into (1), we obtain the reduced-form solution for 
real (output as 

Vr ^ < 7 + Ih^t) + »tyt-t + 1^1 

(7) 

= klhifjL, + - Tj,)] ajj;,-,. 

Given (7), the solution for the real exchange rate (ci — P,) can be 
obtained from (2) and (3) using the Sargent-Wallacc forward solution 
technique. We hnd^ 

e, - Pt = -bjlb^ + + Ai («2 - UJbi-i 

+ A[/i, - (1 + f,n,)p, + a,(e, - t),)!- 

Substituting (7) and (8) into (2) yields the solution for the real interest 
rate as 

It - t:,^,{Pl+t - Pi) = i + {«2(«2 - ))/V>-2 + fr|(«2 - 1)]>)’(-I. (9) 

Further, from (7) we obtain titc variance of real output under a 
money rule to be 

= k'^il - nl)~‘[b'^l + -I- (a,bif(a-'i, + (10) 

•Since E,-ininj, / ^ 0, terms do not appear in the solutions (7) 
through (9), our results indicate that changes in the money rule do 
not have iinpads on the real variables in the economy. And since this 
is true for the levels of these variables, it must hold for their vaiianccs 
as illustrated by (10). On the other hand, etpiations (7) through (9) 
show that with the exception of the real interest rate, the real variables 
in the economy do respond to unanticipated slnx ks. These arc, of 


^ I he •solution lo obtain <HJ rc<{uires ns (o hrst obtain the tfirward soliiii(»ii to- 

h, . ... 1 


f’,) = ~ 


i+j) ^ *J 


Uy repealed sul)*>nuui<m. llie alK)Vt* e(|uatioii tan l>e wuuen .is: 


■f’-t-iyt- 


^* 1 } 


J_Vf- * 


Ll" 

I Ai - A, / 

thus, (H) it-qiiiu's ihc ,issunipiu>ii that: 




/>, - h. 

^' 1 - 1(^1 + 


I hf im[K)Sitif)n ot this icriniiwl Londition rules oui the [x>.ssibiliiy of exfx't led changes 
in future relative prices, which would lead individuals lo demand inhiiilc quantities uf 
either toreigri 01 domestic gcxids in the present period. This terminal conclilion is 
similar lo that cif Sargent and Wallace (197.5. p, 248). F.cj. (8) also requires that A.', iy,,j 
not be too explosive, i.c., that «, not lie too large. 
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course, the familiar results (or aggregate models which incorporate 
rational expectations into a natural rate aggregate supply equation 
such as (1).* 

The solutions (7) through (9) also have interesting implications for 
whether or not the domestic economy is independent of the rest of 
the world under a money rule. From (7) we find that the level of real 
output is independent of the world interest rate. So, from this per¬ 
spective, our results agree with Parkin’s (1977, p. 238) conclusion that 
the domestic economy is independent of the rest of the world under a 
money rule. However, the time paths of the real exchange rate and 
the real interest rate depend on the world interest rate but not on the 
money rule being followed. Tbus, the domestic economy is not inde¬ 
pendent of the rest of the world in the sense that changes in the world 
interest rate will have real impacts on the domestic economy which 
cannot be offset through the mtmey rule.’* 

d’he reason that a change in the wttrld interest rate will have real 
impacts can best be seen by demonstrating that the opposite cannot 
(xcur. If changes in the world interest rate were to have no impacts on 
domestic real variables, an increase ini would have to be offset by an 
equal decrease in 'ti order to leave the real interest rale and 

the real exchange rale unchanged. However, the decrease in next 
period’s exchange rate implies a decrease in next pericxl’s relative 
price of foreign and dotnestic gocxls and next period’s demand for 
output. C',onse<iuently, current cxpetialions of next periods price 
level will decrease. I'lns reduces the demand for cuircnt period 
output since the real interest rale will be increased. However, since 
current output cannot change due to the form of (1), equilibrium 
must be restored through increases in the real exchange rate to offset 
the increase in the real interest rate.'" 

In order to illustrate the w'ay in which unanticipated disturbances 


" Kxphcii soliUKiris lor/', and r, <<mld Ih- ol)i.<iiK-d 1>) soImiijc ihi- scMciii of e(|iicilK>ns 


p, 

1 

^.1 ^3 


f>,lt + K,-i(roi - f,) ~ f'o - f’,>l + W - y, 



^3 


-Cell + - c,)l + ,m, + e, - t), - r,y, 


Mmiillanenii.sly nsin({ the .Sjrgein and Wallace torwaid-soluiion lochnkjue. Fi <>m (11) il 
IS obvious thal such solutions will include ihe ^ 0, .cs independeni vanahics. 

1 liiis, ilii- nominal variabic.s in lliis economy will res|X)nd lo changes in the money 
supply rule tx-iiig tollowed. 

'' Fiom (11) II IS olivioiis ihal Ixnhi and policy paramelers will apiiear m ihe .solutions 
lor the doniesiic price level, the exc'h,ingc rate, and ihe nomiiial iniere.M rale. Thu!,. ilie 
cx|>c(led levels ol these vaitables c.aii be dclernnned by ibe doineslic nioiieiary au¬ 
thorities regardless ol the level of the world interest rate. 

Since we have assumed llial the world price level is delcniiinate and h\ed. changes 
m I can only be caused by changes in the world real nueresi rate. I'hiis. il is nol 
surprising to find the domestic real interest rale alfecled by sue b changes under flexible 
exchange rates. 
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affect the real and nominal variables under flexible exchange rates, 
we examine the effects of an unanticipated increase in the domestic 
nominal money supply. Differentiating (7) through (10) with respect 
10 €/, we hnd that an unanticipated increase in the nominal money 
supply will increase real output, the exchange rate, and the price level 
while decreasing both the nominal and real interest rate. Further, if a, 
> 0, the increase in the price level will be less than the increase in the 
exchange rate so that the real exchange rate will increase. 

These changes iKcur for the following reasons. The unexpected 
increase in the nominal money supply leaves expectations of current 
and future prices and exchange rates unchanged. However, it does 
cause money supply to be greater than money demand with a corre¬ 
sponding equal-%alued excess demand for both domestic and foreign 
bonds which are perfect substitutes. The excess demand for bonds 
will |mt upward pressure on the exchange rate as agents attempt to 
sell domestic money for foreign money to buy foreign bonds. The rise 
in the exchange rate increa.ses the demand for output through the IS 
tuive. Thus, the effect of the unanticipated increa,se in the nominal 
money supply is to shif t out the aggregate demand curve. There will 
then he an excess demand for output which will cause upward pres¬ 
sure on the domestic price level, and since the price level increase is 
unanticipated, it will cause the supply of output to increase unless = 
1). Obviously, the rise in the price level must be less than the increase 
in the exchange rate otherwise there would be no shift in the aggre¬ 
gate demand curve. 

Because of the recent discussion concerning whether the exchange 
rate will oveishoot or undershoot its final level in response to a 
monetary disturbance," we conclude the discussion of the money rule 
case by determining the time path of the exchange rate in response to 
an unanticipated change in the money supply. To facilitate this dis¬ 
cussion, we assume a specific money rule of the simple autoregressive 
form 

m, = pm,-, + e,. (12) 

Under such a rule, the time path of the exchange rate, ignoring terms 
involving the world interest rate and errors other than e,, is 

oc 

r, = k(l>2 +«,)€, + +a,b2ky2ai)e,-i-i, (13) 

J-O 

where 


" Die overshcKitiiig pheiumicnon is obtained by Dornbusth (1976, 1978), although 
in the latlei paper he hnd.s that undershooting rather than overshcKrting occurs when 
price levels are allowed to adjust. The overshooting elfect has also been found empiri¬ 
cally by Frenkel (1978). 
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>1 = 


and 


72 = ^2 


1 - C2(l -p) 

gp - c,lbt{a3 - 1) + ^ 2 ] 


lAi(a2 - 1) + 62KI + C2(a2 - 1)] 


Differentiating (13) with respect toe^, we obtain the initial effect of 
the monetary shock to be 


de, 

df, 


= k(b 2 + a,) > 0, 


(14) 


so that the immediate effect of the unanticipated increase in the 
money supply is to increase the exchange rate. 'I'he reasons for this 
effect are given in the above discussion of the impact effects of 
unanticipated changes in the money supply. 

The effects of an unanticipated change in the money supply in 
period / on the exchange rate in subsequent time periods are obtained 
by differentiating (13) for period I + with resjjecl to e, to obtain 

^ (15) 

06 , 

From (15) we first find that if both | p | and | \ are less than unity, 

unanticipated changes in the money supply will decay exponentially. 
We also find that unanticipated changes in the money supply in 
period t will not have impacts past the initial period unless there are 
feedback effects in either the money supply rule or the aggregate 
supply function, that is, unless either p or 02 is nonzero. 

Htjwever, when feedback effects are present in the money supply 
rule but not in the aggregate supply function (cz = 0 and p > 0), we 
find even though will be above the long-run level ol the exchange 
rate, we cannot determine whether it will be above or below c,. I hus, 
in this case we find that even though the exchange rate will always 
initially overshoot its long-run level in response to an unanticipated 
money supply change, the overshooting actually may be larger in the 
period after the shtKk than in the period of the shock. 

On the other hand, when we relax the assumption that «2 = 0 and 
allow 0 ^ 02 ^ 1. we find that it is possible for the initial osershooting 
of the exchange rate in period I to be followed by undershooting in 
subsequent time periods. This result occurs because it is not p<issiblc 
to sign 72 - Thus, our analysis indicates that even though an vinantici- 
pated money supply increase must initially increase the exchange 
rate, if lagged output terms appear in the aggregate supply function, 
the exchange rate may either undershoot or overshoot its long-run 
level in subsequent time periods. 
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Exchange Rale Rule 

Wc now consider the case in which the monetary authority in the 
economy follows an exchange rate rule rather than a money supply 
rule. A monetary authority following an exchange rate rule sets a 
target value for the exchange rate (e,) according to some known rule. 
This rule could be of the linear feedback type. Alternatively, the rule 
could be thate, = e for all t, yielding the case of fixed exchange rates, 
or that the exchange rale appreciate or depreciate at a given rate over 
time. The important point for the analysis which follows, however, is 
not the paiticulai form of the exchange rate rule. Rather it is that the 
rule used to determine the exchange rate be known and constant over 
time. In this case, the rational expectation of the exchange rate in 
period / + 1 will be E,^i(e,+j) = o+j. 

Of course, the monetary authority cannot establish c, by fiat. Under 
an exchange rate rule the rnoneuiry authority must accomplish its 
objectives by buying or selling foreign or domestic reserves until Pt is 
achieved. However, since either action will aflect the domestic money 
supply, under an exchange rale rule the domestic money supply 
becomes an endogenous variable. Further, given the structure of the 
economy, a given exchange rate rule will imply the time path of the 
domestic money supply and the monetary authority’s holdings of 
international rc.serves.'^ 

When m, is endogenous, .solving (I) through (4) for the price level 
yields 

Ri = — t"/) — Ei-^i{R,+j — /’,)J 

/ I 

+ Ihe, + p, + a,E,-.tP, - + a,). 

Apjjlying the F,., operator to both sides of (16) and subtracting the 
lesull from (16) yields 

r, - Ei^fP, = (p, - fi, + h.iOit)l{h + «i), (17) 

where ta, = e, — Ei -^e, is the innovation in the exchange rate in peritxl 
Substituting (17) into (1) yields the reduced-form .solution for real 


OrtdiTi exchange rules might rc(|uire the iwmeiary authority to sell a (^uaniiiy 
ol inicrnattonal rcscTves exceeding ihcir initial holdings. Such exchange rale rules 
would obviously not be feasible and are iiTi|»li<Tily ruled out of the analysis. However, 
sintc the variance of r<‘ai output under an exchange rale rule is independent of the 
actual rule chosen, the inleasfhility of some exthange rate rules docs not affcci our 
results. I am indebted to a referee lor this point. 

Since the monetary dUlhorny i.s intervening in the foreign exchange market, it i.s 
difficult to see how such innovations in lire exchange rate could occur. Nonetheless, we 
have inirf>duccd oj, into the analysis of the exchange rale rule case lo make it symmetric 
to the money rule case in which the actual money supply could deviate from the desired 
money supply of the monetary autlioriiy by However, since a>t is a white-noisc 
random error which is assumed to be uncorrelaied with the other di.sturhances. the 
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output as 

y, = (^> 2 /i.( + a,v, + a,bi 0 i,)/(h 2 + a,). (18) 

Using the Sargent-Wallace forward solution technique, the solution 
for the price level can be obtained from (16) as'"* 

p - bj _02_ , 1 V 

' b, b,+b,(a., - 1 ) ( 62 - 6 ,) 

[^i-Ei-iCj+j+i + (^>2 ~ (19) 

- (p, - V, + biO},)l(bi + a,). 

The real exchange rate can be obtained by subtracting (19) from c, = 
A',_iC( + oj,, and the real interest rate can be obtained by substituting 
for yt and c, — P, in (2). 

The reduced-form solution for output (18) once again has the 
familiar neutrality result for models which incorporate rational ex¬ 
pectations and the natural rale in the aggregate supply equation. 
However, since (19) depends on terms, under an exchange 

rate rule both the real exchange rale and the real interest latc depend 


inclusion ot the in, does not modify the fatt that r, is exogenously determined in the 
ext hange rate rule rase. 

'■* The solution to oblain (19) requires us to first obtain the forward solution to: 




P, - - 





f I + 



By repealed suliMitulitin, the above equation can be written as. 


f-’,- 





+ t:, 


iCf+j+i) ■t' 




t hus, (19) lecpiires that. 




/' 


r+Mf-j 


which rules uuj "speculative bubbles*’ in domesiic goods F.q. (19) also lequires ihai the 
proc ess 


( - 6 . )^' 


1 er,jf I + /t* I 


I 


th-b. 


' Ft 


iwn (k? Itx) explosive. These conthtioiis are veiy similar to those rcquiietl h\ Saigeiii and 
Wallace (I97,S). They itiller from those netes.sary loolilain (H) because in ihctaseol (8) 
we arc solving tor the real exchange rale and in the ca,se of (19) we are solving lor the 
domestic price level. It is also interesting to note that, whenas Sargent and Wallace 
have shown that in a closed economy the price level is iiidetermiiialc it the moneiaiy 
authority follows an interest rate lule, we can obuin a deleiminaie result il the 
monetary authority follows an exchange rate rule in an open economy. 
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upon the exchange rate rule being folJowed. The results that nominal 
variables (and the nominal money supply is included in this case) are 
affected by the policy parameters also continues to hold. 

Further, from (18) we find that under an exchange rate rule the 
variance of real output is 

tJik = + afo-J + a\blcrl,)l[{bi + ai)^(l - oi)], (20) 

which once again is independent of the particular exchange rate rule 
being followed. However, we do find that the variance of output 
under an exchange rate rule differs from the variance of output 
under a money rule. One major difference is that under an exchange 
rate rule the variance of output is independent of any disturbances 
which occur in the money demand equation. The reason is the fol¬ 
lowing. Under an exchange rate rule, the nominal interest rate is 
entirely detertnined by this rule and the world interest rate. With the 
interest rate thus fixed, real output and the price level are determined 
solely by the aggregate supply and IS curves. The only role of the 
money demand equation is to determine the level of the endogenous 
nominal money supply. 

T he .solutions (18) and (19) also have interesting implications for 
whether or not the domestic economy is independent of the world 
interest rate under an e.xchange rate rule. Since the systematic part of 
the exchange rate rule (thefiz-ir^+j terms) appears in the solutions for 
both the real exchange rate and the real interest rate, the monetary 
authority can offset any changes in the world interest rate by chang¬ 
ing its exchange rate rule. Specifically, if the monetary authority 
reduces the expected exchange rate in the next period by the amount 
of any increase in (he world interest rate while holding the expected 
current exchange rate constant, the domestic economy will be inde¬ 
pendent of changes in the world interest rate. Such an exchange rate 
rule would keep both the domestic nominal interest rate and the 
current exchange rate constant. Consequently, the change in i would 
leave aggregate demand in the domestic economy unchanged with 
the result that no real variables would change. 

Thus, we have found that whereas under a money rule the domes¬ 
tic economy will not be independent of the rest of the world, under a 
managed exchange rate system the domestic economy can be inde¬ 
pendent of the rest of the world in the sense that changes in the world 
interest rate will have no real effects. The difference in the two rules 
can be explained as follows. If changes in the world interest rate were 
to be neutralized under a money rule, the money supply would have 
to be systematically changed in response to changes in i. However, 
since (2) through (4) are homogeneous of degree zero in the expected 
current money supply and the expiecled current and future price level 
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and exchange rate, any change in the expected money supply will be 
accompanied by proportional changes in the price level and exchange 
rate which leave the expected rate of change of the price level and the 
exchange rate unchanged. Consequently, under a money rule an 
increase in » will have real effects. Under an exchange rate rule, 
however, , — e,) can be modified without necessarily altering 

the current money supply, and an exchange rate rule can be devised 
to make the domestic economy independent of the world interest rate 
as discussed above.'® 


IV. Implications 

Comparison of the variance of output under a money rule and under 
an exchange rate rule indicates that it is not ptossibie to determine that 
one rule always dominates the other. Like the results in the traditional 
analysis of the merits of fixed versus flexible exchange rates, we see 
that which rule is preferred depends upon location and magnitude of 
the variance of the stochastic disturbances in the model Nonetheless, 
we can determine some special cases in which one type of policy rule 
will yield a lower variance of output than the other. 

We first consider the case in which the variance of the disturbance 
in the money demand function is large relative to the variances of the 
other disturbances. In this case, the variance of output will always be 
smaller if the monetary authority follows an exchange rate rule rather 
than a money rule, since under an exchange rate rule, real output is 
determined independently of the money market. Because fixed ex¬ 
change rates are a special case of our exchange rate rule, this result is 
similar to Boyer’s (1978) finding that a fixed exchange rate is optimal 
when shocks occur only in the money market and that is the market in 
which the authorities are intervening. Further, it corresponds to the 
results of Turnovsky (1976), Fischer (1977a), and Flood (1979) who 
find that money demand shocks do not affect output (consumption in 
the case of Fischer) under fixed exchange rates. 

Next, we consider the case in which the variance of the disturbance 
to the aggregate supply curve is large relative to the variance of the 
other disturbances. String cri = cr* = aj = at = 0 in (11) and (20), 
we obtain 

On the other hand, if the motielary authority follows a hxed exchange rate rule, 
the domestic economy will be affected by changes in the world interest rale since 
changes in the world interest rate will not beolfset by exchange rate changes. Thus, 
changes inf will have real eflccts since the change in the domestic nominal interest rate 
will shift the aggregate demand curve. Further, (19) shows that with hxed exchange 
rates, the price level will be determined solely by the level of the world interest rate and 
random shocks. Thus, a fixed exchange rate policy will yield no inllation on average 
unless there is an upward drift in the level of the world nominal interest rate. 
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= (ai + hifUbi + a^(J)^c^ + (21) 

Thus, 

(rlit {§} o-'iiw as <’1^2 + '■a {1} 1- (22) 

This rcsull is explained by the fact that the variance of the output 
response to an agjrregale supply shock will be larger, the sleeper is the 
slope of the aggregate demand curve. Under an exchange rate rule, 
the slope ol the aggregate demand curve is 



Under a money rule, the slope of the aggregate demand curve is 

+r,). (24) 

rhiis, when + r., < 1, the slope of the aggregate demand curve 
will he sleeper and the variance of output larger under a money rule. 
However, when > 1, the aggregate demand curve will have a 

steeper slope and the variance of output will be larger under an 
exchange rate rule. 

These results do not agree with those of Fischer (1977rt) and 
l.ipschii/ (1978) who find that fixed exchange rate rules are always 
preferred when the major sources of disturbance.s arc shocks to out¬ 
put (supply) which are independent of demand. The dif ferences arise 
because Fischer isconcei iied with the variance of consumption, which 
IS defined to be the difference between nominal income and the 
balance of payments. He obtains his result since, under fixed rates, 
balance-of-payments cjutHows tend to off.sel part of the increase in 
consumption caused by the increase in income. In Lipschitz's model 
changes in the exchange rate cause wealth ef fects which tend to move 
in the same direction as the effects of the change in income. Thus, 
since fixed rates eliminate the wealth effects, they arc variance 
miniini/ing. 

The third case to consider is that in which the variance of the dis¬ 
turbance to the IS curve (caused, e.g., by large disturbances to the 
investment demand function) is large relative to the variance of the 
other disturbances. .Setting erj = cr* = CTf = a-'t, = 0, we obtain 

= r|(a, + + ai(^ 2 t'i + r,)p. (25) 

Under the presumption that c^, the elasticity of money demand with 
respect to the exchange rate, is small, (2.5) indicates that a money rule 
will always yield a smaller variance of Output against output demand 
disturbances. This result occurs because under an exchange rate rule 
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the nominal interest rate is fixed. Thus, any disturbance to the IS 
curve must be cleared by price-level and output changes. However, 
under a monetary rule part of the market equilibration can <x:cur 
through changes in the interest rate. I'his le.ssens the amount of 
price-level change necessary to clear the market, (’.onsequeritly, the 
amount of output response which must occur as a result of the shock 
to the IS curve will be smaller under a money rule. This is similar to 
Lipschitz’s (1978) hnding that if the major sources of disturbances are 
shocks to demand which are independent of real output, then an 
economy should use floating rates. Our result also agrees with 'fur- 
novsky’s (1976) finding that, with a high degree of capital mobility, 
Ilexihle exchange rales yield a higher degree of output stability than 
fixed exchange rates. 

Finally, we consider the case in which the variances associated with 
the choice of the exchatige rate regime itself are large relative to the 
variances of the other disturbances in the model. Setting o’J = oj = 
(T^ = 0 we obtain 

=(<T?/cri){(rt, + + a|(Z> 2 C, •+ <:.■,)?}. (26) 

riius, (26) indicates that a tnoney rule will be preferred to an ex¬ 
change rate t ule if unanticipated changes in the money supply cause 
smaller shifts in the aggregate demand curve than do unanlit ipated 
changes in the exchange rate and vice versa. I'his result can be 
obtained by solving the demand side of the model along the lines used 
to obtain the slopes ot the aggregate demand curves tor the twt) 
regimes. 


V. Summary 

We can summarize our findings in this paper as follows; 

1. We find that we cannot determine that one lyjx; of monetary (or 
exchange rale) policy will always dominate the other. However, il we 
are able to determine that the variance of one type of shcK'k to the 
economy is large relative to the other shocks, then the above analysis 
indicates that one type of rule can be sliown to dominate the other. 

2 . On the other hand, there may be some merit in the discussion of 
the tnerits of f reely fluctuating vet sits managed exchange rates based 
u[)on the degree of openness of the economy if the sfunks to the 
domestic aggregate supply curve have the largest variance. The dis¬ 
cussion in 'Fower and Willett (1976, pp. 44-45) indicates that the 
elasticity of the demand for output with resjteti to the relative price of 
domestic and foreign goods, is larger the more open the economy. 
I hus, the more open the economy, the more likely it is that a money 
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rule (freely fluctuating exchange rates) will yield a smaller variance of 
output in response to shocks to the aggregate supply curve.'® 

3. Since policy parameters do not appear in the variance of output 
under an exchange rate rule, fixed rates are as good as any other form 
of managed rate in terms of stabilizing output. Thus, arguments for 
managed rates as opposed to fixed rates must be based upon differ¬ 
ences in price level variability or some other criteria. However, since 
the variance of output will be dilferent under flexible than under 
managed exchange rates, output stability can be used as the criterion 
for deciding between the two types of exchange regimes. 

4. Real variables in the economy can be insulated from changes in 
the world nominal interest rate under a managed exchange rate 
.scheme, whereas the real exchange rate and the real interest rate will 
be alfected by changes in the world interest rate under flexible ex¬ 
change rates. 


Appendix 

We now solve I he model for the symmetric information case in which infor- 
rtiaiion in period t is a.ssumcd to be available to agents in making their 
commodity and money demand decisions. Thus, we use the model 


yt = - Ef-.P,) + ix, (aggregate supply) (Al) 

y, = - (E/Poi - P,)\ + biU, - P,) + v, (IS) (A2) 

m, = CoP, + f,y, + c^, + c,c, + y), (LM) (A3) 

t, = i + £(€,-, - e, (interest rate parity). (A4) 

In (Al), we assume a-i = 0 for simplicity. 


We fu st consider the money rule case. When p, is endogenous and letting 6? 
= h -2 ~ h, and fj = fa — fa. we solve (Al) through (A4) for P, and obtain 

Pf = A*{(«|f3 + hfCf)E,^iP, + cfbili + /“.((fi+i ~ Eo.,)] (A5) 

+ + E,f,+,) 4 e?}, 

where t/,* = [—(f-f + + rfr, — bfi),] and k,* = li.J‘(l — f^) 4- Uicf 4- 

r,a,6?) '. 

Since (A 1) must hold for all periixls, it mu.st hold for period I + ;, where / > 
0. Ihus, 

Vf+j “ u,(Ef4j — lE^+j) 4“ fXt+j. (Al ) 

Applying the operator E/ to both sides of (AT) yields 

E,yt^, = 0, ; > 0. (A6) 

'• Since Flixwl (1979) assumes that purchasing power parity holds, he is concerned 
with the case in which —» <». He finds dial var (P, — E,^,P,) is not af'f'ecteH by changes 
in <rj under fixed exchange rates, whereas it is affected by changes in cr* under flexible 
exchange rales. Taking the variances of (6) and (17) and letting fe, —* » shows that 
Fltxid’s result also holds lor our model. 
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Equation (A2) must also hold lor all periods. Applying the Et operator to 
both sides of (A2) for period t + ] and using (A4) and (A6) yields 

0 = frih” + £({ei+;+i — P nj+i)] + b$Et(enj — Pt+i)' (A2') 

This first-order difference equation in E,(e,+) — /*,+/) can be solved forward, 
yielding 

Et(e,+j - P,^,) = -bfUbi, J > 0. (A7) 

Equation (A3) must also hold for all periods. Applying the E, operator to 
both sides of (A3) for period t + j and using (A4) and (A6) yields 

Eim,+j = —CflA,(e(+j — Pi+i) + Cji + (A3') 

■*■ (1 “ <'2)Eiei+j+i. 

Equation (A3') can fx; solved forward, and using (A7) we obtain 

^^1*10= “ (c2 + CoAi/Ai)i +(l ~ Cj) ‘ ^ (A8) 

After substituting lbrfi,(^#+i ~ f^+i) u.sing (A7) and E,{en,) using (A8), we 
apply the operator to both sides of (A5) and subtract the result from (A5) 
to obtain 

P, - E,.,P, = k*(et + hfi,). (A9) 

.Substituting (A9) into (Al) yields the solution for real output as 

y, = *♦[*?( 1 - rj)/x, + ffv, + bf{e, - ■>},)]. (A 10) 

And from (A 10) we find that the variance of output under a money rule will 
be 

<^vu/ = (**)^f(*f)’(l - t'tfcrl + (cffcrl + (*?)*(of + o-?)]. (All) 

Further, if we were to solve for the real exchange rate and the real interest, 
we would oiue again find that they were independent of the sysicmati< pai t of 
monetary policy but not indejrendenl of the world inteiest rate. I'hus, these 
results do not differ much from those in the text. 

We next consider the case of the exchange rate rule. When m, is endogen¬ 
ous, we solve (Al) through (A4) lor P, and obtain 

Pt = - P, + *i(»' + E,(rm - £,*,)] (.\I2) 

-t- X, -1- h{e,}l((it + bf). 

Following the same procedures used to obtain (A9). we find that under an 
exthange rate rule 

P, - E,^,P, = (-Mr + »'f + bfw, )/(/>} + «i). (.M3) 

.Substituting (A 13) into (Al) yiekls 

yi = (bffi, a,p, + a,bfm,)l{hf + n,). (.AM) 

Therefore, the variance of output under an exchange rate rule is 

o'JiA- = [(bifai -H rtfo-f, + (a,bffa-l]/{bf + n,)*. (A 1.5) - 

If we were to solve for the real exchange rate and the real interest rate, we 
would find they have the same properties with respect to systematic changes 
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111 the t'xdiiiiige rale iiile and changes in ihe world interest rate as do the 
solutions in the lext. 

Furllier. when expectations are based upon iniormation in period t, the 
slopes ol'ihe aggregate (.leinand curve under a money rule and an exchange 
lale rule arc 


(t^)m ~ '■rlAr? + tM) (A 16) 

an<l 


(iki - 

I espectively. 1 hiis, the cases in whirl) one tyj)e of rule will dominate the other 
are unchanged fiotn ttie lext. 

The only lesult which < liangc's due to the change ol timing of expectations 
IS the lime path ot the response ol the exchange rate to an unanticipated 
iin tease m the money supply. And letting the aggregate supply cm ve be (1) 
and the money tide he (12), it <an be shown that tlie only dilferente in the 
eflects ol uiiaiuicipaled changes m the money supply on the exchange late 
due to the change in the timing of expectations will he in the dc,/9e, term. 
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Point Spreads versus Odds 


Gilbert W. Bassett, Jr. 

Unwer',U\ »j Illinois at Chicago CircU 


Wliy do bookies use ihe point-spread device for wagering on football 
games when odds betting on the winner is also feasible? One possi¬ 
bility is that point-spread betting is more fun than odds betting. The 
existence f)l point .spreads, however, does not logically entail this 
"mote fun" explanation. Given simple specifications of beliefs and 
betting behavior, and given that point spreads and odds are equally 
thrilling, it is demonstrated that the point-spread game can exist a.s 
the game produced by a profit-maximizing bookie. The disparate 
beliefs whith create the demand for some form of betting market 
can also expl.iin the pai ticular Itetting game produced by the bookie. 


I 

The latest point spread on the Bears-Rams football game this 
weekend lists the Bears as nine-point underdogs. For a 10-cent fee a 
bookie will accept an even-odds one-dollar wager on the event B + 9 
> R, where fl and R are the Bears’ and Rams’ final point totals. Ba.sed 
on ycjur knowledge of the teams’ relative strengths, you assign a 
(personalistic) probability of .7 to the event B + 9 > R (and a proba¬ 
bility of zero to B -I- 9 = R). Hence, a one-dollar (plus the 10-cent 
bookie fee) wager on the Bears has an expected monetary value 
(EMV) of .7(1 - .1) — .3(1 -f .1), or a positive 30 cents. Given the 


An earlier veisioii of this paper was presented at the Economelrit. Sexiety Meeting, 
New York (afy, December 1977. { want to acknowledge helpful comments from a 
reicrce at this Journal. 
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alternative of not betting, you as an EMV maximizer will wager that 
the Bears will beat the spread. 

Point-spread schemes characterize virtually all of the wagering in 
football betting markets. So-called odds betting, however, is also pos¬ 
sible and can provide for any particular individual an EMV identical 
to the point-spread game. If the odds are 6:1 on a Bear victory (B > 
R) and if you assign a probability of .2 to this event (and a probability 
of zero to B = R), then the EMV of a one-dollar (plus the 10-cent 
bookie fee) Bears bet is .2(6 — .1) — .8(1 -t- .1), which is again 30 cents. 
Ceteris paribus, you are indifferent between the point-spread and 
odds wagers. Hence the question. Why do bookies produce the 
point-spread betting game? 

The existence of point spreads is commonly attributed to bettor 
preferences. This explanation is implicit in news articles on point 
spreads, and it is also the most frequently offered explanation in the 
very informal settings where 1 have posed the point spreads versus 
odds question. Under this view bettor preferences are thought to 
depend on more than EMV or expected utility—utility being defined 
on income or wealth. Point spreads are said to be more fun or thrilling 
than odds so that bettors are willing to forgo expected return with 
odds in order to play the point-spread game. This appeal of point 
spreads could reflect a general preference for even-odds wagers, or it 
could be simply that point spreads are more complementary with 
viewing a football game. But the bias in favor of even-odds wagers 
does not extend to other lotteries, and many football bettors presum¬ 
ably wager on games they do not watch. It would appear that the 
majority (in the informal settings) think the intrin.sic delight in beating 
the spread is obvious and requires no deeper analysis. The answer to 
the point spreads versus odds question, then, is simply that point 
spreads are more fun than the odds betting game. 

Bettor preferences provide an answer to the point spreads versus 
odds question, but the explanation posits rather curious tastes which 
cannot be understood in terms ot unEMV or expected utility decision 
criterion. It is possible, however, that preferences are in fact consis¬ 
tent with some more traditional decision criterion, and the stated 
views merely reflect the difficulty of imagining football betting with¬ 
out the point-spread device. Alternatively, the stated views might 
derive from the belief that bettor preferences are the only logical 
explanation consistent with the existence of point spreads. Without 
denying the pjossible empirical validity of the bettor-preferences ex¬ 
planation, the intent here is to generate alternative hypotheses for 
point spreads in a more traditional setting where bettors do not 
consider point spreads more fun. Disparate beliefs regarding the 
random variable X = R — B are one reason for the existence of some 
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betting market, and they are examined here as also providing an 
explanation for the particular betting game produced by the bookie. 

The market of potential bettors will be assumed fixed, and each 
bettor will be described by a probability distribution representing 
beliefs regarding X. In deriding on wagers the bettors are assumed to 
use an EMV criterion. The thrill of betting is therefore supposed 
constant over alternative betting games, and risk preference or aver¬ 
sion is ignored. (Bettors might be imagined to risk only a small frac¬ 
tion of their income or wealth.) Betting is facilitated by a profit- 
maximizing bookie who is paid a fee by participating bettors in 
exchange for either a point-spread, an odds, or a combined-betting 
game. T«) insulate himself from the outcome ofX the bookie will al.so 
establish equilibrium parameters for the betting games. (The betting 
market is of some independent interest because it contains an agent, 
the bookie, who has an incentive to establish these equilibrium values.) 
The betting market is therefore envisioned as arising for purely 
speculative reasons. The market exists for the sole purpose of ex¬ 
pediting the exchange of wagers whose EMVs are positive rather than 
because of variations in initial allocations of physical-IcKation-dated 
commodities or the possibility of utility-increasing risk transfers. Wa¬ 
gering is rational in that each bettor is made better of f according to his 
EMV cr iterion even though it is certain that the aggregate of bettors 
will lose money to the bookie. 

I'he bookie’s profit-maximizing objective translates in the games 
described below into the objective of maximizing the betting pool— 
the total dollars wagered on the football teams. Given the assumptions 
that no wagering scheme is intrinsically more fun than any other and 
that all individuals wager rationally according to an EMV criterion, 
and given the example above which illustrates that for any irrdividual 
there is some odds game in which betting behavior is identical to the 
point-spread game, it might seem that the bookie would be indiffer¬ 
ent as to which game to produce. In the model, point spreads would 
then exist merely as the outcome of the bookie’s random choice from 
a set of equivalent betting games. If a deterministic rationale lor point 
spreads is sought, the model would then have to be abandoned in 
f avor of the bettor-preference explanation. We will see, however, that 
depending on belief s, certain betting games generate greater profits 
for the bookie so that an appeal to the supposed delights of point 
spreads is not logically necessary in order to understand the bookie’s 
production decision. 

The rationale for assuming that individual beliefs are consistent 
with a probability distribution comes from the personalistic or subjec¬ 
tive theory of probability, and Winkler (1971) has found some empiri¬ 
cal supfjort for the theory in the football context. In the economics 
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literature, I know of only two references dealing directly with football 
betting.' Pankoff (1968) used football betting to analyze issues relat¬ 
ing to the efficient-market hypothesis. Smith (1971) examined betting 
markets because of the equilibrating role played by the bookie and 
because football wagering could provide an unusual framework for 
examining decision making under uncertainty. He used the 
expected-utility hypothesis to analyze how betting behavior under 
various betting .schemes might be influenced by wealth, beliefs, and 
the pleasures of gambling. While Smith did consider Irehavior under 
point spreads and odds games and his model is similar to the one 
presented here, he did not examine the conditions under which one 
betting game is produced over another. 

No attempt is made here to associate differing opinions with differ¬ 
ent or incomplete information among potential bettors. In fact, po¬ 
tential bettors might be envisioned as having virtually identical infor¬ 
mation on the football teams’ current situation and past history. Still, 
bettors are not presumed to agree on the distribution of X. While my 
casual observations suggest that this is a reasonable working hypothe¬ 
sis, it does conflict with the spirit of rational expecunions where 
differing opinions are as.sociated with and analyzed in terms of some 
more fundamental differences among individuals. Besides casual 
empii icism, the fact that the probability a.ssigned to a future event like 
the winner of a football game cannot be as easily inferred as the 
probability assignments in, say, roulette might also serve to justify the 
existence and persistence of differing opinions. Despite being able to 
ccjntinuously update their theories by observing the outcomes of 
football games, potential bettors do not reach agreement. Theories 
are sufficiently vague or depend on s(» many exogenous factors that, 
while each bettor possesses a well-defined distribution for X, dif¬ 
ference of opinion exists and persists. Difference of opinion is what 
makes for a horse race and is what is examined here as an explanation 
for the point-spreads versus (xlds que.stion. 

The plan of the rest of the paper is as follows. .Section 11 describes 
the bookie’s objectives and production possibilities, that is, the games 
which can be offered to bettors. A simple parameterization of market 
beliefs and a betting criterion are presented in Section III. The 
assumptions are adopted in order to illustrate how point spreads 
might beat odds and also to determine in a simple model the condi¬ 
tions for the optimality of a betting game. Section IV begins the 
exercise by working through a special case. Propositions in Section V' 
express equilibrium values for the betting games and b<H)kie profits as 
a function of the parameters describing market lieliefs. .Section VI 


I am indebted to Stephen Salnnt for tnakin}^ me aware oi these letercnces. 
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then presents conditions under which point spreads beat odds. The 
assumptions on beliefs and betting criteria are relaxed in Section VII 
and another example is presented. The final section contains some 
concluding remarks. 

11 

I he bookie brings together individuals with disparate belief's regard¬ 
ing X by producing a betting game called spread-odds (s,z). Depend¬ 
ing on the bookie’s choice of the spread parameter s and the odds 
parameter 2 , the spread-odds (s,z) game is either the point-spread 
game, the odds game, or a game combining features of both point 
spreads and odds. The spread-odds (s,z) device, apparently first used 
by Smith (1971), is convenient for stating results and for obtaining 
conditions under which point spreads beat odds and any other 
combined-betting game. 

The bookie collects a fee or commi.s.sion r for each bet purchased in 
the spread-odds (\,z) game. The commission is assumed fixed so that s 
and 2 are the bookie’s only choice variables.* Furthermore, the 
bookie’s administrative and other costs per bet are assumed to not 
depend on the particular spread-odds (.<, 2 ) game which is produced. 

Bettors are offered a Rams bet or a Bears bet in the spread-odds 
(s, 2 ) game (bettors will also have the option of not betting). The bets 
are a (unction of (*, 2 ) where -<» < j < =» and 0 < 2 < 1. Each bet is 
described by its net gain, its net loss, and the event under which the 
gain is lealized. d he spread-odds (.s,z) bets arc: Rams bet, [22 — c:2(l 
— z) + c, X > ,t]; Bears bet, (2(1 — 2 ) — c: 22 4- c,X < . 5 ]. An individual 
who puK bases one Rams bet will pay the bookie 2(1 — 2 ) -I- c dollars 
and w'in two dollars gross, 22 — c dollars net, ifX > .s. If X < s the Rams 
bettor loses his initial payment of 2(1 — 2 ) + c dollars.* Similarly, a 
Bears bettor wins 2(1 — 2 ) — c dollars per bet if X < s and loses 22 + c 
dollars per bet if X > .s.^ 

“ In practu e the bnokic'si cciiiiinission is al.so a choice variable whose equilibrium value 
IS dclerinined by the belting industry’s maiket .structure. For an inletesting study in a 
slightly different contest of how the choice ofc can inOuente (a race track’.s) profit, see 
Cruen (1976), To partially (usiity thee fixed assumption used here, one might supptrse 
a competitive betting industry in long-run equilibrium with the average cost pei wager 
equal to c 

^ If X = t all Itets are off. This differs fiom some Itetting games in which the btxrkic 
wins all tics 

' The odds parameter 2 is a transformation of what is usually referred to as "the 
(xlds.” E.g., if 2 = 2/3 and r = 1/99, then the Rams bet is established at (131/99:67/99) 
and the Bears bei at (65/99; 133/99), In the usual parlance this represents approxi¬ 
mately 1.96 to 1 odds and I to 2.0.5 odds. lespectively, the slight asymmetry in the odds 
Iteing the result of ihe fee r. In older that the net gains and losses in the spread-odds 
b, 2 ) game Ise positive, the domain ofc must he slightly restricted to the interval (c/2, 1 - 
r/2). 
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The bookie’s choice of a spread-odds (s,z) game is restricted to the 
set of (s,z) which makes profit indepiendent of the outcome of the 
football game. This domain for (s.z) is called the equilibrium set and is 
given hy y = [(j,z) | \k(s,z:c) = Aa(i,z;c)], where Aj, and A^ denote 
the number of Rams and Bears bets purchased at the spread s, the 
odds z, given the commission c. 

To see that (i,z) in y makes profit independent of X, consider the 
betting pool (the total dollars wagered on the Rams and Bears), [2( 1 — 
z) -I- c]Aa + I2z + c]Aa. For (j,z) in K the pool is (1 -(- c)A where A = A* 
-I- Afl. From the pool the bookie returns 2Aa dollars to Rams bettors if 
X > s, and at (5,z) in V this is identical to the 2Ab dollars returned to 
Bears bettors if X < s. The bookie’s profit is therefore cA(5,z: c) with 
certainty for each (s.z) in F. 

A point-spread game is spread-odds (s, i4); that is, z is fixed at *A. 
With this restriction, wagers are made at (neglecting c) even odds on X 
> s or X < s. The equilibrium point spread is T where (T, 'A) is in F, 
spread-odds (T, *4) is the point-spread game which the bookie may 
produce, and the profit from producing this game is cA(J, Yf.c). 

An odds-betting game is spread-odds (0, z) so that bets are made on 
the events X > 0 or X < 0, Rams win or Bears win. The equilibrium 
odds is denoted by z where (0,z) is in F, spread-odds (0,z) is the odds 
game which the bookie might produce, and the profit from this game 
is cA(0,z :c).® 

The profit-maximizing bookie who is permitted to produce any 
spread-odds (5,z) game will select an (s*,z*) in F so that A(s*,z*:c) is a 
maximum, and spread-odds (s*,z*) is called an optimal betting game. 
If A is constant over all (s,z) in F then every betting game is equivalent 
and optimal, and the bookie’s profit is identical no matter which game 
is produced. This situation corresponds to the initial hyjxithesis 
under which the bookie is indifferent between alternative betting 
games. If A(j’'‘,z*:<:) > A{s,z:c) for some (.v,z) in F, then some betting 
games will not be produced because they result in lower profits than 
an optimal game. Finally, when A reaches a maximum for a unique 
(.r,z) in F, there is a unique optimal game, and profit maximizing by 
the bookie completely determines the spread-odds (s,z) game which is 
produced. 

The bookie who is constrained to either point spreads or odds will 
produce the former when A(s, V^;c) > A(0,z:c). When this holds we 
will say that point spreads beat odds. Notice that point spreads, even 
if it beats odds, need not be an optimal game. Finally, an extreme case 
to be considered in Section VII occurs when point spreads beat odds 


‘ In order that the poinl-ispread and odds games arc not identical, ii is supposed that s 
IS not equal to zero. 
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and A(0,z: r) = 0; that is, no one purchases the odds wager. In this 
case we will say that point spreads versus odds is no contest. 


Remarks 

1. In practice a bookie who fails to select an (s,z) in F will try to 
exchange some of his wagers with another bookie or will begin to alter 
the spread and odds parameters. Hence, all bets may not be pur¬ 
chased at the same (.s,z). In the model here, however, all bets are 
assumed to be made at an (v,z) in V. One might suppose that the 
bookie’s skill is his ability to read the market and know the (.s,z) such 
that Afi = Afl. 

2. The equilibrium set corresponds to one possible measure of 
consensus regarding the distribution of X. If 7r(s) = z for (.s,z) in V, 
then TT is a market probability distribution in the sense that the fraction 
of the betting pool (excluding the bookie fee) wagered on the event X 
< is z, and the fraction wagered on X > ,v is 1 — z. Bookies who fail to 
select (i,z) in F become bettors themselves and effectively are betting 
that their opinion is more accurate than this market consensus. 
Natural selection probably eliminates the.se bookies quickly either 
because they have become wealthy and retire or, more likely, because 
they have become financially or physically bankrupt, the latter bank- 
ntptcy being a likely consequence of not paying winning bettors. 

3. In the standard point-spread game the bookie’s commission 
(called the vigorish) results in spread bets being offered at 10:11 
odds. With the spread-odds (s,z) notation this means that the point- 
spread bet is offered at trdds t>f 20/21;22/21 and r = 1/21. 

Ill 

The set of potential bettors is denoted by N and remains fixed 
througlunit. Associated with each individual t in A/ is a probability 
distribution /'.(x) and a belting function; Fi(x) is the /th bettor’s proba¬ 
bility assignment to the event X < x, and the betting function along 
with Fi(x) determines the number of Rams or Bears bets purchased by 
bettor i. 

Aside from the example in Section VII, beliefs will be assumed 
identical except for a location parameter. With this simple specifica¬ 
tion Fi(x) = G(x — /u.(), and G is further taken to be a symmetric (about 
zero, G(x) = 1 — G(— x)) distribution with continuous derivative g > 
0. Given G, the ith bettor’s beliefs are completely determined by /Xf, 
the bettor’s opinion regarding the median-mathematical expectation 
of X. The distribution of the /u.( over the set of potential bettors is 
approximated with N large by the continuous and increasing dis- 
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tribution A4 (y - fji ^)—the proportion of potential bettors who believe 
the median-mathematical expectation ofX is less than^i. The distribu¬ 
tion M is further assumed symmetric about zero so that is the 
median or average of all bettors’ beliefs regarding the median- 
mathematical expectation of X. Market beliefs under this simple rep¬ 
resentation are therefore a location family of distributions depending 
on G, Af, and 

A potential bettor will participate in the betting market by pur¬ 
chasing a positive quantity of the Rams or Bears bet if and only if the 
EMV of the corresponding bet is positive. With this requirement the 
set of potential bettors may be partitioned as follows. 

Given a spread-odds (s.z) game the£MK of one Rams bet for bettor 
i is 2z — c — 2G(s — /j,,). The requirement that this be positive can be 
expressed directly in terms of ft, as fji,> s - Q(z - c/2), where Q(p) is 
the (increasing) inverse or plh quantile of G. Making similar calcula¬ 
tions for the Bears bet gives the partitioning, Rams bettors = [i in 
/V|/i( > 5 — Q(z — c/2)], Bears bettors = [i in A/1/X| < j — Q(z + c/2)]; 
and the nonbettors are those individuals whose /t., is the interval [s - 
Q(z + c/2), s - Q(z - c/2)]. This holds for arbitrary, not necessarily 
equilibrium, values for s and z. To determine £ requires further 
assumptions which describe the quantity of wageis purchased when 
the EME is positive. 

Let X* and Xi” denote the number ol Rams and Bears bets pur¬ 
chased by the ith individual. The betting rule assumed here is X,* = 
X[pii — s 4- Q(z - c/2)l, and X,” = XU — /zi — Q(z + c/2)|, where X(u») is a 
positive and nondecreasing function for te > 0 and is zero otherwise. 
I his rule is consistent with the positive £AfF requirement. If X(u’) = 1 
for w > 0, bettors are restricted to a single bet when its EMV is 
positive. If X(w) is increasing forte > 0 the quantity of bets purchased 
is based not on EMV directly but on the difference between /a, and j - 
Q_{z - c/2) or s - Q_{z 4- c/2), depending on which wager’s EMV is 
positive. If /a-t > > .i - Q{z — c/2) the EMV of one Rams bet is 

greater for bettor 1 and this liettor purchases more Rams bets than 
bettor 2, Xi" > X 2 '' > 0. With this betting rule we will have X," = X," > 0 
(t and j have equal but opposite opinions) when /a, and n, are equidis¬ 
tant but on opposite sides of the interval which describes the nonbet- 
I ors. 


Remarks 

1. The assumption of a large numlrer of individuals with well- 
defined distributions defined on X = R ~ B is an empirical proposi¬ 
tion which is probably not unrealistic (.see Winkler 1971). If X repre¬ 
sents an arbitrary future event, however, the assumption need not be 



760 JOURNAL OF POLITICAL ECONOMY 

satisfied, and the model will not be applicable.^ For example, if indi¬ 
viduals do not possess a subjective distribution on the electoral (or 
popular) vote point spread, then the model cannot explain why odds 
beats point spreads in wagering on the outcome of a presidential 
election. 

2. Under the football game interpretation, each potential bettor 
knows that X = R — B is an integer-valued random variable. There¬ 
fore, there is the assumption that the outcomes of R ~ B are 
sufficiently numerous that Fi(x) may be approximated by a condnu- 
ous distribution. In other cases this may be far from adequate, for 
example, baseball and soccer games. In the limit where the winner 
scores one and the loser zero, no point-spreads versus odds question 
will arise. 

3. The location family of distribuUons is a very restrictive assump¬ 
tion which undoubtedly does not allow for the full diversity of actual 
beliefs. The assumpdon is adopted because it permits easy calculation 
of opdmal game conditions and thereby will illustrate, albeit in a 
restrictive setting, how one betting game can be favored over another. 

4. The assumed betting criterion is also adopted because it is con¬ 
venient. With a location family of market beliefs, neither the equilib¬ 
rium set nor the optimal game conditions will be found to depend on 

But the criterion might also be fairly realistic since bettors often 
rationalize their wagers by appealing to the difference between the 
quoted spread and the spread at which they would decide not to 
wager. 

IV 

Before examining the conditions for an optimal game given an arbi¬ 
trary IcKation family of market beliefs, it is instructive to consider the 
following special case. 

Suppose G is a uniform distribution on the interval (— 10, 10); G(x) 
= .5 + .05x. Let c = . 1, = 3, and let M be an arbitrary distribution 

satisfying the previous assumptions. 

A Rams bettor is anyone who finds theTMF of one Rams bet to be 
positive or, on making the calculations, p.( > s — 20z +11. Under the 
assumed betting criterion such a bettor purchases ki" = X(/Z( — T — 4) 
Rams bets, where we let T denote .? — 20z + 7. Summing (integrating) 
over all such purchases gives 

• The assumption can fail in either of two ways. Beliefs might be so vague that 
individuals feel completely incapable of assigning probabilities to future events. Alter¬ 
natively, beliefs might l>c inconsistent with a probability distribution, e.g., if one as¬ 
signed probabilities of .3 and .8 to the respective events X < 0 and X > 1. 
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~ ^ ^)dM(iJL - 3) 

" - r - l)dM{w), 

where n is the number of potential bettors. Making similar calcula¬ 
tions gives 

A" = + T — l)dM(u;). 

This expresses the total quantity of Rams and Bears bets purchased 
given an arbitrary spread-odds (i.z) game. 

To solve for the (s,z) in V use the symmetry of M and obtain 

A« - A« = n K(w - T - l)dM(w) 

“ ^Jw>-T+t ^ ~ l)dM(w). 

At T = 0 this is zero and, since Af is increasing, A* - A" = 0 only if 
T = 0. Hence, T = 0 or s(z) = 20z — 7 gives the equilibrium set; at any 
[5(z), z] the bookie’s profit will not depend on the realization ofX. The 
equilibrium point spread would be 5('A) = 3, and the equilibrium odds 
is a solution to s(z) = 0 or z = .35. 

Substituting s(z) for s in the expressions for A* and A® gives the 
representation of profits for (s,z) in y as 

c(Ar + Afl) = 2cn f X(ip — l)dM(w), 

J tt’>I 

a term which does not depend on (s,z). So why does the bookie 
produce one game, say point spreads, over another? No (determinis¬ 
tic) answer is to be found in this example. The bookie's profit is 
identical in all spread-odds (s,z) games, every game is optimal, and 
disparate beliefs as represented here do not offer an explanation for 
the production of a particular betting game. In a world where the 
assumptions of this example are satisfied, the existence of point- 
spread betting would merely represent the bookie’s random choice 
from the set of all spread-odds (s,z) games. 

To put this result differently, consider the point-spread game with 7 
= 3. Instead of computing the equilibrium odds, mimic the example 
in the introduction and ask each bettor to compute the odds game at 
which his behavior would be identical to behavior under point 
spreads. For example, a Rams bettor with point spreads will purchase 
X(V() bets where fi/ — 4 = V( > 0. An odds game in which the same 
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number of Rams bets would be purchased is found by solving /Xi — 11 
— 20z =v,, orz = .35. Every Rams and, with similar calculations, every 
Bears bettor will obtain this same solution, and at z = ,35 every 
nonbeitor under point spreads remains a nonbettor under odds. The 
behavior of each individual is therefore invariant under the trans¬ 
formation from (s,z) = (3, V 2 ) to (0, .35). Since (3, Y 2 ) is in V and 
because no behavior is altered at (0, .35), it follows that .35 is the 
equilibrium odds. The point-spread and odds games are therefore 
equivalent. 

In this example any individual can find an equalizing odds game, 
and this is generally true as long as bettors use anTA/F-based decision 
criteiion. What makes this example special and misleading is that 
every individual arrives at the same equalizing odds of z = .35. When 
G is not uniform we will see that not all bettors compute the same 
equalizing odds, and the invariance of betting behavior as (5,z) varies 
over F will no longer hold. 

V 

For the spread-odds (i,z) game with arbitrary (.s,z), the EAfF-based 
Ijetiing criterion provides a straightforward characterization of A" 
and as a function off?, M, and /x,«. The equilibrium set can then be 
obtained by solving for the (.\,z) which satisfy the equilibrium condi- 
ti(.)n. The betting pool and bookie profits for the spread-odds (i,z) 
game with (.v.z) in F then follows directly. 

Proposition 1: For arbitrary (not necessarily equilibrium) (i,z), 

A'* = + T — A)ilM (to), 

where 7’ = s - — (QCz + c/2) + Qj{z — c/2)]/2 and/f = rQ(z + c/2) - 

QPz - r/2)J/2. 

Proof: This follows from the assumed betting criterion, the repre¬ 
sentation of market beliefs, and the direct substitution w = fi — 
Propoiilion 2: I'he equilibrium .set is F = {(i,z)|.s(z) 
c/2) + Q(z - c/2)J/2}. 

Proof: By the symmetry of M we have 

A« - A" = nf \(w ~ T - A)dM - ri( \(w -FT - A)dM. 
•J u-:>A+r J ir>A~T 

I’his expression is zero at T = 0 and, using the assumption that Af is 
increasing, it may be verified that A" — A* < 0 for T > 0 and A'* — A^ 
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> 0 for t < 0. Hence A" — A" = 0 if and only if T = 0, and this is the 
proposition. 

Proposition 3: Bookie profits for [s(z), z] in V are 

c(A" + A^) = cnf K(w — A)dM(w). 

J «'>A 

An optimal game is therefore an Ij*(z*), z*) such that^(z*.T) = min 
A{z:c). 

Proof: The first part of the proposition follow.s on substituting s{z) 
for .s in proposition 1 and then using the symmetry of M. The second 
part follows on noting that with M increasing the expression for 
profits increases as A decreases. 

Substituting 4(z) for,? in the previous expression for nonbettors, we 
find that as (.v,z) varies over V, the nonbettors are those individuals 
whose fall in the interval [/x*, — A(z:c), + A(z:c)]. Since the p., are 

symmetric about the proportion of potential bettors who purchase 
no l)ets is 1 — 2M[A(z:c)]. Since Af is increasing this proportion is 
minimized at nun A(z:c), which is the same as the maximum-profit 
condition. The objectives of maximizing profits and maximizing the 
participation of bettors in the market are theref ore equivalent, given 
our assumptions on beliefs and betting behavior. 

When/f(z:c) is a constant function ofz, all betting games will yield 
equal profits and an equal number of bettors. This occurs when Q is 
linear (for 0 < p < 1), and this in turn implies that G is uniform, the 
example of the previous section. Conversely, when G is not uniform A 
is not constant, and .some betting games will not be produced because 
they generate lower profits. The bookie fee in effect drives a wedge, 
consisting of the nonbettors, between Rams and Bears bettors, and 
the size of this wedge depends on which spread-odds (\,z) game is 
produced. 

Finally, consider the point-spread game with the equilibrium 
spread pw. If Rams bettors compute an odds game at which their 
behavior is identical with the point-spread game, they will find z^ = 
G(Q[(1 — f)/2] — pif} + c/2. The Bears bettors will compute z^ = 
+ c)/2] — p.vf} — c/2. IfG is not uniform the equilibrium odds 
are not equal to Zr or z„ so that the odds game with z = z„ or Zg is not a 
feasible game for the bookie. The existence of equalizing odds for any 
given bettor does not therefore imply that all Iretting games are 
equivalent. The point-spreads versus odds question depends on bet¬ 
ting behavior not at equalizing odds but at the equilibrium odds, and 
unless G is uniform these two odds will not be eqvial. 

Given the assumed beliefs and betting criterion, proposition 3 gives 
the optimality condition for spread-odds (s,z); namely, the bttokie 
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produces an [a(z), z] game such that A(z:c) is a minimum. The next 
section translates this condition into simple conditions on G. 


Remark 

Let the (median) consensus probability of the event X < r be denoted 
by7rM(s) where half of the individuals inA( believeP(X <s) < ir/nis) and 
half believe P{X < s) > 7r*f(r). Suppose further that this consensus is 
correct; If ttjAs) = z for a large number of football games, thenz is the 
frequency of X < t. It may be verified that for the present model 7Tnf(s) 
= G(5 - fjLm). Furthermore, if c = 0 everyone in N purchases a wager 
(the set of nonbettors has probability zero), and ttm is revealed by the 
equilibrium set since F = [(s,z) j7r«^s) = z]. However, from proposition 
2 we see that with c > 0 there will exist discrepancies between 7r(s) = z, 
(s,z) in V and the true measure of the market’s opinion This 

discrepancy arises because with c > 0 some potential bettors do not 
wager, and their opinions are not fully represented by the equilibrium 
set. For example, if c = .2, G is a mean zero, variance 49 Gaussian 
distribution and p-v, = 7, then one finds the equilibrium odds is z = 
.179, and this is a biased estimate of the potential market’s belief 
regarding X < 0 since rr^^O) = .159. Hence z will not reflect the 
fiet(uency of X < 0 even if the consensus beliefs of potential bettors 
are correct. Such discrepancies might be a reason for the observed 
difference between the probabilities implied by the odds in horse race 
betting and the frequency with which horses win (see Ali [1977, p. 
810] for a similar result; see also Seligman [1975]). 

VI 

Suppose in addition to previous assumptions that G is strictly uni- 
modal; that is,g(0) > g{x) for allx andg(x) increasing (decreasing) asx 
is negative (positive). With this assumption fii is the expectation, me¬ 
dian, and mode or most likely outcome ofX. In such a world we have; 

Theorem 1: If G is strictly unimodal, then point spreads is the 
unique optimal betting game. 

Proof: The first-order condition for the minimum of A hq(z + Vtc) 

— q(z — c/2) = 0, or g[Q(z + c/2)]'' — g[Q(z — c/2)]'‘ = 0. With G 
symmetric and strictly unimodal this holds if and only if Q(z + c/2) = 

— (2(z - c/2) = <2(1 — z + c/2), or z = !4. One may now verify that/4 is 
decreasing for z < ‘A and increasing for z > ]A so that at z = Vi, A is at 
its unique minimum value. By proposition 3, z = 'A, or the jxiint- 
spread game is the unique optimal betting game. 

To obtain simple necessary and sufficient conditions which 
characterize the optimal game, suppose c is small and use a linear 
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approximation for Q(z + c/2). Expanding around c = 0 and suppress¬ 
ing quadratic and higher-order terms gives Q{z -H c/2) = Q(z) 
+ q(z)cl2. Up to this order of approximation a(z) = + QU). 2 = 

G(-fiM) = 1 - G(/a«), and A(z:c) = q(z) = g[(2(z)]-‘. 

Theorem 2: For c small, spread-odds [5(z*)4*] is an optimal game if 
and only if Q_(z*) is a global mode of C(x).'^ 

Proof: Minimizing A(z:c) with c small is approximately the same as 
minimizing q(z) = g[Q(z)]"‘, and by definition this minimum is 
achieved at a mode of G(x). (Notice that if QU*] is a mode, then by 
symmetry so is -Q[z*] = Qi 1 - z*], so that [t( 1 - z*), 1 - z*] is also an 
optimal game.) 

Theorem 3: For c small, point spreads beat odds if and only if g(0) > 
giPu)- 

Proof: With c small, .4(z : c) = gi/imT' (since = gf-pwD and 

A(1/2:c) = g(0)~' so that the theorem follows from proposition 3. 

It should be emphasized that the simple optimality conditions in 
this section are a consequence of the simple specification of beliefs 
and betting behavior. Under less restrictive assumptions there will 
undoubtedly exist different configurations of beliefs, betting behav¬ 
ior, and preferences which are revealed by the bookie’s production of 
point spreads. 

The conditions illustrate how market beliefs and betting behavior 
can be mapped via the bookie into a particular betting game even 
when betting games are considered equally thrilling. For example, 
under the assumptions of theorem 2, the existence of point-spread 
betting does not reveal bettor preferences but instead reveals an 
aspect of a bettor’s beliefs, namely, that 0 = QC/z) is the mode of G. 
There is not a one-to-one correspondence between point spreads and 
the previously described bettor-preferences explanation, and such an 
explanation therefore is not logically necessary in order to under¬ 
stand the existence of point-spread betting. 

VII 

In this section the location-family-of-market-beliefs assumption is 
dropped, individual betting is supposed consistent with the positive 
EMV requirement but is otherwise arbitrary, and a very simple exam¬ 
ple is presented in which point spreads versus odds is no contest. Only 
the point-spread and odds games are compared, and there is no 
attempt to obtain simple or general optimality conditions. The exam¬ 
ple is presented to illustrate how point spreads can be favored over 
odds when bettors use an arbitrary £AfF-based decision criterion. The 

’ Jt* is a global mode of C if g(**) ^ gM for all x. 
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more specific criterion used in the previous sections, while influencing 
the particular optimality conditions, is not therefore crucial for the 
bookie to favor one iietting game over another. 

Suppose N consists of just two potential bettors. Let c = 1/9 and 
suppose beliefs are given by the uniform distributions. 
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These distributions arc depicted in figure I. Observe that the dis¬ 
tributions do not diff er by just location and are not therefore covered 
by the previous sections. Also note that there are disparate be¬ 
liefs both in the sense that F, F-^ and also in the sense that 
beliefs regarding the event X < 0 are much less dispersed than 
beliefs regarding, say, X < 2. 

The EMVs of the Rams and Bears l>ets for both individuals given 
the point-spread game are graphed in figure 2 as a function of s. 
From the figure one sees that when 4/3 ^ s 24/9, individual 1 is a 
Bears bettor and individual 2 is a Rams bettor. The equilibrium point 
sjtread will therefore be somewhere in this interval, with the particu¬ 
lar spread being dependent on each indivicfual’s betting criterion. 
Under the criterion of Section HI the etjuilibrium spread is 2; if 
individuals purchase a quantity of wagers which is prttportional to the 
EMV of the wager, then T = 16/9. But in any event both potential 




F r<*. *2.. — EMV?, ot Bears and Rams bets (oi pi'teniial bettcirs given point-spread game 
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Fu;, 3 ,—EMVs ot Bears and Rains bets for potential bettors ^ven odds game 

bettors will participate in the betting market by purchasing some 
point-spread bets. 

The EMVs of the Rams and Bears bets given the odds game are 
graphed in figure 3 as a function of z. For z < 2/9 the EMV of a Bears 
bet is positive for both individuals and for z > 5/18 the£AlK ot a Rams 
bet is positive for both individuals; hence, the equilibrium odds can¬ 
not be in either of these intervals. The only way to satisfy the equilib¬ 
rium condition is to let 2/9 s z < 5/18, and with any such odds both 
individuals purchase no bets. I'herefore point spreads versus odds is 
no conle.st. If the point-spread device had not been invented and 
wagers could only be made on the winner of the game, this football 
betting market would not exist. 

VIII 

So what is the answer to the point-spreads versus odds question? 
Perhaps the answer is that point spreads are more fun than odds. 
Since this, however, is not the only logical possibility, the answer 
ultimately must be based on empirical evidence. In a world where 
point spreads arc not considered more fun than odds, point spreads 
can exist as the game produced by a profit-maximizing bookie. Dif¬ 
ference of opinion, besides leading to the formation ol a betting 
market, can also have implications for the particular l)etiing game 
produced by the bookie. Finally, to profit from these findings you 
should look for an informal setting in a location-family-oi-beliefs 
world where bettors u.sc a criterion as described in .Section Ill. If a 
profit-maximizing bookie is pnxlucing the point-spread game. )ou 
might then wager that your fellow' bettors have expectations equal to 
their modes. 


References 

Ali, Mukhtar M. "Probability and Utility Kstiinates (or Raceliatk Beitois." 
J.P.E. 85. no. 4 (August 1977): 803-16. 



JOURNAL OF POLITICAL ECONOMY 


768 

Gruen, Arthur. “An Inquiry into the Economics of Race-Track Gambling.” 

J.P.E. 84, no. 1 (February 1976): 169-77. 

Pankoff, l.yn D, “Market Efficiency and Football Belting.”y. Bus. 41 (April 
1968); 203-14. 

Seligman, Daniel. “A Thinking Man’s Guide to Losing at the Track." Fortune 
92 (September 1975); 81-87. 

Smith, Vernon L. "Economic Theory of Wager Markets.” Western Econ. J. 9 
(September 1971): 242-55. 

Winkler, Robert L. "Probabilistic Prediction; Some Experimental Results.”_/. 
Amenran Statis. Assoc. 66 (December 1971): 675-85. 



Time Preference and International Lending and 
Borrowing in an Overlapping-Generations 
Model 


Willem H. Buiter 

UnwersUy of Bristol and NBEJi 


Two economies, represented by Diamond-type overlapping- 
generations models and differing only in their pure rates of time 
preference, are joined together. Capital formation, balance-of- 
payments behavior, and welfare are compared under autarky and 
openness. With a positive natural rate of growth, the low-time- 
preference country runs a current account surplus in the steady state 
but not necessarily outside it. If preexisting capital is not shiftable 
between countries, integration in the world economy makes the 
high-tiine-preference country worse off in the short run. The rank¬ 
ing of stationary utility levels under autarky and openness is am¬ 
biguous. 


I. Introduction 

The purpose of this paper is to explain international capital move¬ 
ments from differences in time preference and to evaluate the short- 
run and long-run welfare implications of a change from a situation of 
trade and financial autarky to one of openness in trade and finance, 
rhe analysis is conducted in terms of a deterministic model of two 
countries, each one of which consists of a Samuelson-Diamond type of 
overlapping-generations model and produces one identical good. All 
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detcrniinanis of international trade and international lending and 
borrowing other than a single Utste parameter, the pure rate of time 
preference, arc ignored. There are no difference.s in technology, in 
exogenous factor endowments, in scale, and in market structure: nor 
are there taxes, subsidies, and other man-made wedges between 
buyers’ and sellers’ prices or Ixjtween domestic and foreign prices. 

There is an extensive literature on the determinants of interna¬ 
tional capital movements. Starting from the accounting identity that a 
country’s net international capital outflow equals the difference be¬ 
tween national saving and domestic capital formation, a number of 
diff erent approaches have been taken to the determinants of saving 
and investment. Ad hoc .saving functions, mainly specifying saving as 
a constant fraction of disposable income, are the most common (Borts 
1964; Neher 1970; Fischer and Frenkel 1972; Onitsuka 1974; Hori 
and Stein 1977; Rufhn 1979). Stiglitz (1970) derived .saving behavior 
frttm the utility-maximizing choices of infinite-lived individuals. The 
approach adopted here and in Fried (1980) has optimizing, overlap¬ 
ping generations with finite lives. This is especially useful for the 
short-run and long-run welfare analyses of a move from autarky to 
openness. Capital formation is carried to the point where the margi¬ 
nal product equals the interest rate. This is different from the 
cxftgenous investment of Neher (1970) and Onitsuka (1974), the ad 
hoc (Keynesian) itivestment function of Borts (1964), and the cost- 
of-adjustment investment function of Fischer and Frenkel (1972). 
Only a single good, which can be consumed or used as a capital good, 
is produced in both countries. This is the same as in Neher (1970), 
Onitsuka (1974). and Ruffin (1979). Stiglilz. (1970) models production 
of and trade in distitici consumer goods and capital goods, as do 
Fischer and Fienkel (1972). Ht>ri and Stein (1977) have di.stinct con¬ 
sumer goods and capital gtKKls, but each of the two countries in their 
model is assumed to specialize completely in production of one of 
these goods. Borts (1964) distinguishes between traded and non- 
traded (labor-intensive) goods; traded goods are imports and exports; 
the traded gottds can be either consumer goods or investment goods. 
The one-comnmdiiy model cannot be u.sed to analyze the effect of 
capital mobility on the terms of trade, a concern of Boris (1964), 
Stiglitz (1970), and Hori and Stein (1977). The one-commodity 
structure also means that trade in goods must be matched by interna¬ 
tional financial mobility. Stiglitz (1970) has trade in consumer goods 
and capital goods but no financial capital mobility. Fischer and Fren¬ 
kel (1972) have international trade in capital goods, in consumer 
goods, and in financial claims—as does Borts (1964), although there 
all capital goods are imported. Finally, this paper considers a two- 
country model, as do Stiglitz (1970), Hori and Stein (1977), and Ruffin 
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(1979). The small-country case was analyzed by Boris (1964), Neher 
(1970), Fischer and Frenkel (1972), and Onitsuka (1974). In a recent 
paper, Findlay (1978) has approached the problem of international 
trade, lime preference, and interest equalization from an “Austrian” 
viewpoint. This paper can be viewed in part as a neoclassical comple¬ 
ment to Findlay’s analysis. Section II describes the autarky equilib¬ 
rium of the two countries. Section III analyzes the open, integrated 
world economy. 

One of the conclusions of the positive analysis, that in the steady 
state (but not necessarily outside it) the high-time-preference country 
runs a current account deficit, is hardly surprising and deserves 
mentioning mainly because of its rigorous foundation in optimizing 
behavior. The welfare analysis of autarky versus openness in Section 
IV presents some rather less conventional results. If preexisting cap¬ 
ital is nonshiftable between countries, the high-time-prcference 
country is wor.se off under optenness in the short run. The long-run 
welfare analysis is more complex but also fails to generate unambigu¬ 
ous welfare improvements for both countries under openness. 


II. The Autarky Equilibrium 

I'he world economy consists of two countries, the “home" country 
and the "foreign” country, identical in every respect except in taste. 
Eat h country is represented by competitive output and factor mar¬ 
kets, two overlapping generations, anti an itlentical, well-behaved 
constant returtis to scale prtKluction function /, a intxlel due to 
Diamond (1965).' 

Notation 

rf = consumplittn while young by a member of generation t; 
r'( = consumptitm while old by a member of generatittn t; 

L, = size of generatit)!! 1: 

K, = capita! stock in period t; 
k, =K,IL,- 

w, = real wage in perii>d /; 

r, = interest rate on savings carried from jjeriod t - 1 into 
period /; 

n = one-period prttporiional rate of growth of population 
= (1 -t- «)/.,; n 3^ 0; 

p = pure rate of time preference; p ^ 0; 

A, = wealth in peritnl t; 

' I'hc original development of the overlapping-generatioirs nuxlcl is b> Sainuelson 
(1958). See also t^ass .inti Yaari 1966. 
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a, = A(/Lt\ 

B, = trade balance surplus of the home country in period t\ 

b, =B,/Lr, 

G, = current account surplus of the home country in period t; 
g, = G,IL,. 

Mmnentary Equilibrium 

The momentary or single-period equilibrium of the closed economy is 
summarized in equations (I)-(7): 


maxM(ci,cf), (1) 

w, - c} 3= cf(l + (2) 

rl cj ^ 0, (3) 

ui, =J(k,) - k,/'(k,), (4) 

( 5 ) 

u', - c} = + n), (6) 

k, s 0. (7) 

The utility function, u, is twice differentiable, strictly quasi-concave, 

and increasing in c' and c*. The quantity Ui(0, c*) = u^lc', 0) = 


c*) = u^ic', «) = 0. The function/is twice differentiable;/(0) = 
0; /' > 0:/' < 0; /'(O) = +0C and/'(o^) = 0. 

Individuals within a given country are identical within and across 
generations. People live for two periods, work in the first period of 
their lives, and retire in the second. Labor supply is inelastic. The 
individual’s optimization problem is given in equations (1), (2), and 
(3). The utility of lifetime consumption is maximized subject to the 
lifetime budget constraint (2). The conditions on u ensure that the 
budget constraint will hold with equality and that an interior solution 
will be obtained for c} and cf. Equations (4) and (5) state that the labor 
market and capital rental market are competitive. The capital stock at 
the beginning of period t -t 1 equals the value of the saving in period 
1. This economy-wide capital market equilibrium condition is given in 
equation (6). Real capital is the only store of value. All saving in period 
t is performed by the young members of generation t. The old dissave 
previously accumulated wealth. The conditions on /ensure that k( will 
be positive. 

When we consider the consequences of different pure rates of time 
preference, the utility function u(c,', cf) will be specialized to the 
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following additively separable form: 

«(«t. c?) = v(cl) + (1 (>y‘v(cf). (8) 

Here p is the (constant) pure rate of time preference. More generally, 
the pure rate of time preference is the marginal rate of substitution 
between consumption when young and consumption when old, when 
equal amounts are consumed in both jjeriods, minus one. It can be a 
function of the level of consumption: p(c) = [ui(c, c)lu 2 (c, c}] - 1. 

The general solution of (l)-(7) is .straightforward and can be 
found, for example, in Diamond (1965). The interior hrst-order 
conditions for the individual are: 


cf) 

Uiic}, cj) 


1 -t- 


(9) 


For the utility function of equation (8), this becomes; 


v'(c})il + p) 
v'{cf) 


+ r,+,. 


(9') 


The consumer’s intertemporal equilibrium is graphed in figure 1. 
Equations (9) and (2) (the latter holding with equality) can be solved 
for c} and cf as functions of Wt and r,+ ,: 


f/3 c'(a'o f<+i). (lOa) 

f? = c*(ie,, r,+i).“ (10b) 

Similarly, equations (9') and (2) can be solved to yield; 

cl == c'(w,, r,+„ p), (10-a') 

c'f = c^(w,, r,+,, p). (lOb') 


If consumption in both periods is a normal good, 1 > clr, c5 > 0. VVe 
shall assume this to be the case. 


^dcl _ __ ~-(“n ~ (I r,+i)uMj _ 

dv, [U|j - (I + r,+ ,)u„](i + - u„ + (I + n,,)Ua 


dr,' _ u, - cf(l + ( I + ’^ni)u«l 

ar,+, u„ - (I + r,„)(2u,i - (I + ri4i)urj) 


dcf 

dr,t 


dcf 

dw, 




= -(1 + Cmi) 


del 

dr,^, 


I -t- n. 
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Kk. 1.— I'hc oMifuiner’s interKfmpor.il equilibrium 


Sleady-Slole Equilibrium 


rhe stfacJy-state oi long-run e(|uilibriuin is that .sequenre of’ momen¬ 
tary equilibria in which the lifetime per capita consumption profile 
remains constant from generation to genetation. It can be charac- 
terii'ed by the following set of equations; 


u,(c\r^) 
Wife /■■■') 


1 -t- r, 


( 11 ) 


n'(c')(l + p) 


+ r, 


(IT) 


I + r ’ 
w = f{k) - k/'(k), 

r =f'(k), 

w — c' ~ k{\ + n). 


( 12 ) 

(I-^) 

(14) 

(15) 


By substituting equations (13) and (14) into (12) and (15), we obtain 
two equations that represent the stationary decentralized (or compet¬ 
itive) consumption possibility locus; 
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Fit.. 2.— 1 lie stjitionjry competitive [Xissibiliiy Itxus 


f{k) - kf’ik) . (J6a) 

f{k)— kf'(k) = c''+A(l + n). (16b) 

The.se two equations can be solved for r' as a function of r^; 


r ‘ = i/i(c'''); (/»' = 


{1 + n + kf") 

(1 +«)(! +/' +kj") 


(17) 


For the f^obb-Douglas production function /(A) = A", 0 < « < I, the 
stationary decentralized consuinpiion possibility Itxiis has the shape 
of OF in figure 2.^ Along tliis locus; {dr'kk^) = —{[I + n + (a — 
l)aA“ ']/(! + «)(l + a^A“ ')}. At the origin (with A = 0), {decide'-) = 
[a/(l — «)](! + n) > 0. The capital-Iabonatio, A, increases moiiotoni- 
cally as one moves from O loF. As A approaches infinity (which would 


“ t he historic ai ongiii.s of Hg. 2 aic not < ompletely clear It wa.s a vcell-establishetl part 
of the “Yale blacklMiard tratiilioii" when 1 tame to Yale a.s a gr.iriiiate siiideiit in 1971 
James Tobin suggests that David Class may have been ihe first one to ulili/e it. ll has 
since been used as an expository device by Ihori (1978), Buiter (1979). and (larmichacl 
(1979). 
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be fore* < 0), (dc^ldc^) becomes — 1. Whenc' = 0 at/’, (decide') = — [1 + 
a^(l + n)(l — a)~'](l — a)~’ < 0. At the golden-rule capital-labor 
ratio, defined by/’(A) = n, {decide') is — [1 +/'(A)] = — (1 + n).^ I shall 
assume in what follows that OF is as in figure 2: strictly concave 
toward the r* axis for nonnegative values of c' and c*. This is certainly 
the case for the Cobb-Douglas production function but need not be 
true for alternative, otherwise well-behaved neoclassical production 
functions. It is. for example, quite possible for OF, while remaining 
strictly concave toward the origin, to be upward sloping at F. This 
would refiect extreme overaccumulation of capital; An increase in k at 
high values of ^ depresses the marginal product of capital to such an 
extent that reductions in both c' and c* are required to generate the 
additional saving required to maintain the higher value of A. 

A stationary competitive equilibrium is described by equations (16a) 
and (16b) and a tangency of a private budget constraint with slope 1 -I- 
l'(k) — 1 - 1 - r to an indifference curve, as given in (11) or (IT). At 
capital-labor ratios above the golden-rule level, the budget constraint 
cuts the OF curve from above if the production function is Cobb- 
Douglas.’ In what follows I shall assume that this condition holds. 

In figure 2 1 have drawn such an equilibrium at E^, corresponding 
to a capital-labor ratio in excess of the golden-rule capital-labor ratio. 
1 his stationary competitive equilibrium is inefficient in the sense that 
any lower capital-labor ratio that does not lie below the golden-rule 
capital-labor ratio can sustain higher stationary paths of per capita 
consumption and does not require any sacrifice in consumption to 
move to these higher stationary paths if only physical resource con¬ 
straints are taken as binding. The physical resource constraint faced 
by an omniscient and omnipotent social planner is -t- c,'JL, ^ 

f{k,)L, + K, - K,+ u or 

c?-,(l + w)-' + c} «/(*,) +kt- A,+ ,(l -I- n). (18) 

The stationary resource constraint is; 

* In what lollows 1 shall assume that ihe golden-rule capital-labor ratio is less than the 
rapital-lalxir ratio at F. This need not be the case. From (16b) it is clear that with a 
Cobb-Douglas production lunciion, c‘ = 0 implies either t = 0 (at the origin) or (1 — 
a)A“~' = (1 + n). (This requires that A be less than unity alF.) Thus, at F, k°' = (1 + 
«)/(! - a). At the golden rule, = n/a. It is likely, but not inevitable, that A at F 
exceeds the golden-nilc capital-labor ratio. 

' The slope ot the consumption possibility frontier, 5,, IS given by: 5, =((1 + n)(l + 
a*A‘'~‘)|/|( 1 - a)aA“ ‘ - (1 •(-n)]. The slope ol the budget constraint, Sj, is given by Sj = 
— (1 + ak°~'). When / '(A) = aA*"‘ < n, both are negative. 

c _ _ 1 + n + (I + n)g*A°~‘ _ 

‘ 1 -Ml -r (1 -I- nWk<^' -t- (1 - a)oA“-'(n - akT') ' 

This is < 1 if n > aA““'. 1 am obliged to an anonymous referee for saving me from error 
here. 
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—(19) 

1 T n 

A social planner aiming to maximize the stationary value of per 
capita resources available for consumption by the young and the old 
chooses k such that f'(k) = n. The social planner’s stationary con¬ 
sumption possibility locus is S| — in figure 2. The decentralized 
competitive economy will settle somewhere on OF if the economy has 
a stable equilibrium.* An equilibrium such asT, in figure 2 represents 
a steady-state capital-labor ratio below the golden rule. This cannot be 
characterized as inefficient because of (temporary) sacrifices in con¬ 
sumption that would be required to reach a higher capital-labor ratio. 

Stability 

Substituting (10a) into (6) and using (4) and (5), we obtain a first-order 
difference equation in A, that describes the evolution of the model 
from arbitrary initial conditions: 

/(A,) - k,f'(k,) - c'[f(k,) - A,/'(A,)./'(*,^,)] = A,^,(l + n). (20) 
The model will be locally stable when | (dki^ildk,) | < 1, that is, when 

|(ci. - 1)^''(1 +n + c./T'| < 1. (21) 

This says that a higher value of the capital stock in period t, k,L,, 
should not boost saving by young members of generation t by more 
than the increase in k,L,. Note that the assumption that goods are 
normal (0 < c)„ < 1) and the assumption that an increase in the 
interest rate reduces current consumption (cr < 0) are insufficient to 
ensure stability. 

Properties of the Autarky Equilibrium 

We can now establish five propiositions about the autarky equilibria of 
two economies that are identical in all respects except for their pure 
rates of time preference. The utility function is given in equation (8). 

Proposition 1. —Given w, and r,+,, a consumer with a higher pure 
rate of time preference will choose a higher value of consumption 
when young, c]. 

Proof. —We want to show that the consumption function c] = c'(u’,, 
rt+i, p) has the property c}, = (dc'/dp) > 0. Differentiating {(1 + 
p)v'(c')/v'[(w — c')(l + r)I} = 1 + r, we obtain 

*The assumptions we have made so far suffice to ensure the exi.stcnre of a long-run 
equilibrium (Carmichael 1979). A (probably overly strong) sufficient condition for the 
uniqueness of the steady-state equilibrium is that preferences be homothctic (Car¬ 
michael 1979). 




The utility function is increasing in c' and c^, so v' > 0. Strict convex¬ 
ity of the indifference curve requires v" < 0. Therefore (dc'Idp) > 0. 

Proposilton 2. — When one starts from a given A, (and therefore a 
given w,), a higher pure rate of time preference is associated with a 
lower value of (and a higher value off/) if a higher interest rate 
df)es not reduce saving. 

Proof .—For the case of the additively separable utility function, 
equation ( 20 ) can be rewritten as:y(A,) — k,f'(k,) — - k,f'(k,), 

= ^(+i(l + «). Thus (dA,+ ,/ap) = {-fi/[l -f- n + f//"(/t,n)]}. 
A sufhcient, but not necessary, condition for {dk,^Jdp) to be negative 
is f) < 0 . 

Proposition 3 .—Given p, a lower value of A, will be associated with a 
lower value of A,+ | if consumption in period 2 is a normal good and a 
higher inteiest rate does not reduce saving. 

Proof. — From (20) we obtain {dki+,/dk,) = {— ((1 — c,V)A,/"(A,)]/( 1 ■+ 
n + Crf"{ktf,)]}. This will be positive iff,',. < 1 and f/ ^ 0. (Note that f/,. 
< 1 f f,. > 0.) 


Propositions 2 and 3 immediately imply proposition 4. 

Proposition 4 .—Consider two economies, identical in all respects 
e.xcept the pure rate of time preference, which is higher for the home 
countiy than for the foreign country. When one begins fiorn any 
common initial capital-labor ratio at t = /,i, the capital-labor ratio of 
the high-time-preference country will be below that of the low-timc- 
preference country for all t > /„ if consumption in period 2 is a normal 
g(»od and a highci interest rate does not reduce saving. 

Pioposition 4 tells us something about the behavior of the twt) 
economies were they to start from the same initial condition. It might 
still seem possible that if the high-pure-rate-of-time-preference 
country were to start off with an initial capital-labor ratio, k,^, 
sufficiently above the initial capital-labor ratio of the low-pure-rate- 
of-time-preference country, A,„, k, could remain above k, indefinitely. 
To show that if the two paths of the capital-labor ratio {A,} 

and {A,} will cross, it is necessary to show that the steady-state 
capital-labor ratio of the high-time-prefercnce country. A, is less than 
the steady-state capital-labor ratio of the low-time-preference coun¬ 
try, A, and that the model is globally stable. Proposition 5 states the 
conditions under which the .steady-state capital-labor ratio is a de¬ 
creasing function of the pure rate of time preference. Sufhcient 
conditions for glcjbal stability are given in Carmichael (1979). After 
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the two paths cross for the first time, they never cross again because of 
propositions 2 and 3. 

Proposition 5. —Under autarky, the country with the higher pure 
rate of time preference will have the lower steady-state capital-labor 
ratio if the model is locally stable, consumption in Itoth periods is a 
normal good, and an increase in the interest rate lowers consumption 
in period 1 

Proof. —Consider again the consumption function c\ = r,+,, 

p). From proposition 1 we know that c*. > 0. We find the effect of a 
higher value ofp on steady-state A from/(A) — ^ ’(A) — c' = A(1 + n). 
Differentiating this yields; (dkidp) = Cpl(c,‘- — 1)A/" - {elf" + 1 + 
n)] '. With ci > 0, (dA/dp) < 0 requires (f,V — \)kf" — (clj" + 1-I- n) < 
0. This is ensured by stability (eq. [21]) if 0 < c„, — I and cj =s 0. 

III. The Two-Country Equilibrium 

'['he two countries that will be linked together in an international 
commodity market and an international capital market are identical 
in all respects, except for the pure rate of time preference. The home 
country is assumed to have a higher rate of pure time preference than 
the foreign country. For the home country, we adopt the noiaiion 
developed for the autarkic economy. All variables associated with the 
foreign country are cli.stinguislied by a tilde above the relevant vari¬ 
able. (Comparing the two countries, we have: 


X' = X', 

( 22 a) 

A=/. 

( 22 b) 


( 22 c) 

/ _ / 8 

( 22 d) 

p> p. 

( 22 e) 


Output can be moved costlessly Irelweeii the two countries. 

In this "pure-absorplion” model,® international trade and interna¬ 
tional lending and borrowing (international capital nutbility) are part 
and parcel of the same transaction. In a one-commodity inotlel, the 
only way to pay f or an extra unit of output today is with a promise of 
future output. F.ach trade balance transaction has to involve ciedit. 


’ .Since only Icxral slability o assumed, proposinoii .■> is valid (inly tor small dif fei cnees 
tiftwecn (he two ciiuiitiies' pine rales of lime prelcreiue 

" The inixlel is ea.sily generalized to include scale dillerences, as long as the natural 
rale of growth m Ixrih countries is the same. 

* 1 owe this description to Bill Branson. 
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There can be no current quid pro quo. We make the convenient 
assumption of perfect international mobility of financial capital; 
Ownership claims to domestic and foreign real capital are perfect 
substitutes in private portfolios in both countries. This means that 
interest rates will be equalized in the world economy: 

r, = ft for all t. (23) 

There is a single perfect world financial capital market and a single 
perfect world market for the current production of the homogeneous 
commodity that can be used as a consumption good or a capital 
good.*® With international capital mobility, it is essential to distinguish 
between the capital stock used in production in the home country, Ki, 
and the value of claims on real capital, domestic or foreign, owned by 
domestic residents. A,. In the national income accounts the corre¬ 
sponding distinction is between domestic income or product and 
national income or product. Ltxration and ownership of physical cap¬ 
ital no longer coincide. From interest equalization, free trade in capi¬ 
tal goods, and identical, linear homogeneous production f unctions, it 
follows that capital-labor ratios and wage rates are equal in the two 
countrie.s: 

k, = Ti„ (24a) 

Wt = Wt. (24b) 

The behavior of the integrated world economy is given by the op¬ 
timizing behav ior of domestic and foreign households, factor market 
etjuilibrium conditions, and the world capital market equilibrium 
condition. Domestic household behavior is given by equations (1) (or 
its additively separable version [8]), (2), and (3). Foreign household 
behavior is represented by the same functions, with c\, it, and p 
substituted for r}, cj, and p, respectively. Factor market equilibrium in 
both countries is given by (4) and (5), the world capital market equilib¬ 
rium condition by 

“'f ~ + «)• (25) 

Openness is reflected in the equalization of interest rates and of 
marginal products of capital. Equation (25) states that world saving in 
fieriod t, (w, — c})£,, + {w, — c,)L„ equals the world capital stock in 
period t + 1, Ar,+, K,+t = /!,+, -t- d,+). 

The individual optimization programs yield, in the additively 


When the short-run gams from international economic integration are considered, 
It IS assumed that the preexisting national capital stocks cannot be relocated. Newly 
produced capital gixxts can be located costlessly in cither country. 
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separable case, equation (9') as the first-order condition for the home 
country. Replacing c\, cj, and p by c}, cf, and p, (9') also represents the 
first-order condition for the foreign country. Given the predeter¬ 
mined value of A, (= k,), the model determines the momentary equilib¬ 
rium values of cl. cf, cl, cf, w,. r,+,, and A(+,. 

The stability conditions for this world-economy model are very 
similar to those for the closed economy given in (21). As in the closed 
economy, we can solve the first-order conditions and household 
budget constraint for ej as a function of w,. r,+,, and p, and for cl as a 
function of Wt, r,+,, and p. As in the autarky case, we can now obtain a 
first-order difference equation in k,: 

f{k,) - - I {c^[f(k,) - p] 

c'[f{kt) - p]} = A,+ ,(l -I- n). (26) 

This model will be locally stable if 

I (ch, + cl- 2 )A/"[ 2 (I + n) (cl. -H c')/"]- | < 1 ." (27) 

The interpretation of this stability condition is the same as in the 
closed-economy case. Again, the assumptions of normal goods and a 
nonnegative effect on saving of an increase in the interest rate do not 
suffice to establish stability. Note that, when tastes are identical, the 
model reduces to the closed economy. All international trade and 
international borrowing and lending are due to taste differences. 

Combining the rate of time preference with finite-lived individuals 
in a Samuelson-Diamond type of overlapping-generations model may 
be the only analytical way to introduce different pure rates of time 
preference in models with international trade in goods and financial 
claims. If an economy consists of infinite-lived individuals who 
maximize the sum of the infinite stream of utility (or consumption) 
discounted by their rate of time preference, then in the steady state 
the interest rate equals the pure rate of time preference because 
consumption does not vary over time. Therefore, if the world consists 
of two such economies with different rales of time preferences, there 
cannot be a steady state with international mobility of financial capital. 
Differences in the rates of time preferences imply differences in 
steady-state interest rates, as in Stigliiz (1970), whose model indeed 
does not have financial capital mobility, while financial capiml mobility 
implies an identical steady-state interest rate. The use of the 
overlapping-generations model solves this problem since, even in the 

" Using the more general utility functions u(r',rf) for the home counti v and u(r,‘, fj) 
lor the foreign country, we can solve for cl and c,' as functions of in, and r„, The 
stability analysis carries through unchanged for this more general case. 
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Steady state, first-period consumption need not equal second-period 
consumption (even though each generation’s lifetime consumption 
profile is the same), and thus the interest rate need not equal the rate 
of time preference.*^ 


The Balance-of-Payments Accounts 


In a two-country world, it suffices to analyze the balance-of-payments 
accounts of one of the countries. Without loss of generality, I focus 
on the home country: 

B/ ^ -Bo (28a) 

and 




(28b) 


As there is no official settlements balance, the current account surplus 
(deficit) of a country is identically equal to its capital account deficit 
(surplus). 

The i)alance-of-trade surplus is the excess of domestic product over 
domestic absorption. Domestic absorption is the sum of consumption 
and domestic capital formation: B, = L,f(k,) - c]L, - cfL,-, - {K,+ , - 
K,), or 

l>, = J(h,) - c] - - [( 1 + n)A,+, - A,]. (29) 


Tlie current account surplus is the excess t)f natittnal product over 
domestic absoiption. National product equals domestic product plus 
net Idieign investment income. Ifd, is the wealth of the home country 
at the beginning of period t, net claims on the rest of the world are 
— K, and net foreign investment income r,(Ai — K/). The country’s 
wealth at the beginning of period t consists of the accumulated saving 
of the members ol generation t - A, = (Wr-i ~ f'-i)T(-i, or 


a, = 


XV, 


f('-i 


\ A n 


(30) 


National wealth can, and with different pure rates of time preference 
will, differ from the value of the dome.stic capital stock because of the 
scope for international borrowing and lending. There is a presump¬ 
tion that a nation consisting of people with high rales of lime prefer¬ 
ence will lend to be a net foreign borrower. The truth of this pre¬ 
sumption will be investigated in what follows. 


I am indebted to an anemymous referee for making this point. 
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The current account surplus of the home country is given by G, = 
Lifikt) +■ — Kt) — CfLi — ci-tL)-, — (Ki+i — Ki), or 

gt =f(kt) + rt(a, ~ k,) - c} - - - - [(1 + n)Ai+i - *(]• (31) 

With constant returns to scale in production and marginal produc¬ 
tivity factor pricing,/(i,) = w, + r,k,. Equation (31) can therefore be 
rewritten as: 

gt = w, + r,a, - d - - [(I + n)k,+, - A,]. (31') 

l -r n 

Equivalently, the current account surplus can be viewed as the net 
foreign investment by the home country. Net foreign investment is 
the excess of domestic wealth accumulation (saving) over domestic 
capital formation, that is, G, = d,+ , — A, — (K,+i — Kt), or 

g, = (1 + n)a,+, - a, - [(I + n)A,^, - k,].'-' (32) 

Steady-State Equilibrium 

The steady-state ctjuilibrium of the two-country model is given by 
(IT) and (12) for the home country and their counterparts for the 
foieign countiy, the factor market equilibrium conditions (13) and 
(14), the slationai y world capital market equilibrium conditions (33), 
the definition of home country net worth (34), and the l)alaiice-of- 
paymcnts equations (35), (35'), (36), and (36'): 


IV — ^(f * + f *) = k(\ 4- }}), 

(33) 

IV — c' 
a = —j-, 

1 + n 

(34) 

b - f(k) — f ' — —— - nk. 

(35) 

h = (n -- r)(a — k). 

(35') 

J(k) + r(a - k) - c' ~ 1 + ,i ~ 

(36) 

!1 

1 

(36-) 


Neither the steady-state trade balance nor the steady-state current ac- 

= (w, - ei)l.,\ «,+ , =.■<,+,//.,+ ,; =.-/Kl + r,) orff-,/(I + n| = fl,(I + 

O), w, + r^i, - f,' - T-i(I + n) ' = (1 + n)«i+i ” a,- 


* 
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count balance need be in equilibrium. As regards the current account 
balance, this is the result of a nonzero rate of populadon growth and 
different tastes. If, for example, the home country is a net lender to 
the rest of the world (a > A), steady-state equilibrium requires a 
current account surplus sufficient to maintain the real per capita 
value of net claims on the rest of the woVld, that is, a surplus of n(a — 
k) (eq. [36']). A steady-state current account surplus will be associated 
with a steady-state trade balance surplus if the interest rate is less than 
the rate of population growth, with a steady-state trade balance deficit 
otherwise (3.6’). 

Because the real wage and the interest rate are equalized through¬ 
out the world economy, the present discounted value of lifetime 
resources and lifetime consumption is equal for residents of both 
countries. If and only if the two countries have identical lifetime 
consumption paths (c‘ = c‘ and c* = c*) will there be no net foreign 
lending or borrowing. Each economic agent “lives within his means” 
in the sense that the present discounted value of lifetime resources 
(period 1 labor income) equals the present discounted value of 
lifetime consumption. Still, except when tastes are identical or popu¬ 
lation is stationary, one country will be steadily accumulating claims 
on the other. 

We next establish the following three propositions about the 
steady-state equilibrium of the world economy. 

Proposition 6. —The country whose residents have a higher value of 
consumption when young will run a steady-state current account 
deficit. 

Proof. —From (34) we know thata = ((tc — c‘)/(l + «)] andii = [(ti^ — 
c')/(l + m)1. Thusc' > c* implies a < a. Rewrite (33) as«—A-fa —A = 
0. Therefore a - A= — (a—A)<0. The current account surplus isn(a 
— A) < 0. This proposition, unlike the following two, does not depend 
on the additively .separable specification of the utility function. 

Proposition 7.—The country with the higher pure rate of time pref¬ 
erence has a steady-state current account deficit. 

Proof. —From proposition 6 we know that it suffices to show thatp 

> p implies c‘ > c‘. This was shown in proposition 1. 

Proposition 8. —The common steady-state open-economy capital- 

labor ratio lies between the two autarky capital-labor ratios if the 
model is stable under autarky and if the conditions of proposition 5 
are satisfied. 

Proof. —L.et A* be the common open-economy capital-labor ratio, k,i 
the autarky domestic capital-labor ratio, and kf the autarky foreign 
capital-labor ratio. We again assume that p > p. For brevity we write 
steady-state consumption in period 1 as c'(A,p) and c‘(A,p), with Cp, Cp 

> 0. From the fact that consumption in period 1 is an increasing 
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function ofp, it follows thatc*(fc,p) < Vi[c'{k,p) + c'{k,p)] < c'{k,p). 
To establish the ti^th of proposition 8 we must rank the k'& deter¬ 
mined by the following three capital market equilibrium conditions: 
/(A«) - k„f'(,k„) - c\k„,p) = k„(l + n);/(A*) - k*f'm - V 2 {[c\k*, 
p) + c^{k*,p)]} = A*!! 4- n); and/(*^> - kf-f{ky) - c^(kf,p) = k^{\ + n) 
Assume that k* is below the lower of the two autarky capital-labor 
ratios, kn. In the autarky case, at a value of A below ka such as k*, there 
would be excess demand (i.e., excess saving) in the capital market—if 
the model is stable. A fortiori there would be excess demand if, in the 
home country capital market equilibrium condition, c'{k*, p) is re¬ 
placed by the smaller world-average level of consumption Vi[c'(,k*,p) 
-t- c*(k*, p)]. Thus capital market equilibrium requires k* > k,f. By 
exactly analogous reasoning, it is established that k* < kp. 

Diagrammatically, the stationary equilibrium of the two-country 
model can be represented as in figure 3. Here OF is the stationary 
decentralized consumption possibility locus for the hypothetical aver¬ 
age inhabitant of this two-country world economy, that is, someone 
with a lifetime consumption pattern 54(c‘ -f- c‘), !4(c* + c*). It incorpo¬ 
rates the world capital market equilibrium condition (33) and the 
hypothetical average world citizen’s budget constraint, the sum of the 



Fio. S.—Autarky and opcn-cconomy equilibria for both countries 
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two countries’ private budget constraints. That is, OF represents Yi(c‘ 
+ f') as a function oi !4(<:'■“ + c^), solved from; 

J(k) -kj\k) - |(f' + c') = k(l + «), (37) 

J{k) - kf'(k) - i(c» + c‘) = - -• (38) 

I’hc OF also represents the stationary autarky consumption possibility 
locus for each country individually. 

While (37) and (38) must be satisfied in a full stationary equilibrium 
of the world economy, they are not a complete characterization of the 
stationary decentralized consumption possibility locus. Residents of 
each country must satisfy their budget constraints individually and 
not merely on average, across both countries as in (38). In figure 3, 
the indifference curve of each country is tangent to the common 
world capital market line. The home country, after integration into 
the world economy, consumes at Eh- Its high level of c' is a reflection 
of its high pure rale of lime preference. The f??. is the open-economy 
consumption etjuilibrium of the foreign country with the low pure 
rate of time preference. Average world consumption (and world 
production in both countries) is at Ew. Consistency requites that the 
distance El- — £»- = Ejy — Eh. The home country bortows from the 
foreign country. From (34) and (33) we obtain that a ~ k = '/i[(c‘ - 
c')/(l + «)] < 0. The home country’s debt to the foreign country 
grows at the natural rate of growth. As we have drawn it, E'[f is at a 
capital-labor ratio below the golden rule, that is, r > n. The home 
countiy therefore runs a trade balance surplus. If the equilibrium had 
been beyond E*, the home country would still run a current account 
cfeficit, but with n > r it would now also run a trade balance deficit. The 
autarky equilibrium for the home country is mEH, at a capital-labor 
ratio below the common posttrade capital-labor ratio. The autarky 
equilibrium for the foreign country is at£^’, at a capital-labor ratio 
above the common open-economy capital-labor ratio. 

Corollary. —The steady-state real wage of the capital-exporting 
country is below its autarky real wage and conversely for the 
capital-importing country. This confirm.s the analysis of MacDougall 
(1960), Kemp (1962), and Ruffin (1979). 


Non-Steady-State Behavior 

The behavior of the two-country model outside the steady slate is not 
much more complicated than its steady-state behavior. There is only 
one state variable—the capital-labor ratio. It is easily seen from equa- 
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tion (39) below that the current account and trade account surpluses 
in period t can also be expressed as a function of A,_,. The current 
account surplus of the home country is given by: 


g< 




w-i 

1 + n 


- [(1 + «)A,+ , - k,]. 


(39) 


Will the country with the higher pure rate of time preference run a 
current account deficit throughout the non-stcady-state adjustment 
process, the way it does in the steady state? Corresponding to (39) is 
the current account surplus equation for the foreign country: 

(40) 

1(1 + n)k,,,-kA. 


In this two-country world g, + gi^ 0. Comparing (39) and (40), we see 
that with p > p, c] < c}, c}., < rf^,. and c'f ., > c'f.,. Fhe excess of 
period 1 consumption in the home country over period 1 consump- 
tioti in the foreign country makes for a current account deficit in the 
hoitie country. The excess of period 2 consumption in the foreign 
countiy over period 2 consumption in the home countiy has the 
opposite effect. The home country will run a current account deficit if 
atid otily if 


n(r/ 1 - rt 1 ) 

1 + n 


+ o' ~ c< ^ 


Cf- I - r? 1 

1 + n 


(41) 


When the private sector budget constraints are tised, this can be 
rewritten as 


cl - cj > 


r' — ?' 

I -I- n 


( 41 ') 


With w > 0 this condition is certainly .satisfied in the steady state with c] 
= f'_, and c) — r)_, because c' > f'. Outside the steady state, it will be 
satisfied if the increase in total consumption by the younger geneia- 
tion between period t — 1 and period t in the home country exceeds 
that in the foreign country. This can f)e seen by rewriting (41') as 

L,c\ - , > L,-c] - (41") 

The current account deficit equals the excess of domestic capital 
formation over national saving. This holds for both countries. With 
perfect international capital mobility, A, is the same in Itoth countries 
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in each period. Domestic capital formation will therefore be the same 
in the two countries. The home country will therefore run a current 
account deficit if and only if domestic saving falls short of foreign 
saving, that is, if and only if Lt(w, — c') — — c}-,) < L,{wi — c}) 

- - r?_,). This is equivalent to (41"). While, with p> p, each 

generation in the home country saves less than the corresponding 
generation in the foreign country, national saving in the home coun¬ 
try need not be less in any given period than national saving in the 
foreign country. National saving is the sum of saving by the young 
and dissaving by the old. In the home country the young save less, but 
as a consequence the old dissave less. 

IV. Gains from Participation in the World Economy 

The positive analysis of the determinants of international trade and 
lending can he complemented in a natural way by the analysis of the 
gains from participation in the international economy. When the 
gains or losses from participation in the world economy are evaluated, 
welfare in the world economy will he compared with the welfare that 
would have been achieved under continued autarky. Both short-run 
effects and long-tun effects are considered. 


Skort-Run EJfects 

Short-run effects are the effects on the welfare of the two generations 
alive during the period that the two national capital markets and 
goods markets are integrated. The two countries evolve under au¬ 
tarky up to and including period I — 1. International integration 
occurs in period t. 1 shall consider iti detail only the ca.se in which 
preexisting capital goods cannot be shifted abroad. In this ca.se, inte¬ 
gration of capital and goods markets in period I does not permit the 
owners of capital that already exists at the beginning of period I (the 
savings in period ( — 1 of young members of generation t — 1) to move 
it from country to country. This physical capital was hxed in place in t 
— 1 to be used in the country of its original location. Because the 
physical capital stocks in the two countries in peritxl t are predeter¬ 
mined, both the interest rate faced by the old generation (< — 1) and 
the wage rate faced by the young generation are predetermined and 
unaffected by the international integration. In this case it does not 
matter whether the shift toward openness is anticipated or unantici¬ 
pated. The welfare of the old generation in both countries is there¬ 
fore the same with openness as it would have been under continued 
autarky. The welfare of the younger generation will be higher (lower) 
under optenness than under autarky if the interest rate in period t, 
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IS higher (lower) under openness than under autarky. In figure 1, 
a igher interest rate pivots the budget constraint upward around w,, 
whde a lower interest rate pivots it downward. The quantities {A/*}, 
{At }, and {A,} are the sequences of the capital-labor ratios for the 
home country under autarky, for the foreign country under autarky, 
and for the world economy under openness, respectively. The quan¬ 
tities {r^*}, {r^}, and {r,} are the associated interest rate sequences. It 
is assumed that at the time of integration, t, the capital-labor ratio of 
the home country, , is not above that of the foreign country, kt 

Proposition 9 .—If ki' ^ A?, if consumption in period 2 is a normal 
good, and if an increase in the interest rate increases saving, then Af'+i 
A,+| < k[+, or rf+i < r^+i < 

Proof. —The proof is by contradiction. Assume that r,+ , From 

propositions 2 and 3 we know that t^+, > rf+, . Therefore, if < 0 and 
cl < 0, saving by generation t under openness in both the home 
country and the foreign country is higher than it would have been 
under autarky. Thus the per capita open-economy saving by genera¬ 
tion t in the home country exceeds (1 + n)ki'+,, and the per capita 
open-economy saving by generation t in the foreign country exceeds 
(1 + n)/C(+,. With populations of the same size in both countries, 
average per capita saving by generation t in the world economy 
therefore certainly exceeds (1 + n)AJ{, (rf{, > rf+, implies Afi., < Af+|). 
Average per capita saving by generation t in the world economy 
equals the average per capita capital stock in the world economy in 
period t -f- 1. Therefore A,+ , > Af+|. This contradicts the assumption 
that r,+ | > rf+i. F.xactly analogous reasoning yields the result that A,+ i 

< k{^,. 

The effect of participation in the world economy on the welfare of 
the two generations alive during the period in which integration 
occurs is therefore unambiguous. The welfare of the older generation 
in both countries is unaffected. The welfare of the younger genera¬ 
tion in the high-time-preference country (the home country) is low¬ 
ered, while the welfare of the younger generation in the low-time- 
preference country (the foreign country) is raised. During period t 
capital flows from the low-time-preference, low interest rate country 
to the high-time-preference, high interest rate country, expanding 
the consumption possibility set of the younger generation in the 
low-time-preference country and shrinking that in the high-time- 
preference country. Since in the high-time-preference country the 
older generation neither gains nor loses from integration and the 

'■* This will always be the case either if the two countries started off from the same 
initial condition (proposition 4) or if, from arbitrary initial conditions, the system had 
been evolving under autarky for a sufficiently long period of rime (propositions 2, 3, 
and 5). 
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younger generation loses, a move toward integration will always be 
vetoed by the residents of the high-time-preference country if they 
vote according to their preferences—unless an appropriate transfer 
from the low-time-preference country is forthcoming.*® 


Long-Run Effects 

It is not possible, except in special cases, to make unambiguous wel¬ 
fare comparisons by comparing steady-state utility levels—the rank¬ 
ing of the stationary utility levels may not be the same as the ranking 
of the utility levels achieved during the transition from one steady 
state to another. Nevertheless, the stationary utility levels achieved 
under autarky and openness are of interest. They are analyzed next. 
There are several cases to consider, illustrated in figures 4-9. 

Case 1 .—The autarky capital-labor ratios of both countries are 
above the golden-rule capital-labor ratio. In hgjures 4 and 5, the 
autarky equilibrium of the home country is atfilP^, that of the foreign 
country atEf^ (proposition 5). The golden-rule capital-labrrr ratio is at 
E*. The open-economy consumption equilibrium of the home coun¬ 
try is at Eh, that of the foreign country at World production 
equilibrium is at £5, at a capital-labor ratio between the two closed- 
economy capital-labor ratios. The foreign country (the low-time- 
preference country) has a higher stationary level of utility under 
openness than under autarky (figs. 4 and 5). Two factors contribute to 
this. First, the open-economy capital-labor ratio, while still in the 
inefficient region, is lower than under autarky. The extent of overac¬ 
cumulation of capital is reduced, and the total amount of resources 
available for consumption is increased. Second, the foreign country is 
a net lender at an interest rate higher than the autarky interest rate. 
The home country (the high-time-preference country) can have 
either a lower (fig. 4) or a higher (fig. 5) stationary level of utility 
under openness. The degree of overaccumulation is increased by 
international integration. Against that, the home country is a net 
borrower at an interest rate below the autarky interest rate. 


'* If preexisting ctipital goods tan be shifted abroad, capital goods accumulated 
during period t - I out of the saving of young members of generation 1 — 1 can be 
shifted between countries so as to establish a common interest rate, r„ and marginal 
product of capital/'(*<) = r, for period t. It now makes a difference whether the move 
toward openness in period t was anticipated in period f — 1. If perfect foresight prevails 
and if , « kjii, it is easily shown that the old generation alive dunng the transition to 
openness (generation t - 1) will be worse off under openness than under continued 
autarky in the home country, better off in the foreign country. The effect on the 
welfare of the young generation (/) is ambiguous. It is therefore no longer necessarily 
the case, as with nonshiftable capital, that the high-lime-preference home country will 
always veto a move toward openness. 



OVERLAPPING-GENERATIONS MODEL 


79' 



Fk'.. 4.—l/jng-run -welfare tompanson; b»lh autarky tapilal-laboi ratios above the 
golden rule. 


Case 2. —'I'he autarky capital-labor ratios of both countries are 
below the golden-rule capital-labor ratio. I his case is illustrated in 
figures 6 and 7. I he high-time-preference country (the home coun¬ 
try) has a higher stationary utility level under openness than under 
autarky. Under openness, the capiLal-labor is higher but remains 
below the golden-rule level. The total amount of resources available 
for consumption is therefore increased. The country also is a net 
borrower at an interest rate lower than the autarky interest rate. The 
low-time-prefercnce country (the foreign country) can have either a 
lower (fig. 6) or a higher (fig. 7) stationary level of utility under 
openness. Its capital-labor ratio is reduced in the efficient region, 
lowering the total amount of consumable resources. Against that, the 
foreign country is a net lender at an interest rate higher than its 
autarky interest rate. 

Case 3 .—The autarky capital-labor ratios are on opposite sides of 
the golden rule. Two subcases need to be distinguished. 

Case 3a .—The open-economy capital-labor ratio is above the 
golden-rule capital-labor ratio. In this case the analysis for the low¬ 
time-preference country (the foreign country) is the stune as in case 1; 
Its stationary utility level is unambiguously higher under openness 
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Fk.. 5.—Ixiug-run welfare comparison; both autarky capital-labor ratios above the 
golden rule. 



Kio. 6.—Long-run welfare comparison: both autarky capital-labor ratios below the 
golden rule. 
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Fig. 7.—Ixjng-run welfare compari.son: both autarky capital-labor ratios below the 
goldea rule. 


than under autarky. The stationary utility level of the high-time- 
preference country (the home country) can be either lower (fig. 8) or 
higher (fig. 9) under openness. There is a stronger presumption 
toward a higher utility level in this case than in case 1, when the 
autarky capital-labor ratio of the high-time-preference country was 
above the golden-rule level. In case 3a the of>en-economy capital- 
labor ratio at Ew, while in the inefficient region, may still correspond 
to a stationary volume of consumable resources above that corre¬ 
sponding to E'f^, which is below the golden rule. If it does, will 
certainly correspond to a higher stationary utility level than be¬ 

cause in addition the high-time-preference country is a net borrower 
at an interest rate below the autarky interest rate. 

Casf 3b .—The open-economy capital-labor ratio is below the 
golden-rule capital-labor ratio. The analysis for the high-time- 
preference country (the home country) is now the same as in case 2: 
Its stationary utility level is unambiguously higher under openness 
than under autarky. The low-time-preference country's stationary 
utility level can be either higher or lower under openness. No dia¬ 
grams are provided as they can easily be constructed by analogy with 
figures 8 and 9. 

An important clue to these welfare comparisons is provided by the 
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Fig. 8.—l-ong-nin welfHre comparison, the autarky capital-labor ratios straddle the 
golden rule. 


fact that international trade and financial mobility do not expand the 
consumption po.ssibility set of the “average” world consumer under 
openness beyond that available to the individual countries under 
autarky. 1 he OF in figures 3 and 4 is the stationary, decentralized 
consumption possibility locus for each of the countries under autarky 
and for the integrated world economy. The same applies in the short 
run: There are no potential gains in production from specialization. 
The only difference made by openness is that a worldwide capital 
market is substituted for two national capital markets. The equilib¬ 
rium in this worldwide capital market will be an average of the two 
autarky equilibria in the national capital markets. 


V. Conclusion 

Diamond’s overlapping-generations model is extended to a two- 
country world. A number of results are obtained. The country whose 
residents consume more in the first period of their lives (at a given 
wage rate and interest rate) has a steady-state current account deficit 
if the rate of population growth is positive. The country with the 
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Fn;. 9.—Long-run welfare comparison, the autarky capital-labor ratios straddle the 
golden rule. 


higher pure rate of lime preference has a steady-state current account 
deficit. The common stationary open-economy capital-labor ratio lies 
Ijetween the two autarky capital-labor ratios, with the lower autarky 
capital-labor ratio associated with the country whose residents have 
the higher pure rate of time preference. Outside the steady state, the 
country with the higher pure rate of time preference will not neces¬ 
sarily be the one to run a current account deficit. If preexisting capital 
is not shiftable between countries, the welfare of the old generation 
alive when the two autarkic countries join in an open economy is 
unaffected by this international economic integration. The welfare of 
the young generation in the country with the high pure rate of time 
preference is lower under openness than under autarky, while in the- 
country with the low pure rate of time preference it is higher. 

The comparison of stationary utility levels under autarky and 
openness yields two unambiguous results. If both the autarky and the 
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open-economy capital-labor ratios of the low-time-preference country 
are above the golden-rule level, its stationary utility level will be 
higher under openness. If both the autarky and the open-economy 
capital-labor ratios of the high-time-preference country are below the 
golden-rule level, its stationary utility level will be higher under open¬ 
ness. If the first of these conditions holds, the move toward openness 
can be said to constitute a potential Pareto improvement for the 
low-tiine-preference country, because it always gains in the short run. 
Even if the second condition holds, openness cannot be said to be a 
Pareto-superior regime for the high-time-preference country, as it 
always incurs a welfare loss in the short run. 

The open-economy version of the overlapping-generations model 
can be extended in a number of directions. Within the context of the 
one-commodity model, budgetary and financial policy can be studied; 
this includes public sector lending and borrowing, taxation of wage 
and property income, social security, etc. The two-commodity struc¬ 
ture of the Heckscher-Ohlin model can be grafted onto the 
overlapping-generations framework.'* This allows potential gains 
from trade through specialization in production. If one of the goods 
is a capital good, the distinction between international mobility of 
financial capital—international lending and borrowing—and inter¬ 
national trade in capital goods becomes important. One could have 
either or both. Phe two-commodity structure permits the analysis of 
tariffs, quotas, and other trade policies, familiar from the static 
Heck.scher-Ohlin model, in addition to the financial and budgetary 
policies that can be studied in the one-commodity overlapping- 
generations model. Finally, uncertainty can be introduced into the 
model—most interestingly into the two-commodity version. Without 
futures markets for commodities, households have to make labor 
supply and saving decisions while young, without knowing the future 
spot prices of commodities with certainty. The scope for further 
developments appears to be considerable. 
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Mobility Costs, Frictional Unemployment, and 
Efficiency 


Peter A. Diamond 

Massarhmetti IrLsUtutr uj Teihnology 


With iinpeifect job information flows, it is plausible that the distri¬ 
bution of job offerings becomes more attractive when there are more 
vatanc'ies and more unemployed. With word-of-mouth communica¬ 
tion, this condition is derived. Given this condition, steady-state 
etjuilibrium is not ef'bcient, with welfare increased by the introduc¬ 
tion of unemploymetit compensation even though all agents are risk 
neutral. In this way workers become more .selective in the jobs they 
accept. 


Moving and training co.sis play a .significant role iti job-taking deci- 
stons.' Even if workers were equally productive in all jobs, these ctists 
would make it sometimes worthwhile for an unemployed worker to 
refu.se a job offer while waiting for a more attractive offer. The rate at 
which workers are offered jobs with dilfererit moving costs depends 
on the decisions of other workers as to which jobs to refuse. This 
externality implies that equilibrium will not generally be elficiciit. 
With the plausible assumption that job offerings become more attrac¬ 
tive on average when the number of available jobs increases, efficiency 
increases when workers are induced to pass up jobs with relatively 
high moving costs (by imeiiiploymerit compensation, e.g.). 

These results are derived in a model of steady-state search equilib¬ 
rium similar to one that has been used elsewhere (see Diamond and 
Maskin 1979and in press; Diamond 19806,1980c; Mortensen.in press). 


Comments from Joel Ycllin and a referee, research a.ssi.sianre by Drew Kudenberg, 
and financial support from NSF are gratefully acknowledged. 

' For a discu-ssion of the role of employment in miibility decisions, see Bartel (1979). 
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Earlier analyses have focused on decisions which affect the rate at 
which workers receive job offers, taking as given the distribution of 
the quality of job offers. Here the arrival rate of job offers is taken as 
given, with the quality distribution endogenous. It is assumed that 
there is a fixed-coefficients technology. Workers and firms are taken 
to be risk neutral and to face a common exogenous interest rate. Both 
job termination and the arrival of new job offers are modeled as 
Poisson processes with fixed parameters. It is assumed that the wage 
and sharing of moving costs are negotiated,^ with workers and firms 
equally good bargainers. The steady-state unemployment level is then 
determined by the job-taking decisions of workers. More stringent 
standards for job taking raise the vacancy rale, which in turn im¬ 
proves the distribution of job offers. Thus there can be multiple 
steady-state equilibria. From any steady-state equilibrium, inducing a 
permanent further decrease in the moving costs which workers are 
willing to bear raises the present discounted value of output in the 
economy. The optimal unemployment compensation benefit is de¬ 
rived in Section VI. Presentation of these results takes as given the 
relationship between the unemployment rale and the quality of job 
offers. In Section VII, a simple model of job-information flows is 
presented to derive an example of tbe way higher job availability 
might improve average job-offer quality. An example with an expo¬ 
nential distribution of moving costs is examined in detail in Sections 
VllI and IX. The example results in optimal benefits in the 
neighborhood of 60 percent of the wage. 

I. Employment 

Assume that all jobs are the same, with a fixed-coefficients technology 
that results in a How of output, y. All jobs are subject to a risk of 
termination at the constant breakup rate b. This represents exogen¬ 
ous factors such as a transfer of the worker or the job opportunity to a 
different location. We ignore the possibility of the worker and the job 
moving together. We also ignore endogenous reasons for job termi¬ 
nation, such as quitting for a better job or laying of f workers to hire 
different workers. 

At initial employment, the wage is negotiated, as is the sharing of 
setup costs. For simplicity we shall assume complete symmetry lie- 
tween workers and jobs (including equal numbers) implying that the 
net gain from commencing production is shared equally between 


“ Nej^otiation is assunied lo Ijc in.stantaneous, ruling out one route b\ which the 
presence o1 other workers alfecis a joh seeker—that o1 waiting while a decision is made 
about employing a different worker. 
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worker and employer. In the absence of labor disutility, capital user 
cost, and unemployment compensation, this implies equal sharing of 
the flow of output and so a wage equal toy 12. Workers are assumed to 
be risk neutral and to face a constant interest rate r. With these 
assumptions, it is appropriate to focus attention on the expected 
present discounted value of earnings less moving costs. Denote by Wn 
and Wi, the expected discounted value of earnings less moving costs 
for employed and unemployed workers, respectively. With complete 
symmetry assumptions about both finding and losing jobs, with an 
infinite expected life, and with analysis restricted to the steady state, 
Wt: and W,; will not vary over time or workers. 

For an employed worker, the rate of interest times expiected earn¬ 
ings equals the wage less the expected capital loss from Job termina¬ 
tion;^ 


rWt: = y/2 - b(Wf: - Wa). (1) 

The next step in the analysis is to consider job taking and so the 
determination of Wn. 


II. Job Taking 

Assume that unemployed workers learn about job opportunities with 
an exogenous"* arrival rate, a. We assume no search costs and no 
decisions of an individual which affect this arrival rate. While the 
productivity of all jobs is taken to be the same for all workers, jobs 
differ across workers in the setup costs before production begins.' 
These costs reflect moving costs when workers must relocate to take 
new jobs and specific training costs before production can begin. The 
pattern of a fixed cost followed by constant output can be viewed as an 
approximation to the increased output that comes with on-the-job 
learning. We ignore the variation among jobs in commuting costs 
since these represent a variation in flow benefits rather than setup 
costs. 

Denote by G(c,m) the distribution of setup costs associated with jobs 
an individual learns about when the unemployment rate is u. A higher 
unemployment rate (and higher vacancy rate) is assumed to improve 
the distribution of moving costs in the sense that G«(c,m) > 0 where 


“ Considering a worker as an asset, eq. (1) is slated as the rate of return limes the 
value nf an asset equals the flow return plus expected capital gain. It ran be derived 
alternatively from explicit calculation of the present value of expected returns as in 
Diamond (1980A) or as the limit of a discrete time process; Wg = (1 + rA()’ '[y^t/2 + 
bMW,. + (I - b\lWl:l 

^ Below, we will allow a to vary with the unemployment rale. 

' For a detailed analysis of individual choice with setup costs, see Loikkanen and 
Pursiheimo (1979). 
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G(c,u) is positive and less than one.* In Sections VIl and VIII we 
derive this assumption from specific models of information flows. We 
take the distribution to be constant over time for a given worker. With 
symmetry between workers and jobs, setup costs are assumed to be 
equally divided between worker and job. If an unemployed worker 
accepts any job with a setup cost less than c*, then, in the absence of 
unemployment benefits, we can write the expected discounted value 
of net earnings implicitly as 

rW,j = af (Wt-Wu-c/2)dG(c,u). ( 2 ) 

Jo 

That is, the interest rate times the expected value of earnings equals 
the expected gain from job taking, less setup costs.^ 

The choice problem for the individual worker is the selection of c* 
to maximize W,/. Naturally, this involves accepting any job for which 
the setup cost is less than the expected gain from job taking:* 

c* = 2(fVf - + aj^ cdGj/[r + b + aG(c*,u)l, (3) 

where (1) and (2) have been solved to give the implicit equation for c*. 
Workers are more willing to bear setup costs when output is greater, 
job finding is more difficult, the interest rate is lower, expected job 
duration (t"') is longer, or the unemployment rate is lower. That is, 
from implicit differentiation of (3) we have dc*/dy > 0, dc*/da < 0, 
dc*/dr < 0, dc*ldb < 0, and dc*/du < 0. 

III. Equilibrium 

In steady-slate equilibrium, the aggregate rate of job finding must 
equal the rate of job losing. Denoting the unemployment rate by «, 
this gives us 

b(l — u) = auG{c*,u). (4) 

That is, the job breakup rale times the proportion employed equals 
the job acceptance rate times the proportion unemployed. Since Gu is 
assumed positive, we havec* decreasing with u. One would not gener¬ 
ally exftect to find the rate, a, at which workers learn of potential jobs 


* To interpret this assumption one needs to compare alternative economies with 
different steady-slate employment and vacancy rales, it is not appropriate to consider a 
single economy over a business cycle, since in that case a nse in unemployment is 
accompanied by a decline in vacancies rather than a move in the same direction. 

' As a discrete lime process, we would have W„ = (I + rAt)'‘(aA/{fi(f*,u)B'j + (1 - 
G{c*.u)]W^, - + (1 - aM)Wi,). 

* Eq. (3) comes from joint wealth maximi/ation of hrm and worker and docs not 
depend on the equal sharing rule. 
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to be independent of the unemployment and vacancy rates (which are 
equal by assumption). We would expect a to increase with u across 
steady states. Since the implications of this relationship have been 
explored elsewhere, we take a to be exogenous here but note the 
implied differences in footnotes. Similar considerations hold for b. 

Since c* decreases with u in both (3) and (4), we have the possibility 
of multiple steady-state equilibria, that is, multiple solutions to (3) and 
(4). When more jobs are available (higher u), anticipated mobility 
costs are lower (higher G) and individuals are more selective in the 
jobs they take (lower c*). Greater selectivity by workers, in turn, raises 
the unemployment rate. This is shown in figure 1, where equations (3) 
and (4) are drawn. 

We note that an individual will not accept a job with an expected 
return below .setup costs. Thus the chosen cutoff c* is less thany/(r -t- 
b). Even if expected future setup costs are close to zero, the potential 
loss in forgone wages implies that the chosen cutoff c* will never be 
le.ss thany/(r + h + a). If all offered jobs are accepted, the equilibrium 
unemployment rate is b/(a -F b). If none is accepted, the unemploy¬ 
ment rate goes to one. 

IV. Steady-State Output 

All steady-state equilibria lie along the curve given in (4). At any point 
on this curve we can calculate the steady-state output level per person. 
There are 1 — u employed per person giving a gross output flow per 
person of (1 — u)y. Moving costs per person equal the rate of new job 
taking per person of auG times the average cost of moving,/^ cdGIG. 
Thus net output, Q, satisfies 

Q = (1 — u)y — au J cdG. (5) 

Differentiating with respect to m, with c* given as an implicit function 
of u by (4), evaluating at an equilibrium where (3) holds, and using 



c* 

[y -FoJ^cdG]/[r4b-t-aG] 
0 
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integration by parts, we have 


dQ _ 
du 


-y - 


f'" 

a j cdG — auc*g 


dc* 

du 



= —rc* + au 



( 6 ) 


where g = G^. The second term reflects the externality in lower 
moving costs of a higher unemployment rate. The first factor reflects 
the absence of discounting in steady-state comparisons. 


V. Dynamics 

The economy analyzed here cannot move directly from one steady 
state to another. While it is interesting to compare alternative steady 
states which might have occurred, proper policy analysis requires 
consideration ol' the comparative sUitics of the actual path of the 
economy. We shall analyz.e the effects of a policy which controls c* 
directly. That is, we consider an economy where job acceptance be¬ 
havior is sufficiently closely monitored to make it a government con¬ 
trol variable. Below we will examine how unemployment compensa¬ 
tion can be used to induce the .same steady-state equilibrium without 
monitoring job acceptance behavior. These two modes of control 
correspond to stylized versions of the German and American econo¬ 
mies. 

Unemployment grows by job terminations and declines by job ac¬ 
ceptances: 

ii — h(\ — u) — auG(c*,u). (7) 


Starting at a steady-state equilibrium given by (3) and (4) we shall 
calculate the change in the present discounted value of aggregate net 
output, W, from a permanent differential change in r*. with unem¬ 
ployment given by (7). 

That is, we want to calculate the derivative with respect to c* of 



(1 — u(<)]y “ 



rdG[c\ u 



( 8 ) 


where u{t) satisfies (7) with an initial condition satisfying (3) and (4). 
Calculating this derivative (see Diamond 1980fi), we have 

dW _ - auc*g 

dc* r 
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dW _ / aug \ 

(r + ^ 4- au 1 egude 

r* I" ^ 

dc* \ r j 

r + b + aG - 1 - auGu 


- cg^dc^ ( 10 ) 

r(r + h + aG + auGu) 



Thus a government policy to move the economy out of a steady-state 
equilibrium in the direction of higher unemployment raises efficiency 
by improving the average quality of job offers.'® 

VI. Unemployment Compensation 

There are several inodes of intervention which will raise the equilib¬ 
rium unemployment rate and so the efficiency of this economy. By 
increasing the importance of adjustment costs relative to the financial 
gains of employment, the government can induce greater selectivity 
in job acceptance. Thus, taxing output without allowing a deduction 
for adjustment costs or subsidizing unemployment or vacancies will 
have the desired end. Of these we shall analyze unemployment com¬ 
pensation.^In addition to making workers more selective in accepting 
jobs, unemployment compensation raises the wage by one-half the 
unemployment compen.sation benefit given the negotiation assump¬ 
tions we have made here. 

From the perspective of the worker and employer, the net gain 
from production isy ~ B, whereB is the unemployment benefit. With 
this net gain shared equally, the firm receives 0.5(y — B) and the 
worker receives a payment for forgoing unemployment compensa¬ 
tion ofB in addition to half the net gain. 0.5(y ~ B). Rewriting the two 
value equations (1) and (2), we have" 

’ 11 <2 IS a function of u, (10) becomes 

dIV _ -au‘fr 1_ /o (oGn + a'G)dc _1 

fle* r [ r + fr + aG + auG. + a'uG ) 

'“It is important to remember that we are considering policies that increase both 
vacancies and unemployed together. For an analysis of the gain from aggregate de¬ 
mand stimulation, see Diamond (1980c). 

" To derive (11) iormally, we need first to reconsider the equal-sharing rule above. 
That was derived from equations for fVf, and fVu, similar equations for the values of 
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rWe = 0.5(y + B) - - fVe,) 

rlVc, =B + a r {W^ - JVy - cl2)dG{c,u). 

Jo 


Then, the chosen cutoff cost satisfies 


= 2(We - Wv) = {y - B + aj^ cdG^ir + b + aG). 


( 12 ) 


Increasing the unemployment benefit by $1.00 decreases the cutoff 
level of costs selected at a constant unemployment rate by (r + fc + 
aGy' dollars. The induced increase in unemployment magnifies this 
effect. 

Setting equation (9) equal to zero, we can derive the cutoff cost level 
which would be optimal in a steady state. That is, an optimal trajectory 
will, asymptotically, have the cutoff cost level, c*, satisfying'^ 

cdG + au \ cg^ 

- t ^ - V -• 

r + b + aG + auGu 



Solving (12) and (13), we can derive the unemployment compensation 
benefit which holds asymptotically on the efficient trajectory (i.e., 
which supports the asymptotic optimum): 


B* 


(^y+aJ^cdGjiuG u — a«J* cgudc(r + b + aG) 


r b -t- aG + auG . 


(14) 


Substituting from the first-order condition for c*, B* can be written 
alternatively as'^ 


B* — au^c*G„ — J cg^j = auj' G^dc > 0. (15) 

Passing up a job offer alters the trajectory of the economy and so 
generates a pattern of externalities that varies over time. The present 


filled and vacant jobs, and W,., and an equal sharing of surplus value, Wt - W[ = 
Wf, - W,-. With unemployment compensation and equal sharing of setup costs the four 
value equations can expressed in terms of the wage, w: 

rtf’,.. = - KIV, - IV,), 


rIV,- = y - at - - Wy), 

rWr = B + af (IT* - W„ - c/2)dG. 
rfVy = af iWr - fV,- - r/2)dG. 

Combining these with the equal gain rule gives (11). 

'• From (13) and the first line of (6), we noij that dl^/du < 0 at u*. 
If a varies with u, (15) becomes B* — u (oG, + a'G)dc. 
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discounted value of the social gain from accepting a job with setup 
cost c can be derived by differentiating IV, in (8), with respect to u. 
Evaluated at a steady state this gives 


dfV _ _ _ 2 _ 

dti r + b + aG + auG^ 


(16) 


In an efficient equilibrium this social gain must equal the private gain 
— Wi,) — c. Equating the social and private gains, using (12), we 
have an alternative derivation of the asymptotically optimal unem¬ 
ployment compensation. 


VII. Discrete Example 

Having examined the implications of the dependence of the distribu¬ 
tion of adjustment costs of job offers on the unemployment rate, we 
now consider an example of how word-of-mouth communication of 
job availability can generate such dependence.'" In this section we 
consider an example with two locations. Specific training costs are c,. 
There are no moving costs if the job and worker are in the same 
location. There are moving costs if the job and worker are in 
different locations. For the next section we consider an example 
where c is distributed exponentially with a parameter that depends 
linearly on the unemployment rate. 

Assume that employed workers learn of a job opportunity in the 
same Icxation and inform their unemployed friends of its existence. 
Assume that the process of job communication is such that attempted 
communication about any vacancy is a Poisson process with constant 
parameter a. (In practice it is likely that there are more attempted 
communications of new job vacancies than of old ones.) Each poten¬ 
tial communicator knows n workers who could fill this job. Assume 
that each of the n has a probability u of being unemployed and an 
independent probability/? of being in the same l(H;ation. With proba¬ 
bility (1 — m)" all n friends are employed and there is no one to tell of 
the job. If the communicator does have unemployed friends, he only 
tells a friend in the other location if none of his unemployed friends 
are in the same location. That is, he tells the friend to whom the 
information is most valuable. The probability of his telling someone in 
the other location is [(1 - up)" - (1 - u)"]—the probability of no 
unemployed friends with the same location less the probability of no 
unemployed friends. Then 1 — (I — up)" is the probability of telling a 

For previous aiialy.ses of word-of-mouth communication, see Bfxjrman (1975) and 
Satterthwaiie (1979). 
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friend in the same location. The greater the unemployment rate, the 
greater the probability of telling a I'riend in the same location. 

With everyone following the same behavior rule, there are two 
candidates for equilibrium: accept only jobs at the same location, or 
accept any job offer. LetW] and Uj be the unemployment rates under 
these two behavior rules. Then the equilibria satisfy 

A(1 - u,) = au,(l - (1 - «,/>)"] 

6(1 - Uj) = aUill — (1 — Mj)"]. 

For some parameter values both of these equilibria will exist. Rather 
than pursue this example in more detail, we turn to a similar example 
with a continuum of locations, which can be considered to be around a 
circle. It is then assumed that the likelihood of knowing an individual 
is exponentially distributed with the distance to his location and that 
communication goes to the unemployed person for whom the moving 
costs are smallest. 


VlII. Exponential Example: Steady>State Properties 

We assume that c has an exponential distribution with coefficient nu. 
This example is chosen to fit the discussion above since the minimum 
of a random sample of size nu from the exponential distribution with 
coefficient 1 is exponential with coefficient nu. Thus we assume that 

G(c,u) = \ - e-”'"-. (18) 

With this distribution, llie two equilibrium equations (3) and (4) satisfy 

c*(r + 6 + rt) =jy +—(1 - ? (19) 


6(1 - u) = «u(l - e'"'"’)- (20) 

riiere exists a unique solution to this pair of equations with u between 
6/(« + 6) and one and c* between yf(r + h + a) and y/(r + 6). 

In a steady state (whether equilibrium or not), the net output How 
satisfies, from equation (5). 


Q = (1 - w)y - 
= (1 - M)y - 


(171 '[1 — c 

6(1 - u) 
nu 


""•■•(I + WMC*)] 



j 6(1 - w) 

In 

\n) 

au 



( 21 ) 


au 
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The asymptotically optimal unemployment compensation satisfies, 
from equation (15), 


B* = — [1 - •(nuc* + 1)] 
\nu 


_ b(\ — u) 


+ 


nu 


(—) 

\nu I 


1 - 


^.(1 - M ) 


au 


In 


1 - 


HI - u) 
au 


From (21) and (22) we note that 

Q = (1 — w)y — wB*. 


( 22 ) 


(23) 


Alternatively this follows from (15) since, with the exponential dis¬ 
tribution, / uG„ = j eg. From (13) we note that at the steady-state 
optimum the maximal acceptable moving costs satisfy 


JL = 

c* 



(24) 


To find the optimal unemployment rate, we solve (20) and (24) 
simultaneously. 

Some examples are shown in table 1. For these calculations, 
parameters were chosen for a, b, and r. Next an equilibrium unem¬ 
ployment rate in the absence of government intervention, u, was 
chosen. This implied a particular value for the product ny, and the 
remaining calculations were done for this value. In addition to 
steady-state comparisons, the last column of table 1 reports the per¬ 
centage change in the pre.sent discounted value of net output along 
the trajectory from the equilibrium steady slate to the optimal one 
where the fraction of accepted jobs, G, is held constant at its 
asymptotically optimal value. As detailed in the next section, the 
increase in net output along the trajectory exceeds the difference 
between steady states. 

As a guide to interpreting the table, consider the fourth row. In an 
economy where the unemployed receive 10 job offers a year, the 
expected duration of a job is 5 years, the interest rate is 5 percent, and 
the equilibrium unemployment rate is 4 percent, 3 percent of gross 
output is spent on moving costs, workers are just willing to spend 34 
percent of a year’s output on moving, and workers accept 48 percent 
of jobs they hear about. The asymptotically optimal unemployment 
rate is 4.7 percent, which can be induced by unemployment compen¬ 
sation equal to 60 percent of the wage. In this equilibrium workers 
accept 41 percent of job offers representing an 18 percent increase in 
the expected duration of unemployment, and steady-state net output 
is higher than in the no-compensation equilibrium by 0.23 percent 
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with a gain of 0.24 percent along the constant G trajectory from the 
initial equilibrium to the asymptotic optimum. 

The examples show a surprisingly consistent pattern. When the 
equilibrium unemployment rate is very close to b/{a + b), the 
minimum achievable,'’ the optimal unemployment compensation is 
small. As u rises, B*/w rises very rapidly, reaching the neighborhood 
of 60 percent when u is about one percentage point above the achiev¬ 
able minimum. The ratio B*/w stays in the neighborhood of 60 
percent for all calculated values, which included values of u as integer 
percentages up to 10 percent. The same pattern arose for all calcu¬ 
lated values of a, b, and r. 

IX. Continuous Example Dynamics 

The transition from the equilibrium steady state to the optimal one 
shows a larger change in net output than does the steady-state com¬ 
parison. To see this, let us consider a policy of changing the propor¬ 
tion of jobs accepted by an unemployed worker. (We shall also con¬ 
sider the optimal policy below.) If workers are accepting all jobs with 
costs below then the proportion of jobs accepted equals 1 — 

Let us consider the policy which holds this proportion 
constant over time at the level that occurs in the asymptotic steady 
state which we write dsG*. Then the economy follows the differential 
equations 

U (0 = ^[1 - M ( 0 ] - ^ 25 ) 

c*(/) _ - u (t) 

c*(0 u(t) 

We assume an initial condition at time lo of the steady-state equilib¬ 
rium. The immediate effect of the change in policy is to decrease 
aggregate setup costs without changing gross output. Over time the 
unemployment rate rises, decreasing gross output but keeping aggre¬ 
gate setup costs constant. This pattern is shown in figure 2. To 
calculate IT along this trajectory, begin by considering aggregate 
setup costs. These equal 

au(t) I = —{I - + 1]}. (26) 

Jo n 

Since u{l)c*(t) is constant, we have 

12(0 = Q* + y[u* - M(t)]. (27) 


When a = 10 and A = 0.2, h/(a + b) = .019608. 
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Solving (25) for u(i), we have 

u(t) = u* + (m — 

Thus the present discounted value of output satisfies 


fV' 


j 0 


''Q(0di 


— Q* y(u* ~ m ) 




( 28 ) 

(29) 


Since the optimal unemployment rale exceeds the equilibrium rate, 
the value of output along this path exceeds its value at the optimal 
steady state. The excess of W over equilibrium output, W, is shown in 
table 1. The addition to the value of unemployment compensation 
from analyzing the dynamic path is small but noticeable for the 
examples calculated. 

In addition to considering this path, which was chosen for its ease of 
analysis, it is interesting to analyze the optimal path assuming the 
government could control c*(f)- The choice problem is 

max f ^|[l - Ii(()]y - ±{l - e ^ nu(<)r*(I)]}j dt 

(30) 

s.t. u(l) = 11 - u{t)]b - au{t)[\ ~ 



0+ b 


Fit:. 3 
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The first-order condition for c* is 

c*(t) = (r + b)c*(t) - y + (31) 

The phase diagram for the optimal path is shown in figure 3. Along 
the optimal path, c*{t) is falling while u(t) is rising. Their product, and 
so the proportion of job offers accepted, is rising. In terms of figure 2 
the optimal path of net output converges to Q* more rapidly than the 
path with t/c* con.stant, starting from a higher initial net output level. 

Baily (1977) and Flemming (1978) have analyzed optimal unem¬ 
ployment compensation assuming risk-averse workers and no exter¬ 
nalities in the labor allocation process. This paper takes the opposite 
tack of assuming risk-neutral workers and externalities. It seems likely 
that workers are risk averse and externalities are present, making a 
much stronger case for unemployment compensation. 
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International Transmission of a Real Shock 
under Flexible Exchange Rates: A Comment 

Betty C. Daniel 

Tulane Unwersity 


In a recent article in this journal, Hamada and Sakurai (1978) claim to 
have demonstrated that a recession in a foreign country, generated 
either by a reduction in monetary growth or by a decline in produc¬ 
tivity, is transmitted to the domestic country as a temporary 
stagflation. They emphasize the wage-price spiral as the transmission 
mechanism which operates under fixed and flexible exchange rates. 
In particular, any fall in foreign output causes an increase in the 
relative price of foreign goods. This puts upward pressure on home 
wages through a Phillips curve, which includes expected inflation of 
consumer prices; employment and output fall while inflation in¬ 
creases. The article deserves acclaim for focusing on the neglected 
“wage-price spiral" transmission mechanism; however, it is misleading 
in its assertion that a reduction in output abroad, whatever its cause, 
has temporary effects at home. 

The purpose of this note is to demonstrate that the wage-price 
spiral transmission mechanism, following a decline in foreign pro¬ 
ductivity, generates a permanent fall in domestic output as well as 
temporary stagflation. The productivity decline, which permanently 
lowers foreign output, permanently deteriorates the domestic terms of 
trade and permanently reduces domestic output by reducing the pur¬ 
chasing power of labor’s wage. The deterioration in the terms of trade 
leaves the real producer wage (wage divided by the price of the 
domestic output) unaffected. However, the higher price of imports 

1 wish lo thank my students Rebecca Morton and Vincent Salvo For drawing attention 
to the inconsistency implied by temporary domestic cffecu of a permanent foreign real 
shock. I also wish to thank Koichi Hamada for excellent comments on an earlier draft. 
Partial financial support from the Murphy Institute of Political Economy is gratefully 
acknowledged. 
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reduces labor’s real wage (wage divided by the consumer price index), 
creating an excess demand for labor. The nominal wage is bid up, and 
employment and output fall.' 

The source of the authors’ result is an incompletely specified 
model. Their model is adequate for monetary shocks (their primary 
interest) but not for real shocks. Extension of their model to meaning¬ 
fully incorporate real shocks requires a respecification of their Phillips 
curve. 

I'he authors sf>ecify the domestic and foreign Phillips curves by 
letting the rate of wage inflation in each country (u), w*, where an 
asterisk refers to a foreign variable) equal expected inflation in the 
consumer price index (ir, n*) plus a function (i/i) of output produced 
in each country where subscripts 1 and 2 refer to the com¬ 

modities produced in the domestic and foreign countries, respec¬ 
tively). They are expressed as 

XV = t/l(Xi) + TT, (1-1) 

XV* = i/;(Xt) + -ir*. (1-2) 

(Equation numbers correspond to those of the authors unless pre¬ 
ceded by the letter A. The latter are alternative equations proposed 
by myself.) 

The long-run requirement that xv = tt implies that there is only one 
value of X, consistent with long-run equilibrium. The analogous rela¬ 
tion holds for the foreign country. This implication is clearly incon¬ 
sistent with allowing real shocks to have permanent long-run output 
effects. In particular, it i.s inconsistent with the author’s claim that a 
productivity decline in the foreign country permanently reduces Xf. 
This problem is easily solved by respecifying the Phillips curve to 
allow wage inflation to depend on the deviation of output from 
long-run equilibrium as well as on expected inflation according to 

w = i/t(X, - X,) TT, (A 1-1) 

XV* = i/»(Xf — Xf) -t- v*, (A 1-2) 

where a bar denotes a long-run equilibrium. 

It is now necessary to specify the determinants of long-run output 
implied by the authors’ model. Steady state is characterized by 
equality in the rates of growth of money (ff), prices (p), and wages (xv). 
Output is determined by the authors’ production functions, 

X, = /V®, 0< /3< 1; (3-1) 

Xt=N*»\ (3-2) 

'This eftect is discussed in .Salop (1974), Daniel (1977), and Leiderman (1979). 
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where X denotes output, N denotes employment, and is the em¬ 
ployment elasticity of output. 

Labor demand is determined to equate the nominal wage (w) with 
the value of marginal product according to 

w=pifiN^-\ (4-1) 

w* = (4-2) 

where p denotes price. 

Prices are determined by the assumption that exptenditure on each 
commodity is a constant fraction of total expenditure. This assump¬ 
tion together with the authors’ assumption that exp>enditure equals 
income yields the authors’ equations, 

p^X^=vM, (13-1) 

(13-2) 

To close the model for long-run output, a specification of the 
long-run supply of labor is necessary. I assume labor supplied is a 
positive function of the real wage according to 

N = (wipf, (a) 

N* = {xu*lp*f", (b) 

where S and S* > 0 are the long-run real wage elasticities of the labor 
supply, and p and p* arc the consumer price indices in the domestic 
and foreign countries and arc defined by the authors as /> = p^pl'” 
dtvip* = p’t”pV ''- I his specification is in keeping with the short-run 
Phillips curve whereby a slower growth rate of nominal wages, relative 
to the price level, reduces the real wage and reduces output. By 
alUjwing labor supply to care about the real wage in the long run as 
well as in the short run, the possibility that a disturbance which alters 
the real wage can permanently alter output is admitted. This is pref¬ 
erable to assuming away long-run effects by fixing long-run labor 
supply while allowing short-run labor supply to respond. 

Kquations (3-1), (.3-2), (4-1), (4-2), (13-1), (13-2), (a), and (b) ran l>e 
solved to yield the reduced-form solution for long-run output; 

X, = yih, H. (O 

Xf = y*{k H- (d) 

It is imporuint to notice that a fall in /3* permanently reduces domes¬ 
tic output because it deteriorates the home country’s terms of trade 
thereby reducing labor’s real wage.* 

’ This can tie proved by loliflly diftcieiitiiUing ihc Ior line.ir version of the model 
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It is now possible to meaningfully analyze a shock to foreign pro¬ 
ductivity. The respecified Phillips curve together with explicit allow¬ 
ance for changes in productivity require rewriting the authors’ 
equations (20) and (22) as 


Xj = /3 


a - - X,) + (i + 


a* - 4»(x? - xj) + - TT* . 


(A20) 

(A22) 


where * denotes a proportional rate of change. The long-run re- 
quirement_^at 0—_jr = X, = /§ = 0 {B* — it* = X^ = = 0) implies 

that X, = X, (Xj = Xj). Anything which changes the long-run level of 
output requires an eventual corresponding change in X, (Xj). 

The model is completed with the assumption of adaptive inflation¬ 
ary expectations which yields the authors* equations (21) and (23). 
They are unchanged by the foregoing modifications: 

n = y{6 - (aX, + (1 - a)X|] - tt}, (21) 

TT* = y{a* - [a^{, + {\ - «):>??] - TT*}. (23) 

Figure 1 depicts the relationsh^s between trandXi necessary for/f, 
= 0 and TT = 0 for given /3, X,,X?, and B and the corresponding 

relationships in the foreign country. Equations (A20) and (A22) imply 
that for = 0, inflation must be negatively correlated with output. 
The same type of relation exists in the foreign country. The tt = 0 
relation is determined by substituting equation (A20) into (21) and 
analogously lor it* = 0. Dynamics are described by arrows following 
the authors. 

Now consider a fall in foreign productivity. This can be represented 


composed of (4-1), (4-2). (13-1), (13-2), (a), and (b) where the exchange rate is 
^o^^ 1 dlI^ed at unity. 

fipi ^ -Iffd - g) + (t; - /3)1(1 Y T)* In N*)pi ^ „ 
dp* A 

dpt ^ -ff(l - g)(l 4- 7>* \nN*)p, ^ 
dp* A 

dN* _ {(I - a)l(pip*) + 1 , lnN*l -r (\lp*)(y, - P)}N* ^ 
dp* A 

dN _ (I - g)(l + t;* lnN*)Af 
dp* A 

where A = oiP*{r) - /3) + (1 - a)<T)* - P*)P + (v ~ fl)(V* ~ P*). ■») = (1 + 6)/S,and if - 
(I + &*)!&*. With P = P* initially, as the authors assume, it is obvious that a reduction in 
P* deteriorates the domestic lerm.s of trade and reduces foreign employment relatively 
more than domestic employment. Foreign output also falls relatively more Irom et|() 
(3-1) and (3-2). 
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nimr.ilc Couniiy ror.ljn lounii-y 



by a shock to /3*. In equation (A22), /S* falls temporarily and XJ falls 
permanently. Both effects shift = 0 left in figure I, and since the 
fall in Xf is permanent, it remains left of its original position. In 
equation (23), the fall in yS* stimulates inflationary expectations by 
reducing X| (from [ A20] and [c])and by reducingX f (from [A22] and 
[d]); IT* = 0 shifts up to the left permanently. Under the assumption 
that ^ 0 for a very short period of time, the foreign adjustment 
path can be drawn as by the authors in figure 1. 

The domestic adjustment path and final equilibrium are not as 
described by the authors. The fall in XJ, which occurs due to the 
productivity shock, causes a permanent increase in its relative price 
(see n. 2 above). This reduces home labor’s real wage and perma¬ 
nently reduces X,. T herefore, ^1 = 0 also shifts left. It shifts less than 
= 0. The permanent increase jn the relative price of foreign 
output implies that X, falls le.ss than X^; rf = 0 also shifts left due to the 
decline in X, and Xf caused by the fall in foreign productivity. The 
adjustment path and long-run equilibrium are similar to those 
abroad. The stagflation is temporary in both countries, but the fall in 
output is permanent.’ 

In conclusion, this note confirms that the wage-price spiral does 
cause a fall in foreign output, whatever its cause, to be transmitted as 
stagflation in the short run, as the authors claim. However, it should 
be noted that the wage-price spiral also causes a permanent foreign 


' Hamada has noted that the fall in oulpul could be permanent under the assumption 
that labor supply is indepiendenl of the real wage in the long run. T^is assumption 
scents less appropriate in a paper about the wage-price spiral transmission mechanism 
in which the short-run dynamics depend crudJIy on the effect of changes in the real 
wage on labor supply. 
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productivity deterioration to be transmitted as a long-run decline in 
domestic output. The permanent terms of trade deterioration, im¬ 
plied by the productivity shock, reduces the purchasing power of the 
domestic workers’ wage leading to less employment and output. 
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I. Introduction 

Since the classic paper by Friedman and Savage (1948), it has gener¬ 
ally been accepted that the observed fart that individuals or firms’ 
participate in unfair lotteries and other forms of unl'air risk taking^ 
may be explained by a section in the individual’s utility function in 
which the individual shows risk preference rather than risk aversion. 
In their model, Friedman and Savage specify a utility function which 
is, in turn, concave, convex, and concave, thus allowing for simultane¬ 
ous purchase of insurance (risk aversion) and participation in lotteries 
(lisk preference). 

However, while their specified utility function is indeed capable of 
explaining observed behavior, it does seem to be rather unsatisfactory 
in that it is an ad hoc specification. It is the purpose of this paper to 
suggest a set of circumstances which give rise to a Friedman- 
Savage-type utility function. 

In particular, it is shown that when certain capital market imper¬ 
fections exist the utility function defined over intermediate wealth 
should be distinguished from and may have different properties than 
the (jne defined on final wealth. Then, even if we accept the common 
assumption made in the literature that individuals are risk averse, that 
is, that their utility function is concave over final wealth, it is still 
possible that they participate in unfair gambling (and. of course, may 
al.so purchase insurance). 

An earlier version ol (his papei wa.s presenieti at (he K.iiropean Meeliiij^ of the 
Econometric StKicty, Athens, Scptemtwv 1979. We wish to thank the editor and an 
anonynioii.'S referee for (heir helpful comments 

' The word "individual” is, throughout this p,aper, used to denote economic uiiio, 
lu hiding E>oth individuals and hi ins. 

’The term "unfair lotteiy” in the paper encompasses all investment opportunities 
offering acluarially unfair risks. 
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The motivation behind our endeavor to provide an institutional 
explanation for gambling is a reluctance (shown, e.g., by Stigler and 
Becker [1977]) to use non-well-behaved utility functions as expla¬ 
nations for economic phenomena without some a priori reasons for 
assuming that the utility function is of an unusual shape. Clearly all 
phenomena in economics can be technically explained by recourse to 
tastes. Thus, for example, individuals will necessarily show risk pref¬ 
erence over a section of their utility function which is assumed con¬ 
vex. However, is there any a priori reason to assume that there may be 
such a section.’ It is a positive answer to this question which this paper 
attempts to provide. 

In Section II we discuss certain capital market imperfections and 
their implications about the nature of rates of return on investment. 
In Section III we show how the Friedman-Savage utility function may 
emerge when utility is defined on intermediate wealth and there exist 
capital market imperfections. The possibility of gambling with risk 
aversion and of simultaneous gambling and insurance therefore im¬ 
mediately arises. 


II. Rates of Return and Capital Market Imperfections 

A number of recent studies discuss the effects of uncertainty, im¬ 
perfect information, and various transaction costs on capital markets 
and show that the result may be that capital markets are characterized 
by certain imperfections. These imperfections are usually in terms of 
the nonexistence of certain markets and the fact that there may not be 
free and equal access to other markets. Uncertainty and the possibility 
of costly default may lead lenders to introduce collateral requirements 
or take default costs into account in their loan rates. Consequently, it 
can be said that one has to have certain assets (providing collateral 
services) in order to have easier or cheaper access to capital markets. 

Barro (1976) and Benjamin (1978) derive these conclusions ex¬ 
plicitly and show that market imperfections will lead to loan rates 
being functions of loan sizes and available collateral. In particular, 
they derive a loan-supply function which is constant for some initial 
range and then becomes an increasing and convex function of loan 
sizes. 

In the same vein, Jaffee and Mcxligliani (1969) show that, as a result 
of uncertainty and imperfect and costly information, capital markets 
may be characterized by credit rationing. They show that beyond a 
certain point loan rates will generally depend on loan sizes, with a 
possible upper bound on loan sizes. Furthermore, they provide 
empirical evidence supporting these types of imperfections, 

Empirical evidence indicating capital market imperfections is also 
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given by Eckstein (1961) and Nerlove (1968), who find capital markets 
to be characterized by differential rates of return on given invest¬ 
ments. 

The important implication of the capital market imp>erfections is 
that individuals face a variety of capital market constraints and, there¬ 
fore, do not have free and equal access to the market. Consequently, 
capital provides additional services by either weakening the accessi¬ 
bility constraints or by reducing the cost of the acquisition of capital. 
By providing collateral services, capital, therefore, increases the set of 
“feasible activities” or reduces their cost, so that its “full rate of 
return” should take this additional role into account. 

A consequence of these capita) market imperfections is that for 
some levels of asset holdings the rates of return on assets may depend 
on the levels of the asset holdings, even from an individual’s (rather 
than an aggregate) point of view. The range within which this is likely 
to happen is when wealth levels are low, but not below some minimum 
level. When an individual’s wealth is below some minimum level the 
capital market constraints may be so effective that except for the 
possibility of obtaining small loans the market is in effect inaccessible. 
Since small loans can usually be obtained at constant loan rates (as is 
shown in Jaffee and Modigliani [1969], Barro (1976], and Benjamin 
[1978]), the additional role of asset holding is ineffective within this 
initial range, and consequently the rates of return will not depend on 
asset holdings. However, as an individual's asset holdings increa.se 
above some minimum level, his higher level of wealth will provide him 
with the additional “collateral services," and rates of return within this 
range will depend on the levels of wealth. 

Since rates of return are usually not known with certainty, the 
dependence of rates of return on asset holdings is in a probabilistic 
sense. In other words, the rates-of-return probability distribution 
function changes with asset holdings, that is, the distribution is condi¬ 
tional on asset holdings.’ 

In addition to the above considerations, rates of return may be 
increasing functions of wealth (over some initial range) for other 
reasons as well. First, a large number of fixed costs may be incurred in 
effecting profitable investment. Information costs incurred in locat¬ 
ing high-return investments may be considerable, and a large element 
of these costs may be relatively fixed. Transaction costs, where again 
the large part is fixed, are also likely to comprise a high percentage of 
small investments. In the presence of these costs the rate of return will 
be an increasing function of the investment. Second, there may exist 


* Thus, e.g., an increase in asset holdings will over some range make the probability 
distribution more stochastically dominant. 
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significant indivisibilities which (especially in view of the capital mar¬ 
ket constraints) will imply increasing returns to scale. 

In view of these considerations, we conclude that individuals, espe¬ 
cially those with small or moderate levels of wealth, may very often 
face various capital market constraints which lead to rates of return 
being functions of wealth. Furthermore, the relationship between 
wealth and the rates of return is .such that rates of return are (at least 
over some range) increasing functions of wealth. 

III. Gambling with Risk Aversion 

Having discussed some of the capital market imperfections and the 
constraints they impose on individuals, we now consider their effects 
on the individual’s attitudes toward risk taking. 

In order to focus on the gambling problem and to separate it from 
the investment problem, we assume that the rate of return is nonran¬ 
dom. This assumption is not crucial and does not change the results. 
For the .sake of simplicity, we also assume that there is only one asset. 
A discussion of the case with many assets and random rates of return 
can be found in Appelbaum and Katz (1979), where the same results 
are derived. 

In line with the discussion in the previous section, it will be assumed 
that the individual faces a rate of return R on his investment. A, such 
that 

R' = 0 0 ^ A ^ A* 

R = R{A) R' > 0 for A* ^ A A** (1) 

/{' = 0 for A > A**, 

where R' is the partial derivative of R. In other words, the rate of 
return is constant over some initial range, increasing over some sub¬ 
sequent range,'* and then constant again. 

An individual investing an amount A will end up with final wealth 
W, such that 

W All + R(A)]. (2) 

Clearly, convexity of R(A) over some range is sufficient, but not 
necessary, for the convexity of FF(A) over the same range; both convex 
and concave rate-of-return functions may lead to a convex IV(A).' 

‘This type of return function is discussed also in Blinder (1974) and Appelbaum and 
Harris (1978). 

* It docs, however, seem reasonable that at least over some initial range (when Af is not 
constant) R" > 0, i.e., the function is convex. The convexity of R(j 4) at the initial lange 
would, e.g., follow from the convexity of the loan-supply function derived by Jaffee and 
Mcxligliani (1969), Barro (1976), and Benjamin (1978). 
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Let us now consider the individual’s decision problem. Following 
the literature,® we assume the individual has a utility function defined 
on final wealth, IJ(W), and that 

U' > 0. U" < 0; (3) 

that is, the individual is risk averse. 

Now, consider the individual’s utility function when it is defined 
over A —his intermediate wealth. This is given by 

y{A) = U{All + R(A)]} (4) 

and 

y'(A) = U' ■ (I y R + AR'), (5) 

r'(A) = U" •(• + « + AR'f + U' ■ {2R' + AR"). ( 6 ) 

Therefore, in accordance with our above discussion, F' > 0 for all 
values ofyf. Ford <d’'‘andd > A**, F"is clearly negative. Within the 
ranged* < d s d** the second term on the right-hand side of (6) may 
be positive and, hence, V" may be positive. If this occurs there will be a 
section of F(d) that will be convex and, hence. F will show the 
Friedman-Savage shape of Ireing in turn concave, convex, and con¬ 
cave. 

'I'he result may also be derived diagrammatically, as is shown in 
figure 1. 'Fhe curve F(d) drawn in the first quadrant is clearly a 
Friedrnan-Savage type of utility fuiu tion which allows both gambling 
and insurance in intermediate wealth to take place simultaneously. 

Even if F(d) is concave within the range d* < A < A**, we can still 
get a convex region, as in the Friedman-.Savage utility function, since 
F(d) will, in general, have a kink aid*. Thus, although in this case 
F(d) is made of concave segments only, it will nevertheless not be 
globally concave. I'his case is shown in figure 2, where again it is 
possible to observe gambling and insurance simulutneously. 

Hence, we have provided an explanation of the shape of the 
Friedman-Savage utility function without suggesting that an indi¬ 
vidual is anything but everywhere risk averse. Rather than focus on 
the la.sles of the individual, we have fix'used on his institutional 
constraints.’ 

"See, e.g , Arrow 1970, 

' An alternative way of obtaining these results is to employ the state preferenee 
approach and consider the effects ol capital market imperfet lions on the set of accept¬ 
able gambles. It can be easily shown (see Appellraum and Kal/. 1979) that the isoutility 
curves defined over twti random states are not nrces-sarily convex, and thus both 
gambling and insurance buying may occur simultaneously. 
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IV. Conclusion 

This paper provides a possible rationale for the Friedman-Savage 
utility function. It sugge.sts that the existence of market imperfections, 
in particular in the capital markets, may impose various constraints on 
individuals and thus affect their behavior. The existence of these 
constraints could lead individuals to participate in unfair gambling, 
since this may reduce the implicit costs of the constraints. 
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Our explanation of gambling focuses on the constraints facing an 
individual rather than on his preferences, and consequently it does 
not require a modification of the standard assumption that individu¬ 
als are risk averse. Of course, it may be that individuals show risk 
preference over a .section of their utility function. Such a section is 
not, however, necessary in order to derive a Friedman-Savage type of 
utility function. 
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Foreign Trade Regimes and Economic Development: Anatomy and Consequences of 

Exchange Control Regimes. By Jagdish N. Bhagwam. 

Cambridge, Mass.: Ballinger Publishing Co. (Ibr National Bureau of Eco¬ 
nomic Research), 1978. Pp. xix-(-232. $16.50. 

This book is one of two volumes synthc.sizing the results of the NBER's 
project oit Foreign Trade Regimes and Economic Development. The country 
studies covered Turkey, Ghana, Israel, Egypt, the Philippines, India, South 
Korea, Chile, and Colombia. Taken together with the country studies under¬ 
taken as part of the OECD’s Industry and Trade project, whose results were 
summarized in the well-known volume by Little, Scitovsky, and Scott (1970), 
development economists now have a wealth of analytical and empirical mate¬ 
rial from which to Judge the past trade policies of developing countries, as 
well as to make firmer policy pronouncements about future trade policy. 

Bhagwati's synthesis volume covers two of the three issues with which the 
project was concerned, namely, “the characterization of the anatomy of ex¬ 
change control regimes, including the interaction between quantitative and 
price measures" (p. 2), and the relationship between difi'erent types of 
exchange-control regimes and economic efficiency both in its static and 
dynamic aspects. In its third dimension, the project attempted to discover if 
there was “something systematic about the forces that governed the likelihood 
of success in moving from a restrictive exchange ctintrol regime to a 
liberalized legimc” (p. 3). This is the subject matter of the companion volume 
by Anne Krueger (which is not reviewed here). 

Within the compass that Bhagwati has set for himself, he has succeeded in 
providing a rigorous analy.sis and scrupulously fair summary of the evidence 
that he and others think is relevant to the anatomy and consequences of 
exchange-control regimes. 

The starting point for Bhagwati’s taxonomy of alternative exchange- 
control regimes is the limitations of the traditional Marshallian prtiposition on 
the equivalence of tariffs and tjuotas when its implicit assumption that the 
supply and demand schedules lor foreign exchange are given breaks down. 
The actual ways in which imports (and/or exports) are regulated therefore 
become impirtant in determining the relative domestic prices of traded 
goods and, hence, the welfare effects of the trade regime. As Bhagwati's 
subsequent taxonomy of the regulation mechanisms observed in the country 
studies shows, they are a bewildering variety, and if nothing el.se his taxonomy 
should help the practitioner attempting to analyze a particular LDC’s trade 
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regime to organize his or her thoughts and, as such, is a valuable addition to 
the literature. The rest (and major part) of the book summarizes the evidence 
from the pntject (and elsewhere) on the consequences of exchange-control 
regimes. 

A.side from dtKuinenting the reasons for and evidence on the illegal trans¬ 
actions that different types of exchange-control regimes have entailed, the 
bulk of Bhagwati's book (pp. 82-218) is concerned with the static and 
dynamic efficiency eflects of exchange-control regimes. The project used 
estimates of domestic re.sourcc costs (I>RCs) as indicators of static efficiency. 
Bhagwati provides a lucid discussion (pp. 88-90) of the statistical and ana¬ 
lytical pitfalls in the use of DRCs as opposed to “shadow prices” as measures 
of the misallocation of given rc'sources. But despite these shortcomings, he 
rightly argues that “there is no easy way out of the conclusion that the pattern 
of investment was less than optimal” (p. 91) under the project Phase Two (i.e., 
QR [quantitative restrictions] based) regimes. Moreover, estimates of .scH:ial 
rates of return based on “shadow prices” for at least India (see Lai 1980) show 
that these conclusions are unlikely to l>e overturned by the use of more 
appropriate indicators of allcxative efficiency. 

1 he evidence for the NRER country studies thus prtivides f urther support 
for the major conclusion of the earlier OF.fiD study on the misallocation of 
resources resulting from import-substituting QR regimes. Thus Bhagwati 
wiites, 

the weight of the evideiue in the countries in the Project then is in 
support of the hypothesis that exchange control regimes by 
che.ipening the tost of intei mediates and capital goods influent c and 
disttirt both the < hoice of techniques within activities anti the relative 
composition of outputs and investments among activities. . . . while 
pan of the evidence is inferential in deducing effects from the 
tibserved institutional fe.itures of the exthange lontrol regimes of 
the Phase 11 variety and only pat t is direct, the o\ ei whelming hulk of 
it is consistent with the view that the static elfiiienty effetls of such 
regimes have been adverse in practice. 1P|). 117. 121] 

I'he evidence presented on the dynamic eflects of altern.uive ti ade regimes is 
less tout lusive, ihough Bhagwati stales that the major difleremc between the 
NBKR and OKCD projects was "in analyzing in far greater depth the growth 
effects as well as the conditions tlcleimining a suciesslul transition from 
lesirictive to liheiali/ed legimes" (p. 6). Tfie latter aspect is not discussed in 
tills review, as it is die subjet l mailer of Krueger's soluine. 

Tfierc IS a long t h.ipiei i eviewing the possible .malytical links (and evidence 
ihercoii) lietween domeslit and foieigii saving and allei native Made regimes 
Bhagwati rightly concludes that the evideiue from the country studies and 
other economelnt exeicises does not .support the view that i esii ictionist 
exchangc-tontrol legiines “will, or are likely to contribute to increased 
dome-stic savings and/or to augniciued capital lormalion. If anything, much 
tif our evidence—at least on the domestic savings issue—suggests an opptisite 
relationship" (p. 174). 

I he project studies also provide dtuumentation foi the exi>ecied ptisitive 
response tif exports to a 1 eduction in the bias imposed by an ini(x>u- 
subsiituting QR regime; and moreover, as Bhagw.ui rightly emphasizes, "the 
project results indicate strongly that it is really a shift to successf ul liberaliza¬ 
tion and therefore rorifiriwng liberalization that is critic.il to improved export 
performance on a sustained basis" (p. 184). 
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The evidence cited and assessed on various other possible dynamic el'tects 
on entrepreneurship, innovation, and technical change and income distribu¬ 
tion is veiy inconclusive, though it does support the negative conclusion that 
there is little basis for the view that there are beneficial dynamic effects in 
these dimensions which outweigh the static efficiency costs of import- 
substituting QR regimes. 

Similarly, while "the cross-sectional contrasts among the Project countries 
indicate strongly that the countries that have shifted to [liberalized] Phases IV 
and V have better growth performance than those that have continued in 
Phase Il-type restrictive regimes” (p. 191), the evidence for the reasons cited 
foi why this may tie the case again appears to be inconclusive. 

Bliagwati, like many other development economists, asymmetrically labels a 
movement from the neutral free-trade position as import substituting, while 
a movement back toward this point is labeled export promoting. In an age 
given to slogan mongering, I am not sure that this redefinition of a polity of 
promoting free ti ade (or “optimal" trade) is necessarily helpful, as it could 
lead people (who have not read Bhagwati with care) to believe that a policy of 
export subsidization above the level which restores the equality between the 
effective exchange rales on imports and exports was desirable. Like the 
Little-Scitovsky-Scott volume, Bhagwati’s mes.sage, though mote mured, is 
that both in theory and practice a movement ttiward free trade is desirable in 
most dereloping countries, and it would .seem best to .say so clearly and 
emphatically. 

I was also troubled by Bhagwati’s hesitancy in arriving, as he puts it, “at 
strong prescriptive conclusions in regard to many of the detailed questions 
that are examined" (p. 5). In particular, on the dynamic elfects ol tiade he 
stales that, while most “genetal propositions [concerning the links between 
trade, savings, and growth] are not sustainable, tndtvidwil country expel iences 
may and often do indicate the dominance of only a subset ol the possible 
linkages and eflects, thus sustaining stronger propo.sitions in regaid to their 
own experience" (p. 5). But this raises the methodological question ol the 
nature of the evidence required to substantiate the cau.sal links in a chain of 
dediutive economit argument linking various intluctions. For, if one accepts 
the view that the dynamic case lor free trade is that it is, in Irving Kravis’s 
(1970) phrase, "a handmaiden of growth,” it is not clear what other sort of 
geneial evidence would need to be adduced to support this view, apait trom 
the historical and therefore counterfaclual evidence presented for each 
country in the NBFR sample. Surely on this ba.sis we should be able to form a 
general prc.sciiptivc judgment in the dispute that arose during the 1950s 
between Nurkse and his followers on the one .side, and Habeiier and Viner on 
the other, on the desirability of ftee trade fcjr postwar developing countries 
(see the quote in Bhagwati, p. 296). 

The Nutkseian skepticism about the desirability of liee trade for contem¬ 
porary I.DCs was based on thiee implicit or explicit assumptions. First was a 
pessimistic assumption about the external obstacles to LUC export growth, so 
that a shortage of foreign exchange was perceived to be chronic and an 
indcfiendent limiting factor in economic devefopmenl. Second, economic 
agents m I.UCs were assumed to lie congenitally .shortsighted in basing their 
investment and production decisions on current static rather than emerging 
dynamic comparative advantage. Finally, it was assumed that bureaucrats and 
planners could both piedict the country’s dynamic comparative advantage 
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better than private agents and ensure that production and investment pat¬ 
terns conformed to it through detailed stale intervention. The OECD and 
NBER studies for the various countries have d(x:umented the inadmissibility 
of all three sets of assumptions. The obstacles to LDC export growth were 
largely internal, not external: economic agents in these countries have reacted 
to the distorted .set of incentives created by protectionist trade-control re¬ 
gimes much in the way that standard economic theory would predict; plan¬ 
ners have often shown a lack of foresight which would have swiftly bank¬ 
rupted any private agent. 

It is therefore surprising to find Bhagwati writing that “the statistical 
evidence and analysis of this pcjssible cause and others of asymmetrical ad¬ 
vantage of the EP strategy is not yet available in anythitig like the degree that 
would be reasonably compelling” (p. 215). Given the largely simple 
econometric evidence that Bhagwati cites in his discussion of the dynamic 
effects of trade regimes, 1 suspect that this is the kind of “compelling" 
evidence he is seeking. However, it is doubtful whether it will ever be possible 
to find evidence for the truly dynamic effec ts of trade regimes from which the 
usual statistical inferences could be made, and for a very compelling reason: 
Economics, as Hicks (1979, p. 38) has put it, “is on the edge of science and of 
history.” The historical aspects are particularly important for what he terms 
“sequential causality,” which is the relevant notion of causality when we seek 
to understand the dynamic effects of trade regimes. But in studying such 
dynamic historical processes, the use of techniques of statistical inference may 
not be very useful, because “when we cannot accept that the observations 
along the time scries available to us are iiidc(x:ndcnt or cannot by some device 
he divided into groups that can be treated as independent, we gel into much 
deeper water, for we have then, in strict logic, no more than one observation, 
all of the .separate items having to be taken together. We are left to use our 
judgment, making .sense of What has happened as best we ran, in the manner 
of the histori.tn. Applied economics does then come back to history after all” 
(Hicks 1979, p. 112). 

The great strength of the NBER and OECD country studies is that they do 
provide analytital etonomu histories of particular LDC's trade regimes. 
rhe.se can and should be used to provide judgments on the validity or 
otherwise of one or the other “view of the world," but it is probably a vain 
hope U) expect that it will ever be po.s.sible to derive "general laws" like those 
of physu s for these dynamic historical proce.sses by tbe use of statistical 
inference. May the inconclusiveness of Bhagwali's discussion of the dytiamic 
effects of allei native trade regimes perhaps be due to his .setting hinrself an 
impo.ssible standard of evidence for .substantiating the view that Iree trade 
regimes are the "handmaidens of growth"? 

But pel haps this is Ijeing unfair to Bhagwati, who has written a lucid and 
scholarly book which, in his words, “the reader should appioach ... as a 
manual. Anyone who wishes to examine a sjjecific countiy’s foreign trade 
regime and its consequences can read the present volume and readily have 
access to possible hypotheses and techniques of analysis . . . that may be used 
for the reader’s own analysis" (p. 5). In this he has succeeded and deserves 
commendation. 

Df.f.pak Lai. 

Uninersity College, London 
World Bank 
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Population and Economic Change in Developing Countries. Edited by Ru.hard A. 

Ea.stf.rmn. 

Chicago and London: University of Chicago Press, 1980. Pp. 581. $38.00. 

This volume, number 30 of the Universities—National Bureau Committee for 
Economic Research conference reports, makes available papers and dis- 
cus.sants’ comments onginally presented to the Conference on Population and 
Economic Change in Less Developed Countries held in Philadelphia in 1976. 
Nine distinguished authors have written major papers (a lact wliich must l>e 
stated when reviewing a conference volume, for, far too often, prominent 
researchers produce their second best for such occasions). Every paper is 
worth reading, and most are exciting. The volume will be necessary reading 
for people working in this and allied fields. 

Yet the book may, in the years tocome, be thought ot as a monument in the 
hnal years of a profoundly unsatisfactory period—the Ari.stotelian era—in the 
study of the economic demography of developing countries. There are two 
major problems in the volume, and both are essenliully problems of the 
di.scipline, faithfully reflected in the conference. 

The first problem ts that hardly any of the authors, and very few of the 
discussants, have ever spent any substantial period living and researching in a 
developing country. More limited experience is often just enough to convince 
the researcher, quite falsely, that he is in contact with a poorer version of our 
own society. It is as if anthropologists could still work without years of field 
experience, or as if geographers could write Isooks ftom afar by merely 
interpreting Icjcal maps. 

This is not just carping criticism. The central theme ol fxith the conference 
and the book, addre.ssed by five of the nine papeis, is Third World household 
economics and individual behavior and decision making. These papers are 
saturated with hesitant inferences, drawn from piles of anonymous data 
collected in a society which (he authors really do not know, about how these 
households must work. Very rarely does one of the writers say with any 
assurance that this or that kind of situation is usually found in the country 
being di.scussed. The loss is immense and can be illustrated by .some related 
examples. The value i>i children’s labor is compared with that of the adults in 
the family, or of outside paid labor, or that ol slaves. No one points out that 
one reason for preferring children to adults is that they do various forms of 
menial work—“children’s werrk’’—that adults hate doing. No one points out 
that, throughcjut much of Africa and Asia, cjiily family members are trusted, 
and the presence of hired lalxir in the house or even the surrounding helds 
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gives rise to disquiet. No one shows that children’s labor is valuable only so 
long as they work hard with little resentment or protest, and that this is less a 
product of their existence than of a social system in which children expect to 
have their work directed by their elders and the older generation takes such 
direction for granted. It is a system which can be subtly transformed if 
cultural change (or the advent of schooling) alters the .social relations between 
the generations. The market wage rate is frequently taken to measure the 
relative value (which must in a sense mean the intrinsic value and not be 
merely a synonym for market wage) of the labor of men, women, and 
children. Yet in a South Indian village in which my wife and I have lived and 
researched, although women are the preferred workers in certain postharvest 
ojrerations because of the superior skills they are said to pos.sess, their daily 
wage rate for that work is five-sevenths of that paid for the same work by men. 
Children’s wages are lower still, even for ta.sks where they arc demonstrably 
just as efficient as their elders. Wage levels do not measure work inputs, but 
differentials by sex and age are maintained in order to preserve a certain 
social fabric. 

The second problem—related in the .sense that it also arises from sub¬ 
stituting pontification of a type for which we have condemned medieval 
schoolmen for long experience of the societies being analyzed—is the ten¬ 
dency to treat each society as autonomous. To a considerable extent this was 
the case in premodern Europe in that external cultures had no major impact, 
and hence Ronald Lee’s perspective of preindustrial England escapes this 
censure. Yet it is not true in today’s developing world. When we have asked 
people in both Africa and Asia about the major distinctions between their 
ways of life and those of their parents or more distant ancestors, they have far 
more often quoted aspects of scx ial change than of economic development. 
Furthermore, they overwhelmingly explain such social change—and almost 
certainly correctly—in terms of impirrted Westernization (or social moderni¬ 
zation) 01 because of learning other ways of life m si hool (seen quite clearly as 
an imported institution), rather than as the minimum alteration needed for 
accommodation to economic ihange. The .social influence of the West, 
whether approved or disapproved, is felt to be immen.se. The deiline of the 
patriarchy with all its economic advantages to older males is seen to have come 
about because of the new ideas. For most of the developing world the real fart 
of life is not a gradual development from the preexisting economic and social 
base but an aicelcrating revolution that has come about l>erause of the 
extraordinary movement in this century toward a global economy (frequently 
noted in contexts other than this volume) and a global six-iely (much less 
frequently noted). Yet almost every analysis in the book implies an almost 
autonomous economic and .social change in every society, even tiny national 
states. That this is not really so can be inferred from some of the findings; for 
instance, Samuel Preston’s prixif that perhaps only one-third of the mortality 
decline this century in developing countries can be explained by rising income 
levels, while the other two-thirds (although it is not pul this way) arises from 
the ever stronger interrelations Ix’twcen industrial and noniiidustrial societies 
and indeed among all societies. This blind spot ixrcurs not only liecause of 
inexperience of the Third World, but because the researcher becomes a 
prisoner of his own methods of analysis. The data tor each countr)' or MX'ieiy 
can easily be interrelated, but they cannot easily be made to divulge the extent 
to which change in their magnitudes is a result ot external forces. In addition, 
the treatment of each stx'iely as autonomous allows apparently meaningful 
comparisons over di.stance and time. 
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There is a feeling, at least in demographic circles, that the era when 
approaches of this type have been possible is drawing to a close. The Interna¬ 
tional Review Group of Social Science Research on Population and Develop¬ 
ment (IRG) Report, a forthcoming report of the International Union for the 
Scientific Study of Population (lUSSP), and other individuals and organiza¬ 
tions are all urging population researchers toward microstudies, an¬ 
thropological demography, and saturation in specific cultures. We are in the 
situation anthropology found itself in 60 years ago when Malinowski in 
Britain and Boas in the United States were trenchantly stating the nece.ssity 
for knowing the real world. Economic theor y has not crumbled through lack 
of such experience when it deals with the industrialized world only because 
most of its practitioners have known from a lifetime’s experience the basic 
nature of that world. First World specialists in the population and economy of 
the Third World have not found themselves in such a fortunate position. 

The first chapter takes the form of an excellent review and analysis by Peter 
Lindert of child costs from both contemporary and historical data. Yet today’s 
Third World seems almost as far off as eighteenth-century England, and 
inferences about the Third World seem aliout as ditect as today’s inferences 
about eighteenth-century England. 

The heart of the debate is provided by chaplet 4, an attempt by Paul 
.Schultz to apply the theories of the Chicago household economists to 
Taiwanese data, and chapter 5, a restatement by Rithartl F.asterlin, Robert 
Poliak, and Michael Wachter of the University of Pennsylvania’s tastes and 
preferences approach, this time written as ;i specific reaction to the hou.sehold 
economists. 

I he application of the Chicago school’s theories to developing countries 
has always been disconcerting because of its implication that the options for 
the u.se of time open to the middlc-cla.ss American woman bear some relation 
to the situation of village women in the Third World. We have never been 
able to detect much difference in the allocation of time in tire latter group 
between those who went to school and tho.se who did not (ignoring the tiny 
fraction with higher education). The reason .seems to he a cultural enforce¬ 
ment of the tasks which women must do and a sharing of child care between 
different members of the family and other relatives regardless of the mothers’ 
level of education, plus a fairly rough-and-ready raising of children that often 
means little time spent exclusively on each child and little emotional drain. 
The differences that can he detected seem to run in the diiection opposite 
that required by the Chicago .sch<M>l: A more educated mother can sometimes 
exert enough influence to increase the amount of lime she spends with hci 
children (and to insist that they are more exclusively her children) at the 
expense of more onerous work in the fields or in food processing. The 
application of the theories to Taiwan is particularly disconcerting for two 
reasons. The first is the now customary use of past education for current 
value of lime in a society where education probably has far greater direct 
impact on age at marriage and use of contraception (as is shown by Ronald 
Freedman in his comments) and a profound impart on family relations and 
hence the ability to profit from family members’ labor. It is high time that the 
Chicago school gave itself a basis for working in the Third World by en¬ 
couraging studies at the household level of real time options and the making 
of choices. Such studies of 20 households would be as worthwhile as the 
comparison of thousands with macroscopic data employing probably wholly 
inappropriate proxies. The second, one readily conceded by Schultz, is the 
fact that comparisons could not be made with population units smaller than 
provinces (which, as F'reedman pointed out, tend to be either city or noncity). 
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The Pennsylvania paper makes the point, which is universal knowledge in 
the Third World but is often ignored in the economic models of the First 
World so as to make them apparently universal, that before the onset of 
fertility transition there is very rarely any worry at all about the level of 
marital fertility (except when it is too low) and no greater attempts are made 
to restrict fertility at higher than at lower parity levels (a situation known 
somewhat unhappily as “natural fertility"). The age at marriage or the dura¬ 
tion of sexual abstinence has nothing to do with intended ultimate family 
size. Indeed, decision making about family size does not exist and therefore 
cannot be studied until the onset of fertility transition—a forbidding situation 
for economic demographers because it rules out the universality of most of 
their models and the ability of these models to explain not only the conditions 
of stable high fertility but also the situation immediately preceding (and 
crucial in determining the timing oO the beginning of transition. The 
Pennsylvania trio makes the point that tastes and preferences must exist, but 
it seems to regard them as having some kind of life of their own rather than 
having underlying determinants (this is almost possible if you can convince 
yourself that the wobbles in the Western fertility decline that produced the 
so-called baby boom of the 1950s, and the falling fertility of the 1970s, are 
important phenomena in the transformation of stable high-fertility societies 
into low-fertility ones). They make many gtxxl points in producing a model so 
general that it borders on being a philosophic discussion rather than a tight 
description of the conditions of change (and, as Warren Sanderson charges, 
“the model is consistent with almost any kind of behavior"). 

Allen Kelley, employing a fraction of other people's data collected for other 
purposes in urban Kenya, comes to some fascinating conclusions: The 
number of children does not influence the rate of household financial saving; 
“the income effects of an increase in education may in some instances out¬ 
weigh the price-substitution effects, resulting in an increase in family size with 
increa.sing education levels”; and (stating the obvious, although it is said to be 
a radical departure from the conventional wisdom) “contrary to the usual 
model where children are con.sidered only as consumer durables, there is 
some evidence that children may also serve as producer durables in the 
low-income setting" (p. 436). The basic weakness of the paper is that the 
determination of savings by subtracting vastly greater expenditure figures 
from income figures, in a .situation where neither is reliable, is wholly uncon¬ 
vincing. 

Yorain Ben-Porath, in a paper which can be related only loosely to the 
major theme of the book, economic development and social change in a 
continuing society, examines the impact of the migration of Jews from .Africa 
and Asia on their fertility and their replacement of dead children. His 
methods are ingenious, and he appears to show that "the income effects [in 
raising fertility] may precede in time the adjustment in life-style and aspira¬ 
tions that generates the fertility-depressing substitution effects" (p. 181). One 
problem is that more than life-style and aspirations changed: Economic 
realities both outside and inside the family dia (the youth, who would work 
uncomplainingly in the family business in Yemen, wanted to seek an outside 
job in Israel and to control the spending of his earnings), as did the mortality 
of children. The other is the fact that the separation of demographic and 
other events between the pre- and postmigration peritxis is so difficult that 
one is not quite convinced that it has been wholly successfully achieved. 

The volume does contain three fascinating macroscopic papers. One bv 
Simon Kuznets argues (in a way that is necessary but should not be) that the 
mortality decline in developing countries over the last 60 years has not only 
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been a major contribution to real living standards but has disproportionately 
assisted the least privileged and thus has reduced social inequality. He argues 
that mortality reduction is an indispensable prerequisite tor fertility decline 
and that we do not yet know the social impact of fertility decline in the Third 
World. He also shows that it is probable that the mortality declines in de¬ 
veloping countries were as steep between the two world wars as since World 
War 11. That is, they began when colonial administration was strong enough 
and sufficiently responsible to keep law and order and to apply the available 
health technology (as well as beginning the spread of schooling). It is a myth 
that the I.DCs had to wait for antibiotics and WHO programs. Indeed, there 
is support for the widespread belief in the Third World and the historic West 
that it was strong regimes that benefited populations and that such benefit 
came in the torm of prosperity and was evidenced by population growth 
arising from mortality decline. 

There is evidence also (although it is not spelled out this way) in an 
important paper by Ronald Lee which extends his previous work. Mortality in 
F.ngland apparently declined consistently through the long years of the 
Tudor peace. The central theme of the analysis, however, is the apparent 
demonstration that population growth in England was for 600 years, from 
1200 to 1800, controlled almost solely by changes in mortality levels (not 
themselves determined either economically or by variations in fertility) and 
that population growth so determined real wages that during the whole 
period a 1 percent movement in population gave rise to a 1.5 percent move¬ 
ment of wages in the opposite direction. The note of caution is .sounded 
because there is less certainty about mortality trends for the whole population 
than for the upper classes and because the argument is most powerfully 
buttressed by swings in population lietween 1250 and 1450 which are far 
from certain. 

Preston’s paper, referred to earlier, also analyzes the mortality decline in 
terms of changing patterns in the cause of death and bravely attempts to work 
out a lange of consequences of mortality decline. 

Finally, the inclusion of Michael FcKlaro’s paper on internal migration, 
clearly setting out his model and presenting a bibliogiaphical guide to other 
studies, does something to justify the broad title of the book, though it 
appeals somewhat anomalous in terms of the main thrust of the work. 

rhe NBF.R volume prc.sents, usually at one remove from the collection of 
data, much interesting material and a good deal of sophisticated analysis. 

John Cai nwt 11 . 

Auslraluin National University 


The Regulated Industries and the Economy. By Paul W. Mac.Avoy. 
New York: W. W. Norton Sc Co., 1979. Pp. 160. $11.95. 


The regulatory economics field has grown rapidly since the publication of 
Alfred E. Kahn's (1970) seminal text 11 years ago. Powerful new institutions 
have been created to regulate health, .salety, and envircrnmental problems, 
several of the older industry-management agencies have relaxed their regu¬ 
latory grip in rcspon.se to active or threatened legi.slative deregulation, and 
the economics profession’s understanding of the effects of regulation on the 
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economy has improved. Paul W. MacAvoy’s latest b<x)k. The Regulated Indus- 
trus and the Economy, evaluates both the success (or, more accurately, the 
failure) of the new social regulations and the increasing costs imptosed by 
industry-management agencies in this recent inflationary period, concluding 
that the quality and the quantity of goods and services prcxfuced in the 
regulated industries have declined due to regulation. 

The book aims at a general audience interested in problems of regulation 
but not necessarily well versed in economics. Its lack of coverage of many 
imprortant topics in regulatory economics makes it unsuitable for a main text 
in the area. As a re,searrh monograph, the MacAvoy book is too loosely 
constructed to support satisfactorily many of its hypotheses. I found the data 
presented to suggest strongly some fascinating hypotheses, but often compet¬ 
ing hypotheses were ignored, sufficiently refined daia-analysis techniques 
were not used, and some broad, only weakly supported hypxnheses were 
presented. 

The book focuses on three main topics; the effects of price and entry 
regulation on the economy; the eff<x;ts of health, safety, ancl environmental 
(social) regulation on the economy; and reform piopntsals. 1 found the evalu¬ 
ation of the impacts of price and entry regulation to be the most valuable 
section because its central hypothe.sis is important, as well as little recognized, 
and because that hypothesis is convincingly supported. 1 he main finding 
states that due to their backward-looking, cost-based nature, the regulatory 
lag and money illusion that exist under price and entry regulation keep price 
increases behind and below cost increases, which causes a severe problem in 
high-inflation periods such as the 1970s (and. more important, the 1980s). 
This price .squeeze pushes profit rates l>elow average returns, thereby slowing 
investment and eventually leading to production shortages and reduced 
quality of service. MacAvoy illustrates his thesis with a careful analysis of the 
natural gas shortages that were caused by FPC price regulation, and he then 
persuasively argues that the electric power, railroad, anef telephone industries 
may be headed along the same path. The sericnisncss of this result places 
reltirm of the regulatory process for these industries high on the political 
priority list. 

Although rc’asonable evidence* is presented to support most of the conclu¬ 
sions, many are ovei stated, withotit necessary quabbeations. and without 
exploration of leasonable alternative hypotheses, .■Xs only one example. 
MacAvoy cites rate lelief less than that requested by electric com|)ames as 
evidence that regulation had some price-recfucmg effect, when the discrep¬ 
ancy between the requested and the granted rate increases can also be 
explained by gaming between both parties. 

In contrast to his analysis of price and entry regulation, MacAvoy's analysis 
of the effects of scxial legulation is too simplistic. He studies three agencies. 
F.PA, NH rSA, and ()SH.\, finds each of them to impose high costs and no 
significant Ix-nehts, and then concludes that scxi.cl regulation is generally 
counterproductive. 

The consistency of the conclusion may he satisfying to some, hut it simply 
cannot lx.* supported. The literature does suggest that OSH.A and .\H rs.\ 
have provided lew benefits. However, the conclusion ot Mdc,\\ov’,s own 
study, that the EPA air programs have yielded no air-quality Irenefits, is 
difficult to accept, Ixith Itecause of questions absiut his methodology and 
because of its inconsistency with other studies. Even ii C)SH.\, NH FS.^, and 
the EPA air program did provide no benefits, it would not follow that other 
examples of social regulation provide benefits less than their costs or, worse 
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yet, provide no benefits at ail. Although imposing severe costs on the economy 
in terms of forgone benefits from consumer products, the CPSC has also 
provided some benefits, if only from the added safety features it has induced 
manufacturers to adopt "voluntarily" to avoid CPSC action. And few knowl¬ 
edgeable about EPA water programs would argue that tho.se programs have 
provided no significant improvements in water quality relative to what water 
quality would have been without the 1972 Water Pollution Control Act 
Amendments. 

Regulatory reform comprises MacAvoy’s third focus. He suggests reforms 
such as automatic rate increases (subject to later repayment if found to be 
exce.ssive), the use of future te.st years, a requirement of economic impart 
statements for all significant regulations, and deregulation where competi¬ 
tion, .service quality, and the economy dictate this solution. His analysis is 
quite biief. I would like to see a book which thoroughly analyzes each of these 
and other reform proposals, compares them to existing procedures, and 
evaluates all their ctmsequences to both indu.stry and consumers. 

Other than the problem of structuring a transition to competition in cur¬ 
rently regulated “competitive” industries that will gain as many of the benefits 
of deregulation as possible, the most difficult regulatory problems in the next 
decade will consist of the reform of social regulation. For example, how 
should we implement Charles Schultze’s (1977) proposal to replace 
command-and-control regulatitms with price incentives, such as marketable 
fx-rmits or effluent fees for environmental pollution? In rule making, how 
tan we speed up the prtKess while allowing open access to all interested 
parties and while guaranteeing that high-quality, unbiased scientific and 
etonomic analyses arc u.sed to support regulatory deci.sions? Given the ram¬ 
pant growth of regulation in the 1970s, can mechanisms like the existing 
Regulatory Cttuncii, the proptised regulatory budget, or the suggested legis¬ 
lative veto l>e expected to effectively coordinate many diverse and costly 
regulatory activities? T hese problems illustrate the need lor careful thinking 
by ecofiomi.sts about a variety of regulatory reforms that public policy must 
s<x>n address. 

Wksm- y A. Maoai' 

Duke Univer'iity 
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The Rational Peasant: The Political Economy of Rural Society in Eletnam. By 

Samvei. L, Popkin. 

Berkeley: University of California Press, 1979. Pp. 306+xxi. $14.95 (cloth); 

$4.95 (paper). 

The Origins oj English Individualism: The Family, Property and Social Transition. 

By At.AN MAr.FARl.ANE. 

New York; Cambridge University Press, 1979. Pp. 216+xv. $19.95 (cloth); 

$6.95 (paper). 

Perhaps there is yet an economist who thinks peasants are irrational and 
collectivist. It is difhcult to believe that such a benighted view of the coun¬ 
tryside could persist, but this is the age of wonders. In truth, it remains a 
commonplace ttf development planning and scientific analysis. The two books 
under review here are signs of a counterattack. 

('lear and important arguments can be described by their rhetorical dual as 
well as by their primal, by what they attack as well as by what they affirm. So 
here. As his title suggests, Samuel Popkin, a young political scientist at the 
University of California at San Diego, is attacking the World Bank view that 
pca.sants are idiots and the more sophisticated view, articulated recently by 
James C. Scott in The Moral Economy of the Peasant (1976), that peasants are 
rational but rationally communitarian. Popkin believes they are not com¬ 
munitarian in general and were not in particular over the past century in his 
country of study, Vietnam, He believes that the economic rationality of 
Vietnamese villagers in choo.sing crops was at one with their political rational¬ 
ity in choosing allies, both in the service of individual well-being. 

'['he evidence for his beliefs is persuasive. The first third of the book is a 
lotig es.say on ‘‘ The Pea.sant," not necessarily Vietnamese, marshaling the 
latest work by medievalists and anthropologists (except Macfarlane's, pub¬ 
lished too late) against the views of Karl Polanyi, George Dalton. Eric 
Hobsbawin, Eric Wolf, James Scott, Joel Migdal, and Cliffoid Geertz. The 
"moral economists" “assume that peasants are antimarket, [that they) prefer 
common property to private, . . . [and] that peasant welfare depends on the 
closed corpoi ate villages.... The fabric of the moral community is rent ... by 
the intrusion of the ‘cash nexus’” (pp. 5-6). The "political economy" view, as 
Popkin calls it, notes that the evidence for the moral economy view is weak. 
Peasants from 'rcpotzlan to Vilyatpur and from the 1280s to the 1970s, 
among other evidences of selfish and clearheaded maximizing, did not help 
each other to pay taxes (p. 41), demanded strict reciprocity in charity (p. 47), 
used gleaning rules to prevent misallocation of labor (not to help the poor [p. 
52|), expected to grow rich out of village offites (p. 60), rc.sisted innovation 
when it was rational to do so (p. 66), and engaged in exchanges of land or 
crops without social disaster (p. 68). The opening chapters constitute a bril- 
lant survey of the context within which economists measure the supply re¬ 
sponse of African farmers and the transformalion of traditional agriculture. 
A central planner who can read Popkin without some tw'inge of doubt that his 
bureaucracies, communes, and other substitutes firr voluntary exchange are 
gcxid for the pcxir (p. 29) is a hard man to persuade. 

The rest of the txK>k may do the trick. It is a narrative of Vietnamese 
political history down to 1946 emphasizing the impact of coloniali.sm on the 
politics of villages. The narrative, like the analyucal e.s.say, is skillfully done, 
although of course it is difficult in any case for an American over 25 10 yawn 
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at the names Hue, Tonkin, Cho Lon, and the like. The writing is modest, 
clear, and warm. So tar as can be judged from his surefcxitedness in the 
literature on the European peasantry, Popkin’s opinions in fields outside my 
competence are to be taken seriously. The index testifies to his heavy reliance 
on Blum and Hilton for Europe, and on Gourou, Gram, Ory, Osborne, 
Rambo, and Woodside for Vietnam. The book is generous to its enemies and 
almost embarra.s.sing in its praise for allies. The footnotes are filled with 
references to studies that are “wonderful," “excellent,” “must reading.” The 
burden of bringing a chaotic history to bear on wider issues is sometimes too 
great, leading to excessive repetition and .summarization. And the narrative 
portion can be faulted as ratber too broadly focused to make the case about 
the details of village politics the argument requires. But the book is loaded 
with good observing and good thinking. 

The major thought inspired by Popkin's history of Vietnam under the 
French is that the village was no pa.ssive victim of colonization and commer¬ 
cialization. On the contrary, village notables themselves called the new world 
into existent e to redress the balance in the old. The new colonial taxes, for 
instance, were (x:casions for the notables to consolidate their power over the 
rest of the villagers (p. 149). Before the French came, “villages were not 
egalitarian, leveling, welfare oriented, or neces.sarily harmonious” (p. 132). In 
consequence, the many antifeudal movements of the twentieth century, 
whether religious or communist, did not look back to a Confucian golden age: 
the Vietnamese themselves knew well the rapacity of rural life. It is the 
analysts of the peasant wars of the twentieth century, not the wariiors them¬ 
selves, who idealize the communities the peasants came ft om. 

Popkin criticizes too the other half of the idealized portrait of the death of 
community, namely, the vile and domineering colonialists who murdered it. 
The Haw, Popkin argues, is to attribute effectiveness in evil to the French 
administration. The French supported 10 limes as many bureaucrats per 
head of local population as did the British in India yet failed repeatedly in 
their schemes to manipulate the villagers. The story of colonialism is not one 
of cffeciive and evil govetnmenl, which needs only to be replaced by effective 
and good government to reestablLsh the communitarian values of the golden 
age. It is a story of ineffective government lost in a sea of selfish pea.sants. The 
stoiy should have a certain familiarity to Americans lately embroiled in 
Southea.si Asia. 

Popkin recognizes that the source of the mythology (jf the un.selfi.sli peasant 
is history, especially English history. So does Maclarlane, whose book is a 
more direct challenge to the mythology: “a great deal of modern India, 
Africa, or South America cannot Ize understood without .some comprehension 
of what happened on a small island with only a lew million inhabitants between 
the thirteenth and eighteenth centuries” (p. 7). 'Fhe dual of Macfarlane's 
interpretation of English history is the one traditional since Marx, that E-ng- 
land was like other economies long ago in having many “peasants.” The 
peasant is here not merely a countryman but the denizen of a society with 
production in the home, no markets in prtiducts, no markets in factors of 
prtxluction, no geographical mobility, great emphasis on kinship, inheritance 
by the family instead of the individual, ownership by the family instead of the 
individual, an all-pervasive family life in which women marry young and are 
subordinate to their husbands, and .sharp lines between this peasantry so 
described and the nobility and its state. In brief, the peasant is the man we all 
know and love from the works of Marx, Weber, Tawney, and their recent 
(collective) heirs—the same list that Popkin called the “moral economists." 
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The history that all educated people tarry about in their heads is “a progres¬ 
sion from small, isolated communities inhabited by ‘peasants' . , . towards the 
market, monetized, ‘open’ social structuie of the eighteenth century” (p. 54), 
The history, says Macfarlane, is wrong. England was in fact a nonpeasant 
society from earliest times for which we have records: there was no revolution 
of the sixteenth century. Like Popkin, he buttresses his own primary evidence 
(from two English villages that he himself is studying intensively) with a 
brilliant survey of work by others bearing on the point. No medievalist will 
find the facts reviewed novel, though he may be irritated by the single- 
mindedness with which they are deployed. Indeed, anyone who finds Mac- 
farlane’s (or Popkin's) theme entirely surprising has been relying for his view 
of English medieval society cither on the works of Maitland or Vinogradoff, 
brilliant in their day (ca. 1900) but still in the grip of nineteenlh-centuiy 
German scholarship and its romantic view of the brotherhtxtd of times past, 
or on textbooks by political historians who rely in turn on Maitland and 
Vinogradoff. 

As you and I would turn to markets, and the politital scientist Popkin turns 
to pt)litic's and power, it is natural that Macfarlane as an anthropologist turns 
to family iclation.s to test the aniicjuity of English individualism. He argues 
that the English family was at an early stage nuclear, formed rather late in life, 
oriented toward the individual, respectful of women’s rights, and un¬ 
confining. In partitulai, the individual, even women, inherited the family's 
land and could dispose of it at will. The central evidence is simply the records 
showing individuals dispiosing of land, in large amounts and with great 
freedom—as they did labor and produce. The notion that the early medieval 
(not to speak of late medieval) economy was a “natural economy,” as the 
Germans used to put it, is flatly mistaken. .S'lc tramtt all manner of hypothetical 
histories, from Marx's notion of precapitalist etonomic formations, necessar¬ 
ily unsubstantiated in an age before the professionalization of history, down 
to such newer abominations, which may not be so ea.sily paidoncd, as 
Polanyi's The Great Transforrruition (1944), Hitks's.-f Theory of Economic Hutory 
(1969), and Wallerstein’s The Modern World System (1974). 

Like Popkin, again, Macfarlane dex’s not prove his case beyond all doubt. 
He is as uncomfoi table with cfuanlitative ai guments as wei e the medieval men 
he studies and tries repeatedly to force the evidence to s.ty yes or no when it 
can only say more or less. The most troublesome feature of the book is its 
insistence throughout that individualism was a peculiarity of England. What 
might be interpreted as a device of rhetoric to silence the cries of “it’s- 
dilTerent-in-Oijon-and-liacca" becomes in the end a disfigurement .Mways it 
IS “in E.ngland" or “at least in relation to England" that one after another m\ th 
of pea.sant life debunked, until finally (p. 199) we aie told that by Adam 
Smith’s time "England had probably long been different from almost every 
other major agrarian civilization,” The evidence lor this key assertion has the 
same weakness that Macfarlane detects thioughout in others, as in the work 
ol George Homans: “ rhere appears to lx- a strong, self-confirming, and 
circular hypothesis. It is thought to be .self-evident that the rural inhabitants 
of Emgland [for present purpises read ‘of any particular communitv'] were 
basically like ‘jjeasants’ elsewhere. ... It is [then] justifiable to fill in the vast 
gaps ... by drawing information from peasants elsewhere. The picture of [the 
sixiety] which emerges then seems to show that there were reallv peasants” (p. 
191). 

Eor all his iccrntxlasni in the English case, then, Macfarlane retains a 
respectful attitude toward the notion that peasants elsewhere are a very txld 
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sort. The antidote is of course Popkin—Vietnam is most assuredly 
“elsewhere.” From the union of these two books little remains of the myth of 
the peasant, historical or modern. Popkin's brief survey ot the medieval 
evidence should be strengthened by Macfarlane; and Macfarlane’s reluctance 
to extend his evidence outside England should be overcome by Popkin. 

The books are not economics, though they talk much of economic matters. 
T he question arises whether an economist should break off the scrutiny of his 
out-garbage from the comp center long enough to peruse a book on historical 
political science and a book on historical anthropology. Yes, he should. His¬ 
torical mythologies live on in the premises of the economist who believes 
himself, whether proudly or humbly, to be a mere technician. To learn that 
“neither decline nor decay of peasant institutions is necessary for peasants to 
enter market.s" (Popkin, p. 267) or that ‘“homt) economicu.s' and market 
society Lwere] present in England for centuries” gives the economist new 
fields to conquei, and the conquered helds may in turn civilize his economics. 

Donaid N. McCt.OSKKY 

Ifnwf^sily of Iowa 
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This paper analyzes compensation schemes which pay according to 
an individual’s ordinal rank in an organization rather than his out¬ 
put level. When workers arc risk neutral, it is shown that wages based 
upon rank induce the same efficient allcKation of resources as an 
incentive reward scheme based on individual output levels. Under 
some circumstances, risk-averse workers actually prefer to be paid 
on ifie basis of ratik. In addition, if workeis are heterogeneous in 
ability, low-quality workers attempt to contaminate high-quality 
firms, resulting in adverse selection. However, if ability is known in 
advance, a competitive handicapping structure exists which allows all 
workers to compete efficiently in the same oiganization. 


I. Introduction 

It is a familiar propo.sition that under competitive conditions workers 
are paici the value of their marginal products. In this paper we show 
that competitive lotteries are often efficient and sometimes superior 
to more familiar compensation schemes. For example, the large 
salaries of executives may provide incentives for all individuals in the 
firm who, with hard labor, may win one t>f the coveted top positions. 

This paper addresses the relation between compensation and in¬ 
centives in the presence of costly monitoring of workers’ efforts and 
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output. A wide variety of incentive payment schemes are used in 
practice. Simple piece rates, which have been extensively analyzed 
(see, e.g., Cheung 1969; Stiglitz 1975; Mirrlees 1976), gear payment 
to output. We consider a rank-order payment scheme which has 
not been analyzed but which seems to be prevalent in many labor 
contracts. This scheme pays prizes to the winners and losers of labor 
market contests. The main difference between prizes and other in¬ 
centive schemes is that in a contest earnings depend on the rank order 
of contestants and not on “distance.” That is, salaries are not contin¬ 
gent upon the output level of a particular game, because prizes are 
fixed in advance. Performance incentives are set by attempts to win 
the contest. We argue that in many circumstances it is optimal to set 
up executive compensation along these lines and that certain puzzling 
features of that market are easily explained in these terms. 

Central to this discussion are the conditions under which mecha¬ 
nisms exist for monitoring productivity (Alchian and Demsetz 1972). 
If inexpensive and reliable monitors of effort are available, then the 
best compensation scheme is a periodic wage based on input. How¬ 
ever, when monitoring is difficult, so that workers can alter their 
input with less than perfect detection, input-wage schemes invite 
shirking. The situation often can be improved if compensation is 
related to a more easily measured output level. In general, input- 
based pay is preferable because it changes the risk borne by workers 
in a favorable way. But when monitoring costs are so high that moral 
hazard is a serious problem, the gain in efficiency from using output- 
based pay may outweigh the risk-sharing losses. Paying workers on 
the basis of rank order alters costs of measurement as well as the 
nature of the risk borne by workers. It is for these reasons that it is 
sometimes a superior way to bring about an efficient incentive struc¬ 
ture. 

In the development below we start with the simplest case of risk 
neutrality to illustrate the basic issues. Then the more general case of 
risk aversion is treated in Section III. Section IV considers issues of 
sorting and self-selection when workers are heterogeneous. 


II. Piece Rates and Tournaments with Risk Neutrality 

To keep things simple and to avoid sequential and dynamic aspects of 
the problem, we confine attention to a single period in all that follows. 
I'herefore, the reader should think of the incentive problem in terms 
of career development and lifetime productivity of workers. The 
worker’s (lifetime) output is a random variable whose distribution is 
controlled by the worker himself. In particular, the worker is allowed 
to control the mean of the distribution by investing in costly skills 
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prior to entering the market. However, a gjiven productivity realiza¬ 
tion also depends on a random factor which is beyond anyone’s 
control. Employers may observe output but cannot ascertain the ex¬ 
tent to which it is due to investment expenditure or to good fortune 
or to both, though workers know their input as well as output. Worker 
j produces lifetime output q j according to 

= + ( 1 ) 

where /Xj is the level of investment, a measure of skill or average 
output, chosen by the worker when young and prior to a realization of 
the random or luck component, €j. Average skill, /xj, is produced at 
cost C(/x), with C, C" > 0. The random variable €j is drawn out of a 
known distribution with zero mean and variance o-*.' Here e is lifetime 
luck such as life-persistent person-effects or an ability factor, which is 
revealed very slowly over the worker’s lifetime. The crucial assump¬ 
tion is that productivity risk is nondiversifiable by the worker himself. 
That is another reason for choo.sing a long period for the analysis. For 
example, if the period were very short and the random factor was 
independently distributed across periods, the woiker could diversify 
per period risk by repetition and a savings account to balance off 
good and bad years. Evidently a persistent person or ability effect 
cannot be so diversified when it is undiscoverable quickly, as appears 
true of managerial talent, for example. It is assumed, however, that e 
is i.i.d. across individuals, so that owners of firms can diversify risk 
either by pooling workers together in one firm or by holding a 
portfolio. 

To concentrate on incentive aspects of various contractual ar¬ 
rangements, we adopt the simplest technology for firms. Production 
requires only labor and is additively separable across workers. By 
virtue of the independence assumptions, managers act as expected 
value maximizers or as if they were risk neutral. Free entry and a 
competitive output market set the value of the product at V per unit. 
Again, these assumptions are adopted to illustrate basic issues in the 
simplest way. The analysis also applies when there are complemen¬ 
tarities among workers in production, which is more realistic but more 
difficult to exposit. 

Piece Rates 

The piece rate is very simple to analyze when workers are risk neutral. 
It involves paying the worker the value of his product. Let r be the 

‘ In this paper the worker has no choice over tr. This docs not affect the risk-neutral 
solution but does have an effect if workers are risk averse, since they tend to favor 
overly cautious strategies. Also, virtually all the results of this piaper hold true if the 
error structure is multiplicative rather than additive. 
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piece rate. Ignoring discounting, the worker’s net income isr^ — C(fi). 
Risk-neutraJ workers choose /x to maximize expected net return 

E[rq - C(/t)] = rfi - C{ix). 

The necessary condition is r = or the familiar requirement that 

investment equates marginal cost and return. On the other hand, the 
expected profit of a firm is 

E{Vq - rq) = (V - r)fjL, 

so free entry and competition for workers imply r = V. Consequently 

V =C\iL). 

The marginal cost of investment equals its social return, yielding the 
standard result that piece rates are efficient. 


Rank-Order Tournaments 

We shall consider two-player tournaments in which the rules of the 
game specify a fixed prize W, to the winner and a fixed prize W-i to the 
loser. All essential aspects of the problem readily generalize to any 
number of contestants. A worker’s production follows (1), and the 
winner of the contest is determined by the largest drawing of The 
contest is rank order because the margin of winning does not affect 
earnings. Contestants precommit their investments early in life, 
knowing the prizes and the rules of the game, but do not communi¬ 
cate with each other or collude. Notice that even though there are two 
players in a given match the market is competitive and not oligopolis¬ 
tic, because investment is precornmiited and a given player does not 
know who his opponent will be at the time all decisions are made. 
F.ach person plays against the “field.” 

We seek to determine the competitive prize structure (Wi.Wj). I'he 
method proceeds in two steps. First, the prizes IT, and W 2 are fixed 
arbitrarily and workers’ investment .strategies are analyzed. Given 
these strategies, we then find the pair (W,,W 2 ) that maximizes a 
worker’s expected utility, subject to a zero-profit constraint by firms. 
It will be seen that a worker’s incentives to invest increase with the 
spread between winning and lasing prizes, W, — ITj. Each wants to 
improve the probability of winning because the return to winning 
varies with the spread. The firm would always like to increase the 
spread, ceteris paribus, to induce greater investment and higher 
productivity, because its output and revenue are increased. But as 
contestants invest more, their costs also rise. That is what limits the 
spread in equilibrium: Firms offering too large a spread induce exces¬ 
sive investment. A competing firm can attract all of these workers by 
decreasing the spread because investment costs fall by more than 
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expected product, raising expected net earnings. Increasing mar¬ 
ginal cost of skill implies a unique equilibrium spread between the 
prizes that maximizes expected utility. 

More precisely, con.sider the contestant’s problem, assuming that 
both have the same costs of investment C(/x.), so that their behavior is 
identical. A contestant’s expected utility (wealth) is 

{P)[W, - C(^)J + (1 - P){W^ - C(/c)] = PW, + (1 - P)W^ - C(/i), 

(2) 

where P is the probability of winning. The probability thatj wins is 
P = prob (qj > q^) = prob (n) - fc* > c* - cj) 

. (3) 

= prob (fjij - /z* > f) = G(/Aj - ^*), 

where ^ = c* - Cj, ^ ~ g(f),G{ ) is the cdf of = 0, ^ndE(^) = 

2o’* (because Cj and are i.i.d.). Each player chooses p., to maximize 
(2). Assuming interior solutions, this implies 

(IT, - = 0 

, dfJL, 

and 

(W, - ^ 2 ) <0,1= j, k. (4) 

We adopt the Nash-Cournot assumptions that each player optimizes 
against the optimum investment of his opponent, since he plays 
against the market over which he has no influence. Therefore,/ takes 
ft*, as given in determining his investment and conversely for A. It then 
follows from (.S) that, for player/ 

dP/dnj = dG(M* - = g(fij - ft*), 

which upon substitution into (4) yields/’s reaction function 

(W, - IV2)g(ftj - ft*) - C'(ftj) = 0. (5) 

Player k's reaction function is symmetrical with (5). 

Symmetry implies that when the Nash solution exists, ftj = ft* andP 
= G (0) = Vi, so the outcome is purely random in equilibrium. Ex ante, 
each player affects his probability of winning by investing.* 


* However, it is not necessarily true that there is a solution because w’lih arbitrary 
density functions the objective function may not be concave in tbe relevant range. It is 
possible to show that a pure strategy solution exists provided that tr* is sufbciently large: 
Contests are feasible only when chance is a significant factor. This result accords vyith 
intuition and is in the spirit of the old saying that a (sufficient) difference of opinion is 
necessary for a horse race. Stated otherwise, since = g(fij — #**) andg(') is a pdf, 

#‘P/d*i5 ^ ~ ft») "i3y be posiuve, and fulfillment of sccond-oider conditions in (4) 

implies shaip breaks in the reaction function. If o’ is small enough the breaks tx:cur at 
very low levels of investment, and a Nash e<{uilibrium in pure .strategies vsnil not exist 
Existence of an equilibrium is assumed in all that follows. 
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Substituting at the Nash equilibrium, equation (5) reduces 

to 

C'itM) = iW^-W^)g(0). i=j,k, (6) 

verifying the point above that players’ investments depend on the 
spread between winning and losing prizes. Levels of the prizes only 
influence the decision to enter the game, which requires nonnegativ¬ 
ity of expected wealth. 

The risk-neutral firm’s realized gross receipts are + y*) • V, and 
its costs are the total prize money offered, IT, -I- ITj. Cdmpetition for 
labor bids up the purse to the point where expected total receipts 
equal costs IT, + IT^ = (p., -F /i*) • V. But since /*, = /z* = /x in 
equilibrium, the zero-profit condition reduces to 

F/x = (IT, + fT*)/2. (7) 

The expected value of product equals the expected prize in equilib¬ 
rium. Substitute (7) into the worker’s utility function (2). Noting that 
F = Vz in equilibrium, the worker’s expected utility at the optimum 
investment strategy is 

Fm - C(a*)- (8) 

The equilibrium prize structure selects IT, and ITj to maximize (8), or 

[F - C'(^)]{dnldWi) = 0, t = 1, 2. (9) 

The marginal cost of investment equals its marginal social return, F = 
C'(fji), in the tournament as well as the piece rate. Therefore, compet¬ 
itive tournaments, like piece rates, are efficient and both result in 
exactly the same allocation of resources. 

Some further manipulation of the equilibrium conditions yields an 
interesting interpretation in terms of the theory of agency (see Ross 
1973; Becker and Stigler 1974; Harris and Raviv 1978; and Lazear 
1979): 

IT, = F/x -I- C'(p.)/2g(0) = F/x + F/2g(0) 

IT, = F^ - C'(pi)/2g(0) = Vn- F/2g(0). 

The second equality follows from V = C'{n). Now think of the term 
C'(fjL)l2g(0) = F/2g(0) in(10)asan entrance fee orbondthatis posted 
by each player. The winning and losing prizes pay off the expected 
marginal value product plus or minus the entrance fee. That is, the 
players receive their expected product combined with a fair winner- 
take-all gamble over the total entrance fees or bonds. The appropriate 
social investment incentives are given by each contestant’s attempt to 
win the gamble. This contrasts with the main agency result, where the 
bond is returned to each worker after a satisfactory performance has 
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been observed. There the incentive mechanism works through the 
employee’s attempts to work hard enough to recoup his own bond. 
Here it works through the attempts to win the gamble. 

Comparadve statics for this problem all follow from (9) and (10) 
once a distribution is specified. For example, if € is normal with 
variance c*, theng(O) = ‘Acr's/n. It follows from (10) that the optimal 
spread varies directly with F and cr®. While several other interesting 
observations can be made of this sort, we note a somewhat different 
but important practical implication of this general scheme. Even 
though the optimal prize structure determines expected marginal 
product through its effect on worker choice of /x and the zero-profit 
condition (7) implies that expected prizes equal expected produc¬ 
tivity, nevertheless actual realized earnings definitely do not equal 
productivity in either an ex ante or ex post sense. Consider ex ante 
first. Since p,j = = /x, expected products are equal. Since W, > IV 2 

required to induce any investment, the payment thatj receives never 
equals the payment that k receives. It is impossible that the prize is 
equal to ex ante product, because ex ante products are equal. Nor do 
wages equal ex post products. Actual product is rather than Fp. 
Butq is a random variable, the value of which is not known until after 
the game is played, while Wt and W 2 are fixed in advance. Only under 
the rarest coincidence would W, = yqj and Wj = Fy*. 

Consider the salary structure for executives. It appears as though 
the salary of, say, the vice-president of a particular corporation is 
substantially below that of the president of the same corporation. Yet 
presidents are often chosen from the ranks of vice-presidents. On the 
day that a given individual is promoted from vice-president to presi¬ 
dent, his salary may triple. It is difficult to argue that his skills have 
tripled in that 1-day period, presenting difficulties for standard 
theory where supply factors should keep wages in those two occupa¬ 
tions approximately equal. It is not a puzzle, however, when inter¬ 
preted in the context of a prize. The president of a corporation is 
viewed as the winner of a contest in which he receives the higher 
prize, fVi- His wage is settled on not necessarily because it reflects his 
current productivity as president, but rather because it induces that 
individual and all other individuals to perform appropriately when 
they are in more junior positions. This interpretation suggests that 
presidents of large corporations do not necessarily earn high wages 
because they are more productive as presidents but because this 
pardcular type of payment structure makes them more productive 
over their entire working lives. A contest provides the proper incen¬ 
tives for skill acquisition prior to coming into the position.-’ 

’ Iff is a fixed effect, there is additional information from knowing the identity of 
winners and losers. The expected productivity of a winner is p, + £(*, (> q^), while 
that of a loser is p + E(t, [q, < q^). In a one-penod contest there is no possibility of 
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Comparisons 

Though tournaments and piece rates are substantially different in¬ 
stitutions for creating incentives, we have demonstrated the surpris¬ 
ing result that both achieve the Pareto optimal allocation of resources 
when workers are risk neutral. In fact other schemes also achieve this 
allocation. For example, instead of playing against an opponent, a 
worker might be compared with a fixed standard 'q, with one payment 
awarded if output falls anywhere below "q and another, higher, pay¬ 
ment awarded if output falls anywhere above standard. Attempting to 
beat the standard has the same incentive effects as attempting to beat 
another player. Using the same methods as above, it is not difficult to 
show that there are spread-standard combinations that induce Pareto 
optimum investments. Since all these schemes involve the same in¬ 
vestment policy, and since average payout by the firm equals average 
product for all of them, they all yield the same expected rewards and, 
therefore, the same expected utility to workers.'* 

In spite of the apparent equality of these schemes in terms of the 
preferences of risk-neutral workers, considerations of differential 
costs of information and measurement may serve to break these ties in 
practical situations. Ihe essential point follows from the theory of 
measurement (Stevens 1968) that a cardinal scale is based on an 
underlying ordering of objects or an ordinal scale. In that sen.se, an 
ordinal scale is “weaker” and has fewer requirements than a cardinal 
scale. If it is less costly to observe rank than an individual’s level of 
output, then tournaments dominate piece rates and standards. On 
the other hand, occupations for which output is easily ob.served save 
resources by using the piece rate or standard, or some combination, 
and avoid the necessity of making direct comparisons with others as 
the tournament requires. Salesmen, wht>se output level is easily ob¬ 
served, typically arc paid by piece rates, whereas corporate executives, 
whose output is more difficult to observe, engage in contests. 

In a modern, complex business organization, a person’s produc¬ 
tivity as chief executive officer is measured by his effect on the 
profitability of the whole enterpri.se. Yet the costs of measurement for 

taking ativ.iiiiage of this iitf'oiinalioii. However, in a sequential contest with no firm- 
specific capital, the information would Ire valuable and would constrain subsequent 
wage payments m sutce.ssive rounds through competition from other firms. It is not 
dilficult to show that this does not alfect the general nature of the Irond-gamble 
solution. .Alternatively, if the investment has firm-specific elements or firms adopt 
policies that bind wen kers to it (as iti Lazear 1979), these restrictions do not necessarily 
apply. 

^ Tht^ U*vt*l of ihc standaid is inclelerminaic. since for anyf/ a corresponding spread 
can be chosen to achieve the optima! investment. This is also true o! contests among 
more than two playeis. With N contestants, the prizes of TV - 2 of them are indetermi¬ 
nate. When nsic neutrality is dropped, the indeterminacy vanishes in both cases. 
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each conceivable candidate are prohibitively expensive. Instead, it 
might be said that those in the running are “tested” by asse.ssments of 
performance at lower positions. Realizations from such tests are sam¬ 
ple statistics in these assessments, in much the same way that grades 
are assigned in a college classroom and IQ .scores are determined. 
The point is that such tests are inherently ordinal in nature, even 
though the profitability of the enterprise is metered by a well-defined, 
cardinal ratio scale. It is in situations such as this that the conditions 
seem ripe for tournaments to be the dominant incentive contract 
institution. 

Notice in this connection that the basic prize and piece-rate struc¬ 
tures survive a broad class of revenue functions other than summable 
ones. Even if the production function of the firm includes compli¬ 
cated interactions involving complementarity or substitution among 
individual outputs, there exists the possibility of paying workers 
either on the basis of individual performance or by rank order. The 
revenue function itself can even involve rank-order considerations, 
and both possibilities still exist. For example, spectators at a horse race 
generally are interested in the speed of the winning horse and the 
closeness of the contest. I'hen the firm's (track) revenue function 
depends on the first few order statistics; yet the horses could be paid 
on the basis of their speed rather than on the basis of win, place, and 
show positions. Both methods would induce them to run fast.® 

There has been very little treatment of the problem of tournament 
prize structure and incentives in the literature. Little else but the 
well-known paper by Friedman (1953) based on Friedman-Savage 
preferences for lotteries exists in economics. In the statistics literature 
there is an early paper by Gallon (1902) that is worthy of brief 
discussion. Gabon inquired into the ratio of first- and second-place 
prize money in a race of n contestants, assuming the prizes were 
divided in the following ratio: 

wjw^ = « 2 , - - (h)- 

Here Qi is the expected value of the first- (fastest) order statistic, etc. 
While a moment's reflection suggests this criterion to be roughly re¬ 
lated to marginal productivity. Gallon proposed it on strictly a priori 
grounds. He went on to show the remarkable result that the ratio above 


® The reader is rcniindect that throughout this .seruon and the next workers arc 
identical a priori and dillcr only ex po,st through the rcaluauon of e. In the real world, 
where there is population heterogeneity, market parucipants arc soiled into dirferent 
contests. There players (and horses, for that matter) who arc known to he of higher 
quality ex ante may play in games with higher stakes. If it can be accomplished, the 
sorting is by anticipated marginal products. In that sense, pay tliflerences among 
contestants of known quality resemble the effect of a "piece rate.” These is.sues are 
more thoroughly discussed Itelow. 
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is approximately 3 when the parent distribution of speed is normal. 
Hence, this criterion results in a highly skewed prize structure. From 
what we know today about the characteristic skew of extreme value 
distributions, a skewed reward structure based on order statistics is 
less surprising for virtually any parent distribution. In the more 
modern statistical literature, the method of paired comparisons has 
tournament-like features. Samples from different populations are 
compared pairwise, and the object is to choose the one with the 
largest mean. Comparing all samples to each other is like a round- 
robin tournament. An alternative design is a knockout tournament 
with single or double elimination. The latter requires fewer samples 
and is therefore cheaper, but does not generate as much information 
as the round robin (David 1963; Gibbons, Olkin, and Sobel 1977). 

Gallon’s original work and the more modern developments it has 
given rise to are not helpful to us; they deal with samples from fixed 
populations, so the reward structure is irrelevant for resource alloca¬ 
tion. The problem we have treated here is that of choosing the reward 
structure to provide the proper incentive and elicit the socially proper 
distributions. 

III. Optimal Compensation with Risk Aversion 

All compensation systems can be viewed as schemes which transform 
the distribution of productivity to a distribution of earnings. A piece 
rate is a linear transformation of output, so the distribution of income 
is the same apart from a change in location and scale. A tournament is 
a highly nonlinear transformation: It converts the continuous dis¬ 
tribution of productivity into a discrete, binomial distribution of in¬ 
come. When workers are risk neutral, both schemes yield identical 
investments and expected utility because their first moments are the 
same. In this section, it is shown that with risk aversion one method or 
the other usually yields higher expected utility, because the interac¬ 
tion between insurance and action implies substantially different first 
and second moments of the income distribution in the two cases.® 

We have been unable to completely characterize the conditions 
under which piece rates dominate rank-order tournaments and vice 
versa, but we show some examples here. Truncation offered by prizes 
implies more control of extreme values than piece rates but less 
control of the middle of the distribudon. Different utility functions 


' One might think that risks could be pooled among groups of workers through 
sharing agreements, but that is false because of moral hazard. A worker would never 
agree to share prizes since doing so would result in ^ = 0, and consequendy £ (f / + q^) 
= 0 and bankruptcy for the firm. As a result, firms offering tournaments or piece rates 
in the pure sense yield higher expected utility than the sharing arrangement. 
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weight one aspect more than the other so that tournaments can 
actually dominate piece rates. 


Optimum Linear Piece Rate’’ 

The piece-rate scheme analyzed pays workers a guarantee, /, plus an 
incendve, rq, where r is the piece rate per unit of output. The problem 
for the firm is to pick an r, I combination that maximizes workers’ 
expected utility 

max f£(f/) = max/f/(y)0()i)dy], (11) 


where 


y + rq - Cifi) 

= I + Tfi, + re — C(ix) 


( 12 ) 


and 0(y) is the pdf of y. 

The worker’s problem is to choose /x to maximize expected utility 
given I and r. If e ~ f(f), the worker’s problem is 

max £(!/) = / U[I + rfi + re — C{fj.)]f(e)de. 

The first-order condidon is 


= /[C/'(y)][r - C’(fime)de = 0, 

which conveniently factors so that 

r = CV)- (13) 

Condition (13) is identical to the risk-neutral case, because e is inde¬ 
pendent of investment effort, p,. 

Assuming risk-neutral employers, Kp. is expected revenue from a 
worker and / -I- rp is expected wage payments. Therefore, the zero- 
profit market constraint is 

Fp=/-t-rp. (14) 

Solving (14) for I and substituting into (12), the optimum contract 
maximizes 

/ U{Vfi{r) +re - Clp(r)]}/(e)d6 

with respect to r, where p = p(r) sadsfies (13). After simplification the 


' The following is similar to a problem analyzed by Stiglitz (1975). A linear piece-rate 
structure is a simpliflcation. A more general structure would allow for nonlinear piece 
rates (see Miirlees 1976). 
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[V - + E€U'= 0. (15) 

Since risk aversion implies EeU' < 0, (15) shows that V > C'(fi) in the 
optimum contract for risk-averse workers. This underinvestment is 
the moral hazard resulting from insurance / > 0 and r < V implied by 
(15). 

Using familiar Taylor series approximations to the utility function 
and a normal density for e, the optimum is approximated by 

and 


(1 -t- ’ 


(17) 


where s = —W'lU' evaluated at mean income is the measure of abso¬ 
lute risk aversion. Investment increases (see [16J) in V and decreases 
in s, C", and because all these changes imply similar changes in the 
marginal piece rate r which influences investment through condition 
(13). I he same changes in F, .?, and C" have corresponding effects on 
the variance of income (see [17J), but an increa.se in cr* actually 
reduces variance, if cr* is large, because it reduces r and increases/.® 


Optimum Prize Structure 

The worker’s expected utility in a two-player game is 

E(U) = P{U{W, - C(M*)1} + (1 - P){U[W^ - C(/c*)]>, (18) 

where * denotes the outcome of the contest rather than the piece-rate 
scheme. The optimum prize structure is the .solution to 

max {E ([/*) = max {P ■ //[IT, - C(/j,*)] + (1 - P) • //[IFj - C(^i*)]}) 

(19) 

subject to the zero-profit constraint 

F/z* =PfV, + (1 - P)fV2. (20) 

The worker selects p,* to satisfy d/^(C/)/5p.* = 0. Since cost functions 
are the same and Cj and are i.i.d., the Nash solution implies juj = fz/c 


* Furthermore, r = r/( 1 + .vCV*) and / = sV^o^/{ 1 + rCV*)*, so that r = V and/ = 0 in 
the case of risk neutrality (s = 0). All these approximations use first-order expansions 
for term.s in and second-order expansions for terms in V{’). Fhe same is true of 

the approximations below for the tournament. 
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and P = V 2 as before. Then the worker’s investment behavior sim¬ 
plifies to 


= 2[f/(l) - f/(2)k(0) 


U'(\) + U'(2) 


( 21 ) 


where U (t) = U\Wj — C(/;x*)| and U'(t) = U'\Wr — C(/i*)] for t =1,2. 
Equation (21) implies 


fj.* = fi*(W„W2), ( 22 ) 

and the optimum contract maximizes 

E(U*) = '/2 U[Wi - C{fjL*)] + '/2 U[W., - C(jLt*)] (23) 

subject to (20), with P = 'A, and (22). Increasing marginal cost of 
investment and risk aversion guarantees a unique maximum to (23) 
when a Nash solution exists. Again, assuming a normal density for e, 
second-order approximations yield 


'(l + sCVtt ) 


and 


where 


2 

“ (1 + nsC'a^y’ 




= IE, - C(/i.*) if^j > 

= W,- Cin*) ifqj < q. 


(24) 


(25) 


and €j ~ N(0,cr^),e* ~ A^(O.fr^), and cov (ej,e*) = 0.” The comparative 
statics of (24) and (25) are similar to the piece rate (16) and (17) and 
need not be repeated. 


Comparisons 

Equations (16) and (24) indicate that investment and expected in¬ 
come*® are lower for the contest than for the piece rate at given values 
of 1 '. Moreover, for values of cr* in excess of MsC'^/tt, the variance of 
income in the tournament is smaller than for the piece rate. This 
would seem to suggest that contests provide a crude form of insur¬ 
ance when the variance of chance is large enough, but the problem is 
significantly more complicated than that because there is no separa¬ 
tion between tastes and opportunities in this problem: The optimum 


• Kuihermore, C'(p,*) = g'COKW, - W,), so the spread U still crucial for investment 
incentives, as in the risk-neutral case. 

Since 31 - Vfi - and since /a is below the wealth-maximizing level of fi when 

workers arc risk averse, lower n implies lower y because revenue falls by more than cost. 
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mean and variance themselves depend on utility-function parameters. 
Thus, for example, for the constant, absolute risk-aversion utility 
function U = —the insurance provided by the contest is in¬ 
sufficient to compensate for its smaller mean: It can be shown that the 
expected indirect utility of the opumal piece rate exceeds that of the 
optimal tournament for all values of <t*, at least with normal distribu¬ 
tions and quadratic investment-cost functions. However, when there 
is declining absolute risk aversion, we have examples where the con¬ 
test dominates the piece rate. 

Illustrative calculations are shown in table 1 using the utility func¬ 
tion U = ay", which exhibits constant relative but declining absolute 
risk aversion, s(y) = (1 — a)/y. Again quadratic costs and normal errors 
are assumed. However, this utility function is defined for positive 
incomes only, so an amount of nonlabor income yo is assigned to the 
worker to avoid a major approximation error of the normal, which 
admits negative incomes (i.e., the possibility of losses). 

Table 1 shows that when yo = 100 so that s = .005, the contest is 
preferred until o-’ ^ 3. However, if yo = 25 so that s = .020, the 
contest is only preferred for <r^ < .2. The intuition is that piece rates 
concentrate the mass of the income distribution near the mean, while 
contests place 50 percent of the weight at one value significantly below 
the mean and the other value significantly above. Strongly risk-averse 
workers seem to dislike the binomial nature of this distribution when 
cr* is high because it concentrates too much of the mass at low levels of 
utility. However, when cr* is small, the contest which truncates the tails 


TABLE 1 

(Constant Relative Risk Aversion 


<T* 


I** 


E(V) 

E{U*) 



y„ = 100. v(yo) = 

.005 



1 

.9995 

.9984 


5.012155 

5.012465 

,5 

9975 

9922 


5.012150 

5.012445 

1 

.9950 

.9846 


5.012100 

5.012295 

3 

.9852 

.9552 


5.01 1940 

5.011925 

6 

9710 

.9142 


5.011800 

5.011415 

12 

.9436 

.8420 


5.011420 

5.010515 



y„ = 25; s(y„) = 

.020 



.1 

.9980 

.9938 


2.524665 

2.524725 

.2 

.9960 

.9878 


2.524616 

2.524575 

1 

.9807 

.9419 


2.524237 

2.523437 

12 

.8094 

.5741 


2.519930 

2.514282 

Ntnc —U =* ay“iy 

“»• + / + W 

- C(f») lor piece rale; 7 =7* + M'l “ C^*.) for conteM (1 ** 1, 2); a — 

5,F= 


* <t* 
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of the income distribution associated with a linear piece rate has 
higher value. 

Income Distributions 

While it is not possible to make a general argument based on an 
example, table 1 suggests that persons with more endowed income 
and smaller absolute risk aversion are more likely to prefer contests, 
and those with low levels of endowed wealth and larger absolute risk 
aversion are more likely to prefer piece rates. Consider a situation in 
which all persons have the same utility function, such as the one in 
table 1, and face the same costs and luck distribution, the only dif¬ 
ference being the fact that some workers have larger endowed in¬ 
comes than others. If this difference is large enough, it can be optimal 
to pay piece rates to those with small values of endowed income and to 
pay prizes to those with large values. Individuals will self-select the 
payment scheme in accordance with their wealth. The distribution of 
earnings among those selecting the piece-rate jobs is normal with 
mean Vp. and variance r*o-*. It is binomial with mean Vft* and vari¬ 
ance (AWy/4 for those who enter tournaments. Note that ft and ft* 
depend upon s(y), which is smaller for workers who select contests, 
and it can turn out as it does in table I that expected income is larger 
in the contest than in the piece rate; for example, if tr* = 1 then the 
rich prefer contests (5.012295 > 5.012100) and the poor prefer piece 
rates (2.524237 > 2.523437), but/x* = .9846 exceeds ft = .9807. This 
situation is shown in figure 1. 

The overall distribution is the sum of a binomial and a normal with 
lower mean, weighted by the number of individuals in each occupa¬ 
tion (see fig. 1). It is positively skewed because Vft* > Vft. Note also 
that the distribution of wage income will be less skewed than that of 
total income. The reason is that and mean-wage income are posi¬ 
tively correlated because the likelihood of choosing a contest increases 
with yo- These implications conform to the standard findings on the 
distribution of income in an economy. 

This example is interesting because it is very closely related to some 
early results of Friedman (1953), who studied how alternative social 
arrangements can produce income distributions that cater to workers’ 
risk preferences. He showed that the Friedman-Savage utility func¬ 
tion leads to a two-class distribution. Persons in the risk-averse region 
are assigned to occupations in which income follows productivity, 
while persons in the risk-preferring region buy lottery tickets in very 
risky occupations in which few win very large prizes. The overall 
distribution is the sum of these two and exhibits characteristic skew. 
The Friedman-Savage utility function implies that a person’s risk 
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t(y) 



preferences depend on the pan of his wealth that is not at risk. 
I herefbre, Friedman’s assignment of people to jobs really follows 
endowed wealth (yo)>Jusi as in our example. However, our framework 
offers two improvements. First, the problem of incentives is directly 
incorporated into the formulation of the optimum policy. Second, 
workers in this model are risk averse for all values of incomes, but 
even so gambles can be the optimal policy. 

Erior Structure 

Relative costs of measurement are still important in choosing among 
incentive schemes, but the error structure plays additional roles when 
workers are risk averse. Suppose the output estimator for worker i in 
activity t is qtj = qi^ ■¥ p, + where v,r is random error and is an 
error that is specific to activity t but common to all workers within that 
activity. In the piece rate the common error p adds noise which 
risk-averse workers dislike, while the common noise drops out of a 
rank-order comparison because it affects both contestants similarly. 
I’hat is, the relevant variance for the contest is 2(tI, while that for the 
piece rate is crl -)- o-J. It is evident that this can tip the balance in favor 
of tournaments if cr'f, is large enough and/or workers are sufficiently 
risk aver.se. 

The common error p bears two interesting interpretations. One is 
activity-specific measurement error. For example, j and k may have 
the same supervisor whose biased as,sessments affect all workers simi¬ 
larly. This is similar to monitoring all workers by a mechanical count- 



rank-order tournaments 


857 

ing device that might run too fast or too slow in any given trial. Fhe 
other interpretation of p is true random variation that affects the 
enterprise as a whole. For example, suppose all firms produce with 
the same technology, but that in a given period some firms do better 
or worse than others. Then risk-averse workers prefer not to have 
their incomes vary with conditions facing the firm as a whole, and 
wages based on a contest eliminate this kind of variation. Without its 
elimination there would be excessive losses due to moral hazard. 

It must be pointed out that, in the absence of measurement error, 
using a contest against a fixed standard^ discussed above has lower 
variance than playing against an opponent. As shown in Section II, 
the relevant variance in a contest is that of f = e* — €j, which has 
variance 2o-“ against an opponent and only cr^ against a standard 
(since the standard is invariant, = 0). Consequently, we might 
expect risk-averse workers to prefer absolute standards." Again, 
however, the crucial issue is the costs of measurement and the error 
structure. For the complex attributes required for managerial posi¬ 
tions, it is dif ficult to observe output and therefore difficult to com¬ 
pare to an absolute standard. Insofar as samples and tests are neces¬ 
sary, it bears repeating that these are inherently ordinal in nature. But 
this leads us back to the problem of common error, where it is often 
impossible to know whether a person’s output is satisfactory without 
comparisons to other persons. Further, when there are changing 
production circumstances in the firm as a whole, it is difficult to know 
whether the person failed to meet the standard because of insufficient 
investment or because the firm was generally experiencing bad limes, 
a problem of measuring “value added.” Risk-averse workers increase 
utility by competing against an opponent and eliminating this kind of 
firm effect. 


IV. Heterogeneous Contestants 

Workers are not sprinkled randomly among firms but ratlier seem to 
be sorted by ability levels. One explanation for this has to do with 
complementarities in production. But even in the absence of com¬ 
plementarities, sorting may be an integral part of optimal labor- 
contract arrangements. Informational considerations imply that 


" Playing against a standard is like Mirrlees’s (1976) notion ol an "instruction " It is 
clear tliat using standards as well as piece rates micst l)e superior 10 u.sing one alone. That 
scheme would allow workers to lie paid / il ^ ^ and /o + rq tor ij ^ ij. 1 his is important 

because it truncates the possibilities when Vq < 0. (liven ihe technology, it is possible 
that very large negative values of output can occur, and since it is impiossible to always 
tax workers the full extent of this loss, some form of truncauon is desirable. contest is 
an alternative way to control the tails of this dtstnbuiioii. 
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compensation methods may affect the allocation of worker types to 
firms. Therefore, this section returns to the case of risk neutrality and 
analyzes tournament structures when investment costs differ among 
persons. Two types of persons are assumed, a’s and b’s, with marginal 
costs of the a’s being smaller than those of the b’s: Cg^fjL) < Ci(/x) for all 
/j,. The distribution of disturbances/(«) is assumed to be the same for 
both groups. Many of the following results continue to hold, with 
usually obvious modification of the arguments, if the a’s and b's draw 
from different distributions. The following section addresses the 
question of self-selection when workers know their identities but firms 
do not. The next section discusses handicapping schemes when all 
cost-function differences can be observed by all parties. 


Adi>frse Selection 

Suppose that each person knows to which class he belongs but that 
this inCoimation is not available to anyone else. The principal result is 
that the a's and b's do not self-sort into their own “leagues.” Instead, 
all workers prefer to work in firms with the best workers (the major 
leagues). Furthermore, there is no pure price-rationing mechanism 
that induces Pareto optimal self-selection. But mixed play is 
inefficient because it cannot sustain the proper investment strategies. 
Therefore, tournament structures naturally require credentials and 
other nonprice signals to differentiate people and assign them to the 
appropriate contest. Firms select their employees based on such in¬ 
formation as past performances, and some are not permitted to com¬ 
pete. 

The proof of adverse selection consists of two parts. First we show 
that players do not self-sort into a leagues and b leagues. Second, we 
show that the resulting mixed leagues are inefficient. 

1. Players do not self-sort. —Assume leagues are separated and con¬ 
sider the expected revenue generated by playing in league i = a,b 
with an arbitrary investment level fjc. Then 

Rdp) = Wi 4- (IV, - W\)P\ i=a,b, (26) 

where (ffj.fPj) is the prize money, andP* is the probability of winning 
in league i. Recall that P* depends on the individual’s level of invest¬ 
ment and that of his rivals. Therefore, F*" = G{jjb — (jl^) and P* = G(//. 
— p,?), where pj is the existing players’ investments in the a league, 
where V — Ca(pJ), and similarly for p?. Recalling from (6) and (9) that 
W[ — W 2 = y/g{0) and from (10) that = Vfjtf — T/2g(0), equation 
(26) becomes 

P,(p) = Fpf - - G(p - pf)]. 


(27) 
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Note that Rti/x) = Vfxf when fi = fji^ and that dRjdfi « RK/m) = 

~ fj^)/g(0) > 0. Since Ca(fi*) = K and Cti/xg) = V, then /u,f < /i? so that 
Rilp'i) >Rt(.fiS). Furthermore,- (/Lt? - Therefore, 

Rb^fx) is a pure displacement of R„(fi). Since R't(fx) = F for fi = fxf and 
Riifx) < V elsewhere, and since R,(fi) is increasing, the revenue func¬ 
tions never cross. So Rb(fx) lies to the southwest of Ra(fx) (see fig. 2). 
Therefore, independent of cost curves, it is always better to play in the 
a league than the b league: Workers will not self-select. 

2. Mixed contests are inefficient. —Supjxjse the proportions of a's and 
ft’s in the population are a and (1 — a), respectively. If pairings among 
a’s and ft’s are random, then expected utility of a player of type i is 

PFj -t- [aPj. + (1 - a)n](W, - W^) - C,(^d. 


where (IT,, 1 ^ 2 ) is the prize money in mixed play and P/ is the proba¬ 
bility that a player of type i defeats a player of type^. The first-order 
condition for investment of type i in this game is 


a 


dP‘a 

dfii 


+ (!-«) 


an 

dni 


• (W, - W,) = c;(/i,). 


A development similar to Section II implies equilibrium reaction 
functions 


[ag(0) + (1 - a)g(iZa -flb)m, - IV^) = 





Fig. 2 
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-/A„) + (1 - a)g-(0)](W^, - W 2 ) = 

for h's. If the solution is efficient, then = V = C'a^iXa), which 

implies 

«ff(0) + (1 - - fit,) = agiiih -M„) + (1 - Q:)g-(0). 

Sinceis symmetric and nonuniform, this condition can hold only if a 
= '/j. Therefore, except in that very special case, mixed contests yield 
inefficient investment: One type of player overinvests and the other 
underinvests depending upon whether or not a S 'A. 

We conclude that a pure price system cannot sustain an efficient 
competitive equilibrium in the presence of population heterogeneity 
with asymmetric information. Markets can be separated, but only at a 
cost. Consider, for example, the case where a’s want to prevent //s 
from contaminating their league. By making the spread, W1 — 
sufficiently large, /?a(/4) becomes steeper than >'i figure 2 and 

crosses it so that the envelope covers at low values of fi and 

Ra(f^) at high values. Then, for some high levels of fi, it is more 
profitable to play in the a league and, for low levels of fi, the l> league 
is prcfeiabic. Individuals may self-sort, but the cost is that w’s overin¬ 
vest. I he result is akin to that of Akerlof (1976) and to those of 
Spence (1973), Riley (1975), Rothschild and Stiglit/ (1976), and Wil¬ 
son (1977). As they show, a separating equilibrium need not exist, 
but, even if it does, that ecjuilibrium may be inferior to a nonseparat¬ 
ing equilibrium. 

The obvious practical resolution of these difficulties is the use of 
nonprice rationing and ceitification to sort people into the appro¬ 
priate leagues based on past performance. Similarly, firms use non¬ 
price factors to allocate jobs among applicants. The rules for allocat¬ 
ing those jobs may be important for at least two reasons that we can 
only briefly describe here. 

First, .sorting workers of different skill levels into appropriate posi¬ 
tions within a hierarchy may be beneficial. In this paper, production is 
additive, so it does not matter who works with whom. I’o the extent 
that the production technology is somewhat more complicated, sort¬ 
ing may well be crucial. A series of pairwise, sequential contests may 
efficiently perform that function. Suppose that q,, = fit + Si + t)(„ 
where 8j is an unobserved ability component for player i and rj is white 
noise. Suppose it is ef ficient for the individual with the highest 8 to be 
the chief executive. There will be a tendency to have winners play 
winners because 


>?*,) > t'(8fc|9j, > qki) 
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in the first round. A sequential elimination tournament may be a 
cost-efficient way to select the best person. 

Second, workers may not know precisely their own abilities or cost 
functions. A worker who is ignorant about his cost function values 
information before selecting a level of investment expenditure. 
Therefore, firms may offer “tryouts” to provide information about 
optimal investment strategies. In fact, one tan imagine the existence 
of firms which specialize in running contests among young 
workers—the minor leagues—which provide information to be used 
when and if the workers opt to increase the slakes and enter a bigger 
league. 

These issues point up an important dif ference between piece rates 
and contests. In ihe pure heterogeneous case, where information is 
asymmetric and workers are risk neutral, a piece rate always yields an 
efficient solution, namely, V = However, once slot¬ 

ting of workers is important because of complementarities in produc¬ 
tion, or if it is desirable for workers to gain information about their 
type, it is no longer obvious that a series of sequential contests does 
not result in a superior allocation of resources. 

Handicap Systems 

This section moves to the opposite extreme of the previous discussion 
and assumes that the identities of each type of player are known to 
everyone. Competitive handicaps yield efficient mixed contests. 

Consider again two types a and b now known to everyone. Prize 
structures in a-a and h-b tournaments satisfying (II) and (12) are 
efficient, but those conditions are not optimal in mixed a-h play. 
Denote the socially optimal levels of investment by and fj.g, their 
difference by A#/., and the prizes in a mixed league by Wi and IV 2 - Let 
h be the handicap awarded to the inferior player b. Then the Nash 
solution in the a-b tournament satisfies 

g(jLi„ - Ms - h)AW = C;(m„) (28) 

and 

g(M« - Ms ~ A)AIT = CftfMs)- 

(The second condition in [28] follows from symmetry of g[^].) Since 
the efficient investment criterion is K = C^Im?) = C6(Mf)' independent 
of pairings, the optimum spread in a mixed match must be 

AIT = P/g(AM " /*)- (29) 

From (28), condition (29) insures the proper investments by both 
contestants. The spread is larger in mixed than pure contests unless a 
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gives b the full handicap h = fj* — Otherwise, the appropriate 
spread is a decreasing function of h. Prizes Wi and W 2 must also satisfy 
the zero-profit constraint IP, + 1^2 = F • (/z^' + /xf) independent of A 
since the spread is always adjusted to induce investments /z? and fig. 

The gain to an a from playing a b with handicap h, rather than 
another a with no handicap, is the difference in expected prizes: 

y„(h) = PIP, + (1 - P)W 2 - Caifig) - [(IP-J + IP3)/2 - C„(Ai?)l (30) 

= PIP, -t- (1 - PWi - (IP? + lPg)/2, 

where ya(h) is the gain to a andP = G(Afi — h) is the probability that a 
wins the mixed match. The corresponding expression for b is 

y^{h) = (1 - P)IP, + PW 2 - (IP? + W%)I2. (31) 

The zero-profit constraints in a-a, a-b, and b-b require that yaih) + 
yt,(h) = 0 for all admissible h. The gain of playing mixed matches to a 
is completely offset by the loss to b and vice versa. 

If C„(fi) js not greatly different from C(,(/z), then A/z ~ fi* - fig is 
small and P = V 2 + [g(Afi — h)](Afi — h). This approximation and 
the zero-profit constraint reduce (30) to 

ya{h) = y ■ - h^. (32) 

The expression for ^^(A) is the same, except its sign is reversed, so the 
gain tort decreases in h, and the gain to b increases in h. Therefore, h* 
= Afi/2 is the competitive handicap, since it implies ya{h*) — yt,(h*) = 
0. If the actual handicap is less than k*, then y„ is positive and a's 
prefer to play in mixed contests rather than with their own type, while 
b’s prefer to play with b's only. The opposite is true if /i > h*. 

A two-player game is said to be fair when the players are handi¬ 
capped to equalize the medians. The competitive handicap does not 
lesult in a fair game, since h* = Afi/2 < Afi. The a’s are given a 
competitive edge in equilibrium, becau.se they contribute more to total 
output in mixed matches than the b’s do. This same result holds if €„ 
has a dif ferent variance than €*, but it may be sensitive to the assump¬ 
tion of statistical independence and output additivity. 

Alternatively, h can be constrained to be zero. In this case, different 
wage schedules would clear the market. Since y„(0) = -y6(0) = j8, 
paying fk, — and W 2 — P to a’s, while paying + (3 to b's, 

leaves the spread and, therefore, the investments unaltered. It is easy 
to verify that a’s and b’s are still indifferent between mixed and pure 
contests, because expected returns are equal between segregated and 
integrated contests for each type of player. With no handicaps, the 
market-clearing prizes available to a’s in the mixed contest are lower 
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than those faced by b's. Still, expected wages are higher fora’s than b's 
in the mixed contest, because their probability of winning is larger. 
The b's are given a superior schedule in the mixed contest as an 
equalizing difference for having to compete against superior oppo¬ 
nents. This yields the surprising conclusion that reverse discrimina¬ 
tion, where the less able are given a head start or rewarded more 
lucratively if they happen to accomplish the unlikely and win the 
contest, can be consistent with efficient incentive mechanisms and 
might be observed in a competitive labor market. 


V. Summary and Conclusions 

This paper analyzes an alternative to compensation based on the level 
of individual output. Under certain conditions, a scheme which re¬ 
wards rank yields an allocation of resources identical to that gener¬ 
ated by the efficient piece rate. Compensating workers on the basis of 
their relative position in the firm can produce the same incentive 
structure for risk-neutral workers as does the optimal piece rate. It 
might be less costly, however, to observe relative position than to 
measure the level of each worker’s output directly. This results in 
paying salaries which resemble prizes: wages which differ from 
realized marginal products. 

When risk aversion is introduced, the prize salary scheme no longer 
duplicates the allocation of resources induced by the optimal piece 
rate. Depending on the utility function and on the amount of luck 
involved, one scheme is preferred to the other. An advantage of a 
contest is that it eliminates income variation which is caused by factors 
common to workers of a given firm. 

Finally, we allow workers to be heterogeneous. This complication 
adds an important result: Competitive contests do not automatically 
sort workers in ways that yield an efficient allocation of resources 
when information is asymmetric. In particular, low-quality workers 
attempt to contaminate firms composed of high-quality workers, even 
if there are no complementarities in production. Contamination re¬ 
sults in a general breakdown of the efficient solution if low-quality 
workers are not prevented from entering. However, when player 
types are known to all, there exists a competitive handicapping 
scheme which allows all types to work efficiently within the same firm. 
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An Exploration in the Theory 
of Exchange-Rate Regimes 


Elhanan Helpman 

Tel-Aviv University 


I'hrce exchange-rale regimes—a floai, a one-sided peg, and a 
cooperative peg—are evaluated and compared in terms of efficiency 
and welfare levels. The framework of analysis embodies country- 
specific monies, with the money of each country being used to 
transact in its commodity tnarkets and its currency-denominated 
bonds. Wellare levels depend only on consumption levels. In the 
presence of perfect foresight all equilibrium allocations are Pareto 
efficient. In a floating exchange-rale regime the perfect foresight 
e(]uiltbriiitn allocation coincides with an equilibrium of a costless 
barter economy. The same result holds in a one-sided peg if the 
pegging country's exchange-rate stabilizing authority breaks even 
over titne. In a cooperative peg regime theie is a difl'etent equilib- 
iiutn allocation for each combination of exchange-rate levels and 
monetary policies. Problems of policy coordination and conflicts in 
desiretf moneiai y policies are di.scussed. 


I. Introduction and Summary 

The literature on international economics lacks systematic compari¬ 
sons of welfare levels that are attained in different exchange-rate 
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systems. Nevertheless, this type of information is essential for a well- 
founded opinion about the relative desirability of alternative 
exchange-rate regimes, like fixed as opposed to floating. The need for 
research in this area can hardly be overemphasized, and it is more 
surprising that only very recently have there been attempts to provide 
such comparisons. 

It is clear that in any analysis of exchange-rate systems the 
specification of the role of money will play a major role because, after 
all, an exchange rate is a relative price of two monies. It is, however, 
unsatisfactory to model money as an asset which serves only as a store 
of value, as is frequently done, because in this case money is just 
another asset without intrinsic features. An explicit modeling of these 
roles is required for a proper evaluation of the relative desirability of 
alternative exchange-rate regimes. 

The pre.sent investigation is based on a specification of the role of 
money. Money can be used as a store of value, but it also has to be 
used in every market transaction. Purchases of commodities require 
payments in money, lending requires money to be transferred from 
the lender to the borrower, and debt repayments require money to be 
transferred from the debtor to the creditor. This is, of course, an 
application of the constraint transactions approach to the demand for 
money that was suggested by Glower (1967) (for a recent application 
see Lucas [1980rt]). Most important, since I deal with more than one 
country—each country having its own money—it is assumed that for 
purchases of a particular country’s goods one has to pay with that 
country’s money. (This specification is also adopted by Stockman 
[1980], Helpman and Razin [1980], and Lucas [1980(>].) Moreover, 
particular currency-denominated loans are given by providing the 
borrower with this currency. Debt repayments are done with the 
currency in which the debt is denominated. This specification of 
the transactions role of money introduces natural liquidity constraints 
into the economic system, and it clearly distinguishes money from 
other financial assets. 

My framework of analysis, which incorporates money in the 
above-described way into an intertemporal model in which utility 
levels depend only on consumption levels, is presented in Section II. 
In the remainder of the paper the model is used to investigate three 
types of exchange-rate regimes: a floating exchange-rate regime and 
two types of fixed exchange-rate regimes. I discuss the implications of 
these regimes about global efficiency and about the distribution of 
welfare, as well as the determination of financial variables like prices, 
exchange rates, and interest rates. The emphasis is on financial as¬ 
pects of exchange-rate regimes. 

The fixed exchange-rate regimes that are considered are a one¬ 
sided peg and a cooperative peg. The one-sided peg is a regime in 
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which one country pegs its exchange rate vis-a-vis another currency, 
and the foreign exchange authority of the pegging country engages in 
foreign borrowing (lending) in order to finance its foreign exchange 
operations. The foreign exchange authority is constrained to repay its 
debts. In the cooperative regime there is a foreign exchange authority 
which is supported by all countries. This means that whenever it 
needs to sell a currency of a particular type, the country whose 
currency is required will provide the currency to the foreign ex¬ 
change authority. There are important differences in the implications 
of these systems. At this stage I wish only to point out that, in the 
former, balance-of-payments adjustments do not change the quantity 
of money in circulation of the passive country but may change the 
quantity of money in circulation of the pegging country. This means 
that the value of money in circulation in the world economy may 
change as a result of balance-of-payments adjustments. In the latter 
system the aggregate value of money supply does not change as a 
result of balance-of-payments adjustments, which make it fall in line 
with recent modelings of the process of international adjustment 
under a fixed exchange rate (see, e.g., Mundell 1968, chap. 18). 

Following the description of the framework of analysis, I discuss in 
Section III some general properties of equilibria. Most notably, it is 
shown that in the presence of perfect foresight the equilibrium allo¬ 
cation of consumption is Pareto efficient in every exchange-rate re¬ 
gime. The assumption of perfect foresight is extremely important for 
this result. It requires that governments specify their policies in ad¬ 
vance and do not change them. For example, surprising monetary 
changes may destroy the efficiency of the resulting equilibrium. Al¬ 
though perfect foresight implies efficiency, the distribudon of welfare 
depends on the exchange-rate system. 

Section IV is devoted to a discussion of a floadng exchange-rate 
regime. It is shown that in the presence of perfect foresight, and 
independently of monetary policies, the equilibrium allocation of 
consumption coincides with an equilibrium of a costless barter econ¬ 
omy. Monetary policies which are specified in advance do not affect 
equilibrium consumption levels. It is clear from this result that in an 
equilibrium the liquidity constraints do not impose special constraints 
on the allocation of real resources. They do, however, help to de¬ 
termine nominal variables—prices, exchange rates, and interest 
rates—as is explained in that section. 

The one-sided peg regime is discussed in Section V. It is shown 
that, provided the foreign exchange authority breaks even, in this 
regime too the resulting perfect foresight equilibrium allocation of 
consumption coincides with an equilibrium of a costless barter econ¬ 
omy. Hence, a floating exchange-rate system and a one-sided peg 
have the same efficiency and distributional implications. There are. 
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however, also important differences. In particular, in the one-sided 
peg regime there is only one exchange-rate level which enables the 
foreign exchange authority to break even in the face of a given policy 
of monetary transfers (taxes). Hence, there is need for coordination 
of the exchange-rate level with the required accompanying 
monetary-fiscal policy mix. CiKirdination of policies is not required in 
a floating exchange-rate regime. 

Section VI discusses the cooperative fixed exchange-rate regime. In 
this regime there is no need for policy coordination. But the perfect 
foresight equilibrium allocation of consumption and the distribution 
of welfare levels depend on tbe exchange-rate level and the 
monetary-fiscal policy mix that each country adopts. Only for very 
particular policy specifications will the cooperative peg equilibrium 
allocation of consumption coincide with the equilibrium allocation of 
consumption in a bai ter economy. 

There are several interesting directions in which the pre.sent study 
can be extended in order to sbed more light on the relative desirabil¬ 
ity of alternative exchange-rate regimes. Some of them are discussed 
in Section VII. 

II. The Framework of Analysis 

The world that I consider consists of two countries, a home {H) 
country and a foreign {F) country. Every country produces a single 
output, and the two outputs are perfect substitutes in consumption. 
In every period a country faces a given unaltetable level of output, 
rime is discretely measured by time periods. 

Neither the assumption of a .single output nor the a.ssumplion of 
perfect substitutability in consumption is restrictive for the purpose of 
this paper; the main results do not depend on these assumptions. 
However, the assumption that output levels are unalterable does not 
enable me to discuss employment problems which are of major coti- 
cern in discussions of exchange-rate systems (see, e.g., Friedman 
1953; Flanders and Helpman 1978). 

Every country has its own money. Purchases of the home country’s 
goods are paid for with the home country’s money; purchases of the 
foreign country’s goods are paid for with the foreign country’s 
money. Firms sell goods and transfer the profits to their stockholders. 
Home country firms are owned by home country residents while 
foreign country firms are owned by foreign country residents. As¬ 
suming that there are no costs of production, the value of a country’s 
output in terms of its own money is transferred by firms to its resi¬ 
dents.' 


Since X will not deal with uncertainty elements and I will assume the existence of 
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The precise specification of money flows is important. Money is 
needed for transacting in the market, and it imposes liquidity con¬ 
straints on economic agents. One cannot buy goods unless one has the 
money to pay for them, and firms cannot distribute dividends unless 
they have the money to do so; all payments are done in the form of 
money. It is assumed, therefore, that firms distribute dividends at the 
end of each period. During a single period a firm accumulates money 
from sales of its goods, and it distributes this money to its owners at 
the end of the period. 

An individual arrives at the beginning of period t with money 
balances that he left in his possession in period I — 1 (these may be 
home country as well as foreign country money balances) and with 
money balances that he has received from firms via dividend pay¬ 
ments. At the beginning of period t he has to repay his short-term 
debts (debts may be positive or negative) that were acquired in period 
1—1, including interest payments; these may be debts denominated 
in the home or foreign country’s currency. 

The individual can freely exchange home currency for foreign 
currency and vice ver.sa. In a fixed exchange-rale regime this is done 
with the Exchange Rate Stabilizing Authority (ERSA for short), while 
in a floating exchange-rate regime this is done in the foreign ex¬ 
change market. The individual can al.so borrow (lend) home or 
foreign country money at the beginning of period t. The completion 
of these transactions leaves him with certain quantities of home and 
foreign money as well as home and foreign currency-denominated 
debts (some of these may, of course, be zero). Now, part or all of the 
money balances can be spent on goods during period l —home cur¬ 
rency on home goods and foreign currency on foreign goods—while 
the remaining part can be kept in order to increase next period’s 
purchasing power. Thus, part of the total money holdings can be 
“money to spend” and part of it can be “monev to hold” (see Hicks 
1967). 

In this paper 1 consider the case in which there arc no restrictions 
on borrowing and lending behavior, except that debts have to be 
repaid. Foreign country decision variables are denoted by an asterisk, 
while home country decision variables are denoted without an as¬ 
terisk. The world economy faces a horizon of length T —the .same for 
both countries.^ 


bonds, ihere is lU) need to allow trade in owneiship shares m firms, l>etause such ir.«le 
adds no new real trading opportunities. 

* I assume a finite horizon, it is, however, easy to see from the following analysis that 
the results do not change when the horizon is infinite, provided equilibria exist. In 
order to analyze the infinite horizon case one has to replace tonstraiiils (If) and (2f) 
with the requirement that the present value of defits is nonposiuve for 7 going to 
infinity (see, c.g., Helpman and Razin 1981). 
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The home country’s representative consumer solves the following 
problem: 

Choose 

fFt< ^Ht> Mf-t, B/ftt ^Ft}T-l ( 1 ) 

to maximize 

T ^Fl> T ^F2y • • • . Cf-r), 

subject to 


c«(. fpit Mhci ^Ft ^0, < = 1, 2, . . • , T, (la) 

Af//i + FiMfi ^ + eiBpi + X^, (lb) 

pHt^Ht ^ t = I, 2, ... , T, (Ic) 

pFt^Ft ^Ft> t = \, 2, . . . ,T, (Id) 


^ HI ft^FI ® (Mfft-l ~ pHt-l^Ht-i) + ^li^Ft~l ~ pFt-I^Ft-l) 

"*■ pH(-iynt-i ~ (1 + Ht-\ ~ (1 + b'(-i) (le) 

ftBfi-i + But + FiBfi + X/it, t — 2, S, ..., 7, 

(If) 

where 

u,f= home country utility function that is strictly increasing in all 
arguments, 

Chi — consumption of home output in period t, 

Cff = consumption of foreign output in period t, 

:y, = beginning-of-period holdings of home money in period /, 
i', = beginning-of-period holdings of foreign money in period t, 

= home currency borrowing (lending when negative) at the be¬ 
ginning of period I, 

r, = foreign currency borrowing (lending when negative) at the 
beginning of period t, 

H — the initial stock of home country money, 

= government transfers (taxes if negative) to home country resi¬ 
dents in terms of home money at the beginning of period I, 
e, = the exchange rate in period t, defined as the price of foreign 
currency in terms of the home country’s currency, 
put = the price of home output in terms of home currency in period 
I, 

pf, = the price of foreign output in terms of foreign currency in 
period t, 

ynt = home output in period t. 
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in, = the nominal one-period interest rate on period t home 
currency-denominated debt, and 
if, = the nominal one-period interest rate on period i foreign 
currency-denominated debt. 

At the beginning of the first period, home country residents have 
Mh units of home currency, and they receive a transfer of X,,, units of 
home currency from their government. They borrow units of 
home money and Bf, units of foreign money. Then, they reallocate 
total money holdings between home and foreign currency via the 
foreign exchange market in a floating exchange-rate regime or the 
ERSA in a fixed exchange-rate regime. T’his way they end up holding 
Mhi units of home money and Mf, units of foreign money. No further 
financial transactions are needed during the first period. Consumers 
spend part or all of their money holdings on goods during the period. 
Thus, total beginning-of-period home money holdings are composed 
ofpHiCH, units of money to spend and — Ph\Chi units of money to 
hold. This is true for foreign money holdings as well. 

At the beginning of the second period, home consumers have M^, 
~ units of home currency and Mf, - pf,Cf, units of foreign 

currency. They receive dividend payments which consist of pHiyu, 
units of home currency and government transfers which consist of 
X,ii units of home currency. They repay their first-period debts, 
including interest payments—(1 + in home currency and (1 -f 

ifi)Bfi in foreign currency—and acquire new debts— B^^ in home 
currency and Bf 2 in foreign currency. They can make instantaneous 
transactions in the foreign exchange market or with the ERSA in 
order to coordinate the momentary money flows. Finally, they reallo¬ 
cate total money holdings between home and foreign money— 
and Mf 2 . And the process continues. 

We may now turn to the decision problem of the foreign country’s 
representative consumer. His decision variables are denoted by an 
asterisk, the initial stock of foreign money is denoted by Mf, foreign 
output in period t is denoted byy,.,, and government transfers (taxes if 
negative) to foreign residents in terms of foreign money at the begin¬ 
ning of period t are denoted by X^,. All other variables are as defined 
above (except for the asterisk). 

Choose 


{r^ro cjS(, Mfj,, M%, B%(, 


(2) 


to maximize 


Uf{c%, + cpi. cHi + efi, . . . , c)}r efr). 
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subject to 
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t=l,2 . T, 

+ Mh ^Mf+ + Bjf, + Xft, 

e, Pi 

<= 1 . 2 , ..., 7 , 

pfttpt « M%. / = 1, 2, .... J, 




+ M% 


Pi 


+ (M?,-, - py,-^c%-,) 


(2a) 

(2b) 

(2c) 

(2d) 


+ Prt^,y,.,^, - (1 + t«-,)-5^ - (1 + (2e) 

+ + B% + Xf.„ / = 2, 3.7, 

(20 

I bis is the foreign counterpart of the home counti y’s decision prob¬ 
lem (1). 


III. A Fundamental Property of Equilibria 

The purpose of this section is to show that all perfect foresight 
equilibtia, independently of the exchange-rate regime, are Pareio 
optimal, rhis is an extremely strong result because it implies that in 
the present framework the exchange-rate regime is irrelevant from 
the efficiency viewpoint; that is, there is no exchange-rate regime that 
IS more efficient than other exchange-rate legimes. This does not 
mean, of course, that exchange-rate regimes arc irrelevant from the 
viewpoint of other etonomic considerations. For example, it may 
happen that different exchange-rate regimes have different distribu¬ 
tional implications in the sense that in some of them a particular 
country is better off than in others. I will address this i,s.sue in the 
following sections when discussing particular exchange-rate regimes. 
At this stage 1 should like only to point out that the ecjuivalence result 
depends strongly on the ability of governments to pursue nondis- 
torting absorption policies whenever they are needed in order to 
accompany an exchange-rate regime. I will have more to say about 
this in Section V. 

First, independently of the exchange-rate system, in an equilibrium 
the interest rate and purchasing power parity conditions have to be 
.satisfied. In order to have a .solution to (1) and (2) with finite values of 
Bhi, Bf i, B%„ Bff, t = 1, 2, .... 7 — 1 (last-period debt is always zero 
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due to [lil and [2n), there has to be interest rate parity in every 
period; 

(I +i„d= (1 <= 1.2. T-l. (3) 

For if (3) does not hold, an investor can arbitrage between foreign and 
domestic debt so as to make infinite profits. One can use problems (1) 
and (2) to show that when (1 + iffi) < (1 + if-,)(e,+ ,/«,) the solutions to 
these problems yield B„„ B%t +00 and Bp,, Bf, -> — 00 , and when 
(1 + im) ^ (1 the solutions to these problems yield B„,, 

BJf, -♦ -00 and Bp„ Bf, +ao. These asset demands are, of 
course, inconsistent with an equilibrium. 

Second, in order to have equilibrium in commodity markets there 
has to be purchasing power parity in every period: 

pHt ~ ^ipFit i — 1,2, . . . , T. (4) 

For it pin > ^ipFt< the solutions to (1) and (2)—taking into account 
(3)—yield zero demand for the home country's output in period t, 
which is inconsistent with the clearing of commodity markets. Simi¬ 
larly, if pm < <^tpnt demand for foreign output drops to zero. 

In order to prove (he main result of this section, it is useful to 
rewrite the consumer problems so as to replace the sequence of 
budget constraints by a single wealth constraint. I'his can be done in 
the present context despite the existence of liquidity constraints, be¬ 
cause (a) (here exist bonds, {/>) financial assets (including monies) are 
Ireely traded at the beginning of each period, and (c) the interest 
parity and purchasing power parity conditions are satisfied. 

First, observe that in a solution to problem (1) constraints (lb) and 
(le) will never be satisfied with strict inequality. For if there is a strict 
inequality in one of these constraints, say, in period t, one can increa,se 
domestic or foreign money holdings at no extra cost and use it as 
money to spend, thus increasing consumption in period t without 
altering all other choice variables. This will increase the attained 
utility level. We can rewrite, therefore, constraints (lb) and (le) in 
equality form without altering the nature of the problem. Second, 
denoting by m„, and m,-, the components of home and foreign monies 
to hold in period t, 

mm — Mnt ~ pm^m-' ^ ~ t,2 . T, {5a) 

Mp, = Mp, — pp,Cp,, t = 1,2,...,/, (l^b) 

we can replace (Ic) and (Id) with nonnegativity constraints on mm and 

mp(. 

Combining (lb) and (le)—written in equality form—with (5), and 
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applying the parity conditions (3) and (4), we get: 

pH\^i ^Hi ~ Mu + Bm + (6) 

(1 + + (1 + + e,mp,) - (I + 

+ e/mn_,) = (1 + im-tY^pm-iym-i + X„,) - ( 7 ) 

— + (1 + ^(Bm + BiBpt), t = 2, 3, . . . , r, 

where C( = c„r + aggregate consumption in period t. 

Now define the discount factors: 

I 

= n^' < = 2, 3, . . . , T, andd„, = 1. (g) 

T = 2 

Then, by adding (7) for < = 2 to (6), adding the result to (7) for < = 3 
multiplied by dn 2 , then adding the result to (7) for < = 4 multiplied by 
dfj 3 , and so on, we get (using the interest parity condition [3]) the 
single wealth constraint: 

T T-_t 

^ d'litpHtCt + ^ d.nt+x{iHt^Ht + <’r+itVfttif'() 

<“( 1=1 

+ d/fTiTri/fT + ej-tHfr ~ Bi/r — ej-Bfr) (9) 

r-i T 

(=I /-I 

On the left-hand side of (9) there are three terms. The first represents 
the present value of consumption. The second represents the present 
value ol interest costs associated with holding money as a store of 
value, that is, associated with money to hold. The last term represents 
the present value of last-period money to hold minus debt. On the 
right-hand side of (9) there are three terms too—initial money hold¬ 
ings, the present value of dividends, and the present value of gov¬ 
ernment transfers. The sum of the right-hand-side terms represents 
initial wealth. Dividends are distributed at the end of each period, 
which explains why the firms’ proceeds in period t are discounted by 
and not by d^. 

Now problem (1) can be reformulated as follows. Choose c„ mm, m^t 
^ 0,t = 1, 2,. . . , T, and Bf,T,BfT < 0, so as to maximize Uh(ci, c^, ■ ■ •, 
c-j) subject to (9). Having found a solution to this problem, one can 
find a solution to problem (I) by choosing an arbitrary division of 
consumption in each period between home and foreign goods, then 
using (5) to solve beginning-of-period money holdings, and finally 
using (6) and (7) to solve home and foreign borrowing. 
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Applying to problem (2) the above-outlined procedure, one can 
derive the foreign counterpart of (9). Due to the interest and pur¬ 
chasing power parity conditions, the foreign wealth constraint can be 
written using foreign currency prices and discount factors or home 
currency prices and discount factors. The foreign country’s decision 
problem can then also be reformulated using a single wealth con¬ 
straint. 

The new formulations of problems (1) and (2) reveal an important 
characteristic of equilibrium allocations. If the interest rate on cur- 
rencyj-denominated debt is positive in period t (j = H orF), neither 
the home nor the foreign country will use currency j as a store of 
value in period t (i.e., mj^ and m% will be chosen to equal zero). If, 
however, the interest rate on currency ^-denominated debt is zero in 
period t, both countries will be indifferent toward using currency^’ as 
a store of value in period L This means that, in an equilibrium, 
implicit interest costs associated with money to hold are zero in every 
period.® This can be seen by observing that in the reformulated 
problems the “price” of home money to hold is«/H,+ ,f„, and the “price” 
of foreign money to hold is while the marginal utility of 

money to hold is zero. The new formulation reveals also that in an 
equilibrium consumers make sure that the present values of last- 
period monies to hold and debts are all zero. This means that in an 
equilibrium the effective constraints on the choice of consumption 
plans are; 

T r-i r 

^ . dfftpHtCt — Mh -t- ^ dfft+xpHtyHl + ^ d.ntXnit (10) 

(-1 r=i (-1 


T T-l T 

^ d'f'tpFfi* ~ dpt+ipjr,yf, ^ dfiXf,. (11) 

(-1 t-i (-1 

Thus, equilibrium consumption plans are solutions to: chooser, 2= 0, 
t - 1, 2, . . . , T, to maximize m„(c,, , Cj) subject to (10); and 

choose c*t^ 0,t = 1,2,... , T, to maximize c|, . . ., cf) subject to 
(11). Due to the interest and purchasing power parity conditions 
{dFtpFi)/idfipFi) = {d„,pHt)l{dHipHi), fort = 1, 2, .... r, implying that 
consumers face the same relative prices of consumption in both 


’ This implication is a direct consequence of the assumpuoii that one can switch 
costlessly from money lo bonds and vice versa. With f>ositive transaction costs associated 
with purchases and sales of bonds, money is used as a store of value even when interest 
rates are positive (sec Baumol 1952 and I'obin 1956). 1 his suggests that additional 
insight might be gained by extending the present model to allow for transaction costs in 
hnancial markets. 
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problems. Hence, every equilibrium allcKation of consumption ir 
which consumption demands are solutions to these problems is Paretc 
efficient. This proves the main result of this section. 

It is important to observe at this stage that the implications of being 
in an equilibrium which have been discussed so far are independent 
of the exchange-rate system; they apply to systems of fixed and 
lloating as well as managed exchange rates. Every exchange-rate 
system has associated with it specific financial market-clearing condi¬ 
tions (the clearing conditions for commodity markets, -I- c^, = 
and Cp, + cf, = yp„ are the same for every exchange-rate regime), 
which may have specific distributional implications. These implica¬ 
tions are considered in the following sections. 

IV. Floating Exchange-Rate Regime 

By a floating exchange-rate regime 1 refer to a regime in which no 
govcrtiment agency intervenes in the foreign exchange market for 
the sake of intervention. Thus, if one was to reformulate the present 
model to include government purchases of goods and the govern¬ 
ment was instructed to acquire foreign exchange in order to purchase 
foieign goods, this would introduce intervention in the foreign ex¬ 
change market but not for the sake of intervention. However, if a 
government agency purchases foreign exchange, keeps it for several 
periods, and then resells it, we refer to this action as intervention for 
the sake of interventiott. In the presence of this type of policy the 
exchange-rale regime is a managed float. 

Now consider the economies described in the previous section in a 
Hoatitig exchange-rate regime. The governments are assumed to en¬ 
gage in no economic activity except for the monetary transfers 
and Xp, (which may be zero), 'fhis means that each currency has only 
one source of money creation—government transfers—that all debt 
is private debt, and that all commodity demands are private demands. 
Under these circumstances a floating exchange-rate regime equilib¬ 
rium can be defined as follows; 

Definhion; Prices {pm, exchange rates interest 

rates , and the allocation {r,,,, Cfi, 'ffn, ift, M„i, 

Bnt, By,, Bf,„ constitute a floating exchange-rate regime equi¬ 
librium relative to [u„, Uy, M„, My, {X„,, Xy,, ym, ‘f: 

a) for p„, = p„(, py, = pyi, e, = e,, t = 1,2,... ,T, and i„i = Im, tyi — Ifi, 

t = 1,2. T - 1, 

i) {ch(> c> 7 . Mm, ^Fi, Bm, solves (1), 

•i) Y%,rh,m„Mf„B^„Bf,Y,., solves (2), 

b) Cm + c^i ^yni, t = 1, 2. T, 
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c) Cft + = yFi, t = 1,2, ... ,T, 

t 

d) M„, + Mh = M„ + Y, t= 1,2._ T, 

T«1 

t 

e) MFt + mt = Mf + Y, XFr, t= 1 . 2 .- T, 

T=-l 

f) B„t+Bh = 0,t = \,2,...,T, 

5,, + = 0. t = 1,2. T. 

In an equilibrium, individual optimization based on self-fulfilling 
expectation (condition a) is consistent with clearance of all markets in 
every period; market for home output (condition h), market for 
foreign output (condition c), market for home money(condition d), 
market for foreign money (condition e), market for home currency- 
denominated debt (condition/), and market for foreign currency- 
denominated debt (condition g). 

The right-hand side of condition d represents the supply of home 
money in period t\ it consists of the initial stock of money plus 
monetary injections up to (and including) period 1. Similarly, the 
right-hand side of condition e represents the supply of foreign money 
to the private sector in period t. 

The main point that I wish to make in this section is that— 
itidependent of the transfer policies (X^, and Xp,) —in a floating 
exchange-rate regime the equilibrium allocation of consumption con¬ 
forms to a barter equilibrium allocation of the economies under 
consideration, and vice versa. That is, if we allow the economies to 
exchange goods over time, making all trading decisions in the begin¬ 
ning of the first period (as in an Arrow-Debreu model), the resulting 
equilibrium allocation of consumption will be an equilibrium alloca¬ 
tion of consumption in a floating exchange-rate regime, and vice 
versa. This is independent of the transfer policies (or, as some would 
call it, of monetary policies). 

Despite the fact that the equilibrium allocation of consumption is 
determined only by real factors (the factors which determine an 
exchange equilibrium), every real equilibrium has associated with it 
equilibrium values of monetary variables like commodity prices, ex¬ 
change rates, and interest rates (cf. Kareken and Wallace, in press). 
The monetary variables are determined within the system. 

In order to prove the main result of this section recall the formula¬ 
tion of the optimal consumption plan problems that was developed at 
the end of Section III. 

Choose 


Q, t = \,2,. . . ,T, 


(12) 
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to maximize 


^ 2 . - • • » Cr) 


subject to 


T T-1 r 

y dut pHtf^t — ^ dHt+ipHtym + dHtXtff 


Choose 


c,*^0, t= 1.2. ....T-, 


(13) 


to maximize 

up(ct,c}, . . . ,cf), 

subject to 

T r -1 r 

^ df'tPh'I^T ~ ^ dpf^-ipFtyFt + ^ dfiXff 

t=i r*i <-i 

Equilibrium consumption schedules are solutions to these problems 
independent of the exchange-rate regime. 

It is shown in the Appendix that, in a floating exchange-rate regime 
equilibrium, the right-hand side of the wealth constraint in ( 12 ) is 
equal to '^\,vdmpHtyHt while the right-hand side of the wealth con¬ 
straint in (13) is equal to This means that in each country 

the present value of output equals the value of initial money holdings 
plus the present value of dividends plus the present value of govern¬ 
ment transf ers. Hence, a country's assets reflect precisely the value of 
its real output. Now, since {df;pf-t)l{dyipf;) = (diftpHt)KdHipHi) (due to 
interest rate and purchasing power parity), an equilibrium allocation 
of consumption in a floating exchange-rate regime—which is a solu¬ 
tion of ( 12 ) and (13)—coincides with an equilibrium allocation of 
consumption in a costless barter economy.^ Hence, despite the fact 
that the monies play a real role in facilitating transactions, thus intro¬ 
ducing potential divergences from barter equilibria, in a floating 
exchange-rate regfime with perfect foresight no such divergences 
exist. 

The fact that the equilibrium allocation of consumption in a float¬ 
ing exchange-rate regime coincides with the equilibrium allocation ol 
consumption in a costless barter regime enables a rather simple cal¬ 
culation of equilibrium prices, exchange rates, and nominal interest 
rates. First, compute the equilibrium relative intertemporal prices in 
the barter regime. Let them be ^1 = 1, ^ 3 , . . . , qr- Then we know 


This result is also derived in Lucas (1980b). 
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that in the floating exchange-rate regime prices and interest rates 
satisfy: 


dmpHt 


d’Ht-lpHt-X 


_ dpippt _ 

d'Ft-tpFI-i 9/-1 


t = 2,3,....T. (14) 


Now, in the last period all money is spent on goods. Therefore, 
last-period prices are 



Having last-period prices, we can determine prices and interest rates 
in period T — 1. From (14), using= (1 + k = H, F, we 

have 


( 1 -K iHT-x)pHT-, = . (1 + i,r-,)pFT-X = . (15) 

qr qr 

First considerand it/r-t- Suppose > 0. Then all H money is 
spent on H goods in period T — 1 and 



Substituting this into (15), we calculate the interest rate t^r-x which 
satisfies (15). If the calculated interest rate is nonnegative, we have the 
equilibrium price level and interest rate. If the calculated interest rate 
is negative, then the equilibrium interest rate is zero. Substituting inr-x 
= 0 into (15), we can calculate the equilibrium price A similar 

procedure is applied to calculate iVr-i and Pft- i • It is now clear that we 
can use the same procedure to calculate pHT-i< ^ht- 2 < pf'T-i> and itfT- 2 - 
Thus, starting from last-period prices, all prices and interest rates can 
be recursively calculated. 

Observe that prices are determined according to the quantity 
theory of money. When a country’s money is not used as a store of 
value, all of it “chases” that country’s goods, and the country’s price 
level is determined according to the quantity equation with unitary 
velocity of circulation. When part of the money is used as a store of 
value, which may happen in this model only if the nominal interest 
rate is zero in this country, only part of the stock of money in 
circulation chases goods, and this part determines the price level. In 
fact, if the algorithm described above produces a zero interest rate 
in a certain period, the calculated price level of this period can be used 
in conjunction with output to calculate the quantity of money re- 
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quired for transactions. The residual quantity of money will be used 
as money to hold.'’ 


V. Fixed Exchange^Rate Regime: One-sided Peg 

In this section I consider a fixed exchange-rate regime in which the 
home country pegs its exchange rate to the foreign country. For this 
purpose the home country establishes an exchange-rate stabilizing 
authority (ERSA) whose task is to buy and sell foreign currency at the 
fixed rate of exchange e, which is assumed to remain the same in every 
period. The ERSA replaces the foreign exchange market. Since the 
home country cannot issue foreign money, its one-sided decision to 
peg the exchange rate makes it necessary for the ERSA to borrow 
abroad whenever it is asked to sell foreign currency, and 1 assume that 
whenever it is asked to buy foreign currency it lends it out. This 
assumption avoids unnecessary interest costs to the ERSA (these 
interest costs are sometimes considered as part of the costs of 
exchange-rate stabilization). Also, it is assumed that, whenever the 
ERSA buys foreign currency, it pays for it with new issues of home 
currency, thus increasing the quantity of home money in circulation. 
Whenever the ERSA sells foreign currency, it leaves in its posses.sion 
the home currency that it receives against these .sales, thereby reduc¬ 
ing the quantity of home money in circulation. This mechanism of 
exchange-rate stabilization approximates practices that were used in 
many countries (central banks usually perform the tasks of the 
ERSA). 

In what follows 1 wish to make two main points. One concerns the 
distributional implications of the exchange-rate system that is being 
considered (we know from Sec. II that the equilibrium allocation of 
consumption is efficient). It will be shown that when the ERSA is 
constrained to end up with no foreign debt, the resulting equilibrium 
allocation of consumption is the same as in a costless barter economy 
and, therefore, also the same as in a floating exchange-rate regime. 
Second, in order to meet the ERSA’s intertemporal budget constraint 
(the constraint on last period’s foreign debt), the transfer policy can¬ 
not be arbitrary. This means that in the present system the transfer 
policy cannot be divorced from the stabilization of the exchange rate, 
which may be considered as a disadvantage of this fixed exchange- 
rate regime.® Apart from these points we will have a closer look at the 


* An explicit calculation ot an equilibrium for two toy economie.s is pre.sented in 
Helpman (1979). Helpmaii and Razin (1981) present a similar calculation for the 
infinite horizon ta.se. 

* A similar argument, but in aditterenl setup, appearx in Helpman and Razin (1979). 
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behavior of prices and money stocks and the relationship bettveen the 
exchange rate and the required transfer or absorption policy. 

I start by presenting the ERSA’s budget constraint. In order to save 
space let us use the convention that variables with a subscript t — 1 are 
all zero for (= I. The ERSA's foreign debt in period ( is denoted by 
and its addition to// money in circulation in period i is denoted by 
Xm may, of course, be positive or negative). The exchange rate is 
constant; that is, e, = e. 

There are two economic units which actively transact with the 
ERSA: the private sector of the home country and the private sector 
of the foreign country. The private sector of the home country has in 
period t an excess demand for foreign currency which equals Mf, + (1 
+ - wtf,-, - Bf,; it equals total desired holdings of foreign 

currency, plus foreign debt repayments, minus the quantity of 
foreign currency transferred from the previous period (the previous 
period’s foreign money to hold), minus foreign borrowing. This ex¬ 
cess demand is satished by purchases of foreign currency from the 
ELRSA, for which the private sector pays with home country money. 
The foreign private sector has an excess demand for home currency 
which equals M/5,-f (1 + — B/5,. This excess demand 

is also .satisfied by transacting with the ERSA. For the purchase of 
home currency the foreign private sector pays [M/5, -l- (1 
- wj5,_, - Bfj,]/e units of foreign currency. Hence, the deficit in the 
balance of payments which equals the net excess demand for foreign 
currency with which the ERSA is faced in period t is: 

~ F (1 -I- if(-f)Bft-i ~ nipi-i “ Bfi] 

mt„ + (I + - B%,] 

e 

Against this excess demand for foreign currency there is an equal 
value excess supply of home currency which the ERSA has to absorb. 
Hence, X^t = and 


Xhi — e[Myt + (1 + l/■,_|)Bf•,_, Bf,] (17) 

+ [Mi5, + (1 + t„,-,)Bi5,-. - - 5 j5,|. 

Apart from the net excess demand for foreign currency by the private 
sector in period t, the ERSA needs foreign currency in order to repay 
the debt that it acquired in period < — 1. Hence, the ERSA's foreign 
borrowing in period t equals the private sector’s net excess demand 
for foreign currency plus its repayments of foreign debt: 


Bf; — [M^., -(-(!-(- if;^i)B Pf-i B f;] 


[M/5, + (1 + t//,_|)fl/5(-i wjjt-i B’^il ^ (1-1- i/r,_,)B^,_, 


(18) 


e 
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The last equation is based on the assumption that the ERSA does not 
carry forward any foreign currency; when it has foreign currency it 
either sells it or lends it out. This assumption assures efficient man¬ 
agement of the exchange-rate stabilizing operations.' From (16) and 
(18), we have; 

Dpi = Bfi — — ipi_iBfi^i. (19) 

The deficit in the balance of payments equals the increase in the 
ERSA’s foreign debt minus its interest payments. This is the definition 
of the deficit in the balance of payments which associates the deficit 
with the ERSA’s net sales of foreign currency to the private sectors, 
which also reflects the contraction of home currency in circulation 
due to the balance of payments. However, in this framework it might 
be more appropriate to define the deficit as the increase in the ERSA’s 
foreign debt. The difference between the former and latter defini¬ 
tions amounts to the exclusion or inclusion, respectively, of interest 
payments on the ERSA’s foreign debt in the definition of the deficit in 
the balance of payments. Clearly, if the latter definition is used, then 
the contraction in home money in circulation as a result of exchange- 
rate stabilization is not equal to the deficit in the balance of payments. 

Now we are prepared to define a one-sided peg fixed exchange-rate 
regime equilibrium. A formal definition is provided in Helpman 
(1979). Here I discuss informally the difference between the one¬ 
sided peg equilibrium and the floating exchange-rate equilibrium. 

In the one-sided peg the ERSA stabilizes the exchange rate at a 
fixed level c in all periods. Hence = e for t — 1, 2,. .. , 7". As a result, 
it engages in foreign borrowing (B®,) and monetary injections (Ag,). 
Since the ERSA has to satisfy its temporal budget constraints and 
break even over time, all choice variables—private as well as the 
ERSA’s—have to satisfy (17) and (18). In addition, the exchange-rate 
stabilizing authority has to end up with no foreign debts: 

= 0 . ( 20 ) 

Of course, commodity markets have to clear, so that conditions b and c 
from the definition of a floating exchange-rate equilibrium apply also 
to the present regime. Finally, money markets and bond markets have 
to clear. Since our ERSA does not operate in the domestic bond 
market, it does not carry forward foreign money, and it does not 
make foreign currency transfers, conditions e and/ from the defini¬ 
tion of a floating exchange-rate equilibrium apply also to the present 
regime. A difference arises in the equilibrium conditions for the 
home currency market and the foreign bond market, because the 


’’ See, however, Helpman and Razin (1980) for a case in which, due to uncertainty, 
the ERSA has to carry forward some foreign currency. 
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ERSA borrows on the foreign bond market and it changes the supply 
of domestic money by means of the exchange-rate stabilization oper¬ 
ations. The market-clearing conditions for the domestic money mar¬ 
ket and the foreign bond market read in the present regime: 

-(- MJ5, = ^ (X„r + X%r), 1,2 . T, (21) 


Bf, + Bf, + = 0, 1=1,2, (22) 


First let me prove that the one-sided peg fixed exchange-rate re¬ 
gime equilibrium allocation of consumption is the same as that in a 
barter economy. For this purpose recall the formulation of the opti¬ 
mal consumption plans that was developied at the end of Section III, 
and let us use formulations (12) and (13) for present purposes. It is 
sufficient to show that in the home country’s problem (12), the right- 
hand side of the wealth constraint equals the present value of output 
in equilibrium. 

In a fixed exchange-rate regime the interest rate parity condition 
(3) reads *//< = */■« = ij, < = 1,2,..., T, which implies equality of 

discount factors; that is, d„t = dy, = d„ t = 1,2. T. Hence, the 

equality between the right-hand side of the wealth constraint in (12) 
and the present value of output translates in the present context into: 
r-i T r 

Mh + ^ dt+ipHiynt + ^ d^Ht — ^ d,pHiyHt< 

(■=■1 t=i t“i 

which is equivalent to 

r-t T 

2 (.df+x — dt)pH(fm ~ dfpfijyiiT = ~Mff — ^ diXffi. (23) 

<-i i-i 


Now if i, > 0 (i.e., d,+ , < d,), money is not used as a store of value and 
all home money is spent on home goods, implying: 


(.dt+x dx)pfi(ynt ~ (dt+x 


I 

dt) ^ (A//T + ^t) 

T=I 


(24) 


If i, = 0 (i.e., d,+ , = df), then (24) holds trivially. Hence, (24) holds for 
< = 1, 2, . , . , T - 1. In the last period all money is spent on goods; 
therefore= M„ + X^x(^ht + ^t)- Combining the last result 
with (24) and applying the method employed in the Appendix, one 
gets:* 

r-i 

^ (d»+i ~ d,)pH,yHt dfpHTyHT ~ 

1=1 

T 

~Mff — ^ difiiXif, + X^i). 


For details see Helpman (1979). 
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By comparing the last result with (23) it is clear that in order to prove 
(23) it remains to show that = 0. From (17) and (18), 

using the interest rate parity condition = if, = i,, we have; 

^Ht = e[(l + i,^,)Bf,_, — Bf,]. 

Hence, using the convention BJfo = 0 and the fact that d,(l + (<_,) = 
we calculate 

T 

^ d-tXfii — e d,[(\ + i,_,)Bf,-., — Bf,^ 

(=i t=i 

T T \ 

^ ^ d,Bf, I = —edj-Bff = 0, 

(=.2 (=i J 

which completes the proof (the last equality results from [ 20 ]). 

1 have shown that the equilibrium allocation of consumption in the 
present exchange-rate regime is the same as in a costless barter econ¬ 
omy and, therefore, also the same as in an equilibrium of a floating 
exchange-rate regime. I his means that the welfare level achieved by 
each country is the same in each (equilibrium) of these three alterna¬ 
tive modes of organization of the world economy. What 1 wish to 
argue now is that the country that chooses to stabilize ihe exchange 
rate in the one-sided peg fixed exchange-rate regime cannot pursue 
an independent monetary (transfer) policy. In particular, for given 
transfer policies in the two countries, there is exactly one level at which 
it can stabilize the exchange rate and end up with no foreign debt. Put 
differently, if the country chooses to stabilize the exchange rate at a 
particular level, the recjuirement that it ends up with no foreign debt 
imposes on it a particular transfer policy; the present value of its 
transfers is uniquely determined, although their distribution over 
time can be freely chosen. Changes in the pattern of transfers change 
the pattern of deficits in the balance of payments but not the equilib¬ 
rium pattern of consumption, provided the present value of transfers 
is maintained at the equilibrium level. 

In order to prove my arguments, observe that the purchasing 
power parity condition p,,, = epf, and (23) imply: 

T 

M„ ^ diX,]i = e 

<-i 

Since in the last period foreign money is spent on foreign goods. 


/ —i 

drpfTytiT ^t^\)pFtyHty ( 25 ) 
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If > 0, < G {1, 2, . . . , T — 1}, then in period / all foreign money is 
spent on foreign goods. If = 0, then Therefore, 


(dt d't+t)prt ~ (d,t d't+i) 






t= 


. ,T - 1. 

(26a) 


In addition, 

d’lpFl _ 

d,-,pfi-t ’ 


t = 2,3,....r,<7, = I. 


(26b) 


where is the equilibrium marginal rate of substituhon between 
consumption in period t and consumption in period 1. From (26) 
foreign prices and the discount factors are uniquely determined. 
They are, in fact, identical to the equilibrium foreign prices and 
foreign discount factors in a floating exchange-rate regime because 
the equilibrium marginal rates of substitution q, are the same in both 
regimes. Hence, the equilibrium value of the expression in the brack¬ 
ets on the right-hand side of (25)—which is po.sitive—is independent 
of the home country’s transfer policy and exchange-rate level. There¬ 
fore (25) implies a one-to-one relationship between the exchange-rate 
level and the present value of home money transfers. 


VI. Fixed Exchange-Rate Regime; Cooperative Peg 

The last exchange-rate regime that I wish to consider is one in which 
both countries cooperate in the stabilization of the exchange rate. For 
this purpose the countries establish an ERSA which buys and sells 
currencies at a fixed rate of exchange, e units of home currency per 
unit of foreign currency. The cooperation con.sists of each country’s 
supplying the ERSA with its own currency whenever the ERSA is 
asked to sell it and is short on it. The ERSA does not engage in 
borrowing and lending. 'When foreign currency is sold to the ERSA, 
this transaction reduces the quantity of foreign currency in circula¬ 
tion. It also increa.ses the quantity of home currency in circulation, 
with the increase equal in value to the reduced amount of foreign 
currency in circulation, since sales of foreign currency mean pur¬ 
chases of home currency. The opposite effect occurs when home 
currency is sold to the ERSA. This means that the operations of the 
ERSA do not change the aggregate value of monies in circulation; 
they only change their composition. However, transfer policies affect 
the aggregate quantity of money in circulation. The present regime is 
in line with recent modelings of fixed exchange-rale regimes in that 
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the Stabilization of the exchange rate per se does not affect the 
aggregate supply of money in the world economy * 

In the cooperative hxed exchange-rate regime the changes in 
money supply caused by the ERSA are (using the interest parity 
condition and the convention that variables with subscripts t — 1 are 
zero for t = I): 

XpT — VXift (1 + — nipi^i ~ (27) 

_ -t- (1 -F 

f 

e 


T’hus, 


Xhi ~ + (1 + B^,] 

— e[Mfi + (1 + ~ — £p(]. 

= 0. 1=1.2. T. 


(28) 

(29) 


The definition of an equilibrium should by now be obvious and I 
abstain from stating it. 

Using the equilibrium and parity conditions, one derives—by 
means of the method employed in the previous section—the follow¬ 
ing budget constraints on consumption vectors: 

T T T 

^ ~ 2 ^ (30) 

(=i r-i f»i 

I T T 

^ dtpHi^* ~ ^ d,pntypf — f ^ dtXfi. (30a) 

(=1 (=1 1=1 

Due to (29), (30) implies that the equilibrium allocation of consump¬ 
tion in the cooperative fixed exchange-rate regime coincides with the 
equilibrium allocation of consumption in a barter economy if and only 
if the present value of monetary injections of currency H (or contrac¬ 
tions of currency F) caused by the operations of the ERSA is zero; that 
is, X/=id(XJi, = 0. I have shown in the previous section that in the 
one-sided fixed exchange-rate regime XLjdfXg, is necessarily zero in 
an equilibrium. Here this is not the case; in an equilibrium of the 
present regime the present value of home currency injections 


' If in a cooperative peg only one country sterilizes changes in its money supply that 
result from balance-of-payments adjustments, then the behavior of money supplies in 
this regime will resemble the behavior of money supplies in a one-sided peg regime. It 
is interesting to note that the prediction of the one-sided peg regime about the behavior 
of the world price level is consistent with the empirical findings in Genberg and 
Swoboda (1977) (i.e., that the money supply of the passive country, the United States, 
explains the world’s dollar price level). Since they took the standard view of a fixed 
exchange rate (i.e., a cooperative peg), they explained their findings with an argument 
about sterilization on the part of the United States, an argument that they did not test. 
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through the balance of payments need not be zero.'® If it is positive, 
the home country’s present value of consumption is less than the 
present value of its output, and the foreign country’s present value of 
consumption exceeds its present value of output. If it is negative, the 
opposite holds. In the former case it is as if the home country is 
making a transfer to the foreign country, while in the latter case it is as 
if the foreign country is making a transfer to the home country. Since 
in either case the allocation is efficient, then—to the extent that the 
transferer loses in welfare terms (which is not always the case)—in the 
former case the home country is worse off and the foreign country 
better off than in a floating exchange-rate regime (or a one-sided 
peg), while in the latter case the opposite holds. Since the outcome 
depends on the level of the exchange rate and on each country's 
transfer policy, countries have conflicting interests in the determina¬ 
tion of the exchange-rate level and monetary policies. 

VII. Concluding Remarks 

I have presented in this study an evaluation of exchange-rate regimes 
for a world economy in which there exists a single friction; that is, 
there exists a payments period and monies are required for market 
transactions. It has been shown that this friction has certain distribu¬ 
tional implications in each exchange-rate regime, but it does not 
introduce inefficiencies into the economic system. All results are 
easily extended to a many-country framework. In a way, these results 
provide a benchmark for the evaluation of more realistic models and 
their implications about exchange-rate regimes. 

There are various rigidities and imperfections (e.g., wage contracts, 
market imperfection) whose existence may have a bearing on the 
relative desirability of alternative exchange-rate regimes. Thus, it is 
possible that a particular exchange-rate regime performs better than 
others under certain types of imperfections. A characterization of 
imperfections under which each exchange-rate regime performs best 
will be valuable. Some work on the effects of uncertainty and imper¬ 
fect capital markets is reported in Helpman and Razin (1980) and 
Lucas (19801>). Lucas provides conditions under which a floating 
exchange-rate regime is equivalent (from the allocational viewpoint) 
to a fixed exchange-rate regime, despite the existence of uncertainty 
and incomplete capital markets, while Helpman and Razin provide 
conditions under which a floating exchange-rate regime performs 
better than a one-sided peg. 

There is also another aspect of exchange-rate regimes which has 
An example showing this point is presented in Helpman (1979). 
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received no attention in a welfare context but which is of major 
importance, and this is the strategic behavior of governments. I have 
shown that the equilibrium allocation of consumption is efficient in 
every exchange-rate regime and that, in a floating exchange-rate 
regime as well as in a one-sided peg regime, it corresponds to the 
equilibrium allocation of consumption in a barter economy. A crucial 
assumption on which these results are based is that expectations are 
self-fulfilling; that is, there are no surprises. In particular, the transfer 
policies are announced at the very beginning, and each government 
honors its declared policy. What happens if in the course of time one 
of the governments deviates from its announced policy? This type of 
surpri.se destroys both results. This can be seen as follows. Suppose 
that at .some point in time the foreign country is a net creditor in 
terms of foreign currency and it has zero net debt in terms of home 
currency. Then, by reducing the quantity of foreign money it in¬ 
creases via exchange-rate changes the real value of payments that it 
will receive for its outstanding credit." Thus, governments have an 
incentive to generate monetary surprises. But if they do it, one cannot 
analyze the consumer problem the way it was done in this paper, 
because the consumer will learn about the possible surprises and will 
accordingly adjust his decisions. Other governments will also adjust 
their policy. Hence, a game-theoretic framework, with an appropriate 
reformulation of individual problems, is required in order to analyze 
these issues properly. 

The incentive for strategic behavior is even more apparent in the 
cooperative-peg regime. Recall that in this regime the exchange-rate 
level and the preannounced monetary policies determine real trans¬ 
fers among countries. 'I'here are, therefore, conflicting interests in the 
choice of an exchange-rate level and monetary (transfer) policies. In 
particular, once an exchange-rate level has been chosen, then in the 
course of time countries may try to secure for them.selves higher 
welfare levels by departing from the preannounced monetary 
policies. Such unexpected departures generate allocational inefficien¬ 
cies. Again, in order to analyze this problem properly, one needs an 
appropriate game-theoretic framework. 

The need for an extension of the present analysis in order to take 
account of strategic behavior is clear from the context of this paper. 
However, such an analysis may also help to explain changes in 
exchange-rate regimes that took place in the world economy, such as 
the collapse of the Bretton Woods system. 

" A numerical example showing this eltect is presented in Helpman (1979). .Similar 
effects exist in the one-sided peg regime. 
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Appendix 

Here 1 prove that in a floating exchange-rate regime equilibrium the right- 
hand side of the wealth constraint in (12) equals A similar 

method can be used to show that the right-hand side of the wealth constraint 
in (13) equals f'he proof proceeds as follows: 


r-1 T T T 

^'dHt+ipHiym dm^H! ~ duipHtynt dii^\ 

(=i f=i ^ t=\ ^ 

T-i 

+ ~ dH,)pii,yif, — difTpHTyHT- 


(AI) 


However, I have shown in Section HI that for («, > 0 (01 dy„+, < rf„,) no one 
uses domestic money as a store of value, and that in the last period all money 
IS spent. These results, together with the equilibrium conditions b and d in the 
definition of a Moating exchange-rale regime equilihi ium, imply 

(.diii+i — dni)pHiyHi — (dfii+i ~ dfiP^Mii + t — 1,2, . . , , T — 1, 

T 

Pinym = + ^^ht- 

T=»l 

This is so, since when domestic money is not used as a store of value in period 
t, all of it is spent on goods, implying= M,i, and pHtb%t = ‘^m- Using 
condition b this implies that pmyHt = which yields pHtym = Mh + 

when combined with d. Thus, the above relations hold for t„, > 0, but 
they hold also trivially for t,i, = 0 since ini = 0 if and only if d„,+ , = d„,. Using 
these lesulls, we obtain: 


7-1 r-i ( 

^^(d^i+i — di„)piiiyni — di,ip„fy„^ = ^(du(+i — dnt)\M^ + 

,=1 (-1 ' T-l ' 

T T 

' T=l ' /“I 

The last equality is obtained from the fad that 

T-i T-l T-i T-l 


(A2) 


and 


7-1 t T-l f 7 

X/fT “ — djfr^Xfi 


f=1 T=1 


f=l r»| 

T-l I 




f= 1 T-I 
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^ V V _ V 

— / ,“»(/ / Am / .-^Wt ~ -X/ii — (IhtXht -/ 4hiXh 

<*2 T»t <"3 T«l J —I 

Combining (Al) with (A2), we obtain 

T-l T T 

^^ dHtpHtym- 


This completes the proof for the home country. A similar proof applies to the 
foreign country. 
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Energy-production subsidies are, paradoxically, shown to be likely to 
increase U.S. dependence on imported oil. Sundard studies of net 
energy yields are shown to be seriously biased upward for two rea¬ 
sons, First, many omit indirect energy inputs, which, our input- 
output calculation shows, probably causes large errors. Second, those 
studies all omit the energy opportunity costs of nonenergy inputs 
(e.g., the fuel substituted elsewhere for the labor used to produce 
energy). We prove that, absent externalities, any fuel-output subsidy 
which causes an otherwise unprofitable expansion must yield an 
incremental fuel output smaller than the increment in energy input 
plus the energy opportunity costs of other inputs. 


The policy measures under consideration to decrease our de¬ 
pendence on imported oil now seem to include two leading candi¬ 
dates: (i) taxes or tariffs to discourage consumption and increase the 
profitability of alternative energy sources and (ii) a program of sub¬ 
sidies to innovative energy sources. Politicians have generally em¬ 
braced the latter as the one means of salvation from an alternative 


We are sincerely grateful to the Sloan Foundauon, wh&se grant greatly faciliuied 
completion of this study. Sue Anne Batey Blackman provided invaluable research on 
net-energy study techniques in practice and has given us very helpful comments. We 
are also grateful to James Sweeney of Stanford University for valuable comments on an 
earlier draft which helped us to avoid some misleading statements of our results. 
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that appears to them to be politically suicidal, while economists have 
almost uniformly favored the former, arguing that the program of 
special subsidies is an invitation to inefficiency. 

In this paper we will come to a similar but considerably stronger 
conclusion. We will argue that the difference between the two ap¬ 
proaches is potentially more significant and serious than even most 
economists had imagined. Our main conclusion can best be stated in 
the form of an example. Both programs we are considering can be 
used to stimulate the production of oil from sources such as shale. 
Any expansion in this activity stimulated by a tariff on imported oil 
can be expected to have a positive net energy yield and, in particular, 
a positive net yield of oil. However, an expansion in the same activity 
induced by a program of subsidies is likely to have a negative net yield 
of petroleum (!) if other outputs in the economy remain unchanged. 
Of course, in practice the subsidy is likely to cause some reallocation 
of resources that reduces at least some other (energy-using) economic 
activities. The final result may well be an overall increase in the 
nation’s stock of energy. However, we will show, under very plausible 
assumptions, that the increase in oil stocks could always have been 
made greater still if exactly the same reduction in other energy-using 
activities had occurred by it.self, with no expansion in oil shale pro¬ 
cessing. The implications for current policy are, clearly, more than a 
little disquieting. 

This result actually has a straightforward explanation, as we will see. 
Basically, the extreme difference between the efficiency of the general 
tariff and that of the targeted .subsidy lies in the fact that the latter 
does not lead to the appropriate substitutions of energy and other 
inputs throughout the economy that should be part of a program to 
increase oil stocks. Consequently, the subsidy induces the employ¬ 
ment in oil shale processing of inputs which could have been substi¬ 
tuted elsewhere in the economy for larger quantities of oil than they 
can yield through oil shale convension. 

Several comments are required before we turn to the body of our 
analysis. First, the basic premise on which the entire discussion pro¬ 
ceeds is that vulnerability of the nation to blackmail by foreign gov¬ 
ernments is a true social cost and can legitimately be treated as a 
detrimental externality of any increase in oil consumption. This and 
self-defense against economic exploitation by the cartels constitute the 
justification of a tariff on oil imports. In the absence of these two 
problems it could be argued that the rising cost of petroleum is an 
ordinary resource-depletion problem which can, perhaps, be handled 
reasonably well (distributive considerations aside) by the market, 
uninfluenced by any particular government intervention. 
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Second, it should be obvious that government programs other than 
the two we are considering are also possible. For example, subsidies to 
new energy sources can be accompanied by a set of taxes on use of oil 
which together will constitute a reasonably efficient program. This is 
rather analogous to the substitution of a combined program of taxes 
and subsidies for a system of effluent charges.' However, because of 
the difficulties of designing and administering an ideal mixed pro¬ 
gram, together with the fact that it is (apparently) not being consid¬ 
ered by decision makers anywhere, we have chosen not to deal 
explicitly with such alternative measures. 

Finally, it must be emphasized that while we believe that a tariff or 
some close substitute is the best way to encourage investment in solar 
energy, biomass, and other innovative energy sources, and-that direct 
subsidies are likely actually to yield net energy losses in the sense 
indicated, we will see that there can be exceptions. Where substantial 
outlays on research and development must precede a substantial 
increase in such an energy-producing activity, and if they introduce 
significant free-rider problems, then for obvious reasons the market 
will not allocate a sufficiently large quantitv of resources for the 
purpose. In such a case a subsidy may be entirely justified. But our 
analysis suggests that such exceptions should be permitted only on the 
basis of extremely strong evidence, in light of the danger that such a 
subsidy will actually exacerbate the nation's energy problem. 

In this paper we will also show that the standard net-energy studies 
are characterized by a bias toward overestimation of the net yields of 
the processes they examine, only part of which has previously been 
recognized. This bias is not a matter of overenthusiasm of inves¬ 
tigators, which leads them inadvertently or deliberately to distort their 
results, but is the consequence of .several technical shortcomings in 
their study procedures. One of their difficulties is that, while net- 
energy studies commonly take into account the direct fuel inputs used 
in the processes which they examine, many of them consider only part 
of the remainder of the multiplier-like series—the materials and 
machines needed to make those materials and machines, and so 
on—all of which, obviously, have their own energy requirements. We 
will provide empirical evidence which indicates that the omissions are 
sometimes far from negligible. Moreover, even when the total energy 
requirements of a new process are “correctly” calculated using input- 
output techniques, only the average energy requirements per unit of 


‘ We are indebted to James Sweeney lor drawing our attention to this sort of mixed 
policy. Note, however, that a mixed program is not likely to be a perfect replacement 
for a tariff on oil imports. Thus, as is well known, in enviionmental programs partial 
substitution of subsidies lor effluent charges is likely to lead to inferior results. 
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energy output will thereby be determined. The nargttuil energy 
quirements are J/keiy to be higher than the average requirements, and 
therefore the marginal net energy yield may be considerably lower 
than the calculated average energy yield. 

But this is not their only omission. The nonenergy resources that go 
into an energy-production process have an energy opportunity cost, 
as we have already noted, for they would instead have been used, at 
least in part, as substitutes for energy in other sectors of the economy. 
This energy opportunity cost is generally left out of net-energy 
studies, though it is in fact a very real concomitant of the production 
process. As a result of all of the biases that have just been mentioned, 
a process which appears to pass the net-energy test by a narrow 
margin is nevertheless likely to fail it in fact. 

While the indirect energy costs and the energy opportunity costs of 
a production process may prove very difficult to measure directly, we 
will show here that there is (at least at the margin) a duality relation 
between profitability and net energy yield which permits the former 
to stand as a proxy for the latter. We will prove that any energy- 
production process which is unprofitable can from this fact alone be 
considered a potential net energy loser in the sense indicated earlier. 
Consequently, we can no longer presume that energy-losing projects 
are apt to be rare and pathological exceptions. Rather, we are forced 
to regard any innovative energy project which can be carried out only 
with the aid of government subsidies as a proper subject of suspicion. 


I. The Energy Opportunity Cost of Nonenergy Inputs 

I he first issue we raise is the failure of many net-energy studies to 
carry their calculation of the energy used in the production of energy 
a sufficient number of “rounds” into the past, to determine the energy 
content of the inputs used to make the inputs, and so on, used in the 
energy-production process. Many of the net-energy studies employ a 
procedure called “process analysis” which involves explicit engineer¬ 
ing descriptions of all the steps involved in a manufacturing process, 
and it is easy to see why it is impractical for this method to encompass 
many earlier rounds. Although this problem has previously been 
recognized in the literature, the magnitude of the resultant bias seems 
not to have been investigated. We therefore carried out some calcula¬ 
tions to evaluate the magnitude of the error,* using 1972 input-output 

* .See, c.g., Berry 1978-79. It is noteworthy that some of these studies are careful to 
warn the readers of this source of bias (see, e.g.. Perry et al. 1977; Berry 1978-79; Pels 
and Williams 1978), while others (e.g., Baron 1976, 1978; Hunt 1976) do not mention 
the subject. 
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data provided by the Bureau of Economic Analysis® and 1967 ener^ 
coefficients provided by the Center for Advanced Computation 
(1975). Letting the matrix A represent the Leontief interindustry 
coefficient matrix and D the vector of direct primary energy con¬ 
sumed per unit of sectoral output/ the total energy consumed per 
unit of sectoral output E is given, for an average sector, by £ = D(/ - 
A)-'. 

In some process-analysis studies, only the direct energy cost per 
unit of output of a new energy-producing process is considered, only 
taking account of the vector/). In others, the energy cost includes the 
“first-round” energy content of the new process and is evaluated, in 
effect yielding the sum (D + DA ). Reportedly, only the most ambitious 
process analyses include the “second-round” content, evaluating the 
energy cost (D + DA + DA^). Thus even these studies omit the portion 
(DA^ -I- DA* of the cost of a new energy process, and so they 

overestimate its net energy yield correspondingly. 

We carried out calculations to estimate the magnitude of the errors 
that can result.® We first looked at the discrepancy between the direct 

® For a description of the data, see Riu (1979) and Ritz, Roberts, and Young (1979), 

* The direct energy requirements for each sector are valued in British thermal units 
and were divided into four sources in the original data: (1) coal, (2) petroleum prtxl- 
iicts, (3) natural gas, and (4) electricity. This breakdown created an immediate problem 
for the calculation of total energy requirements, for electricity is not a primary energy 
source but a manufactured form of energy. It must be produced from coal, pietroleum, 
natural gas, uranium, running water, geothermal energy, solar energy, or some other 
natural source. Thus there is double counting in the tiriginal energy matrix which 
includes both the oil, coal, and natural gas used to produce electricity and the electricity 
Itself. Therefore, we first assigned the primary-energy sources used to produce elec¬ 
tricity to the electricity-using sectors, since we still want to classify the electricity as a 
direct energfy input. Thus we pretended, in effect, that each industry produces its own 
electric power (as some, indeed, do). In the 85-sector scheme, there are two sectors that 
sell electricity—utilities (sector 68) and state and local enterprises (sector 79), which 
includes public utilities. Our first step was, then, to distribute the coal, petroleum, and 
natural gas used in the two utility sectors to the nonutility sectors in proportion to the 
dollar purchases of the nonutility sectors from the uutity sectors. The next step in the 
assignment of electricity generation among its users was to distribute the intermediate 
inputs of the two utility sectors to the nonutility sectors in proptrrtion to the dollar 
purchases by the latter from the former. The last step was to enter zeros into the cells 
recording the purchases from the utility sectors by the nonutility sectors. 

’ We carried out two more modifications in our esumation pnxjctdure. The first was 
to take into account the energy content of the fixed capital stot;k. (The A matrix consisLs 
only of intermediate flows.) Since a piece of capital is used to produce more than one 
unit of output, the appropriate addition is the total energy content of the capital stock 
of each sector per unit (or, in our case, per dollar) of output. I'o estimate this, we added 
a row of depreciation coefficients (dcpreaalioii per dollar of output) to the A matrix. 
Moreover, we evaluated the energy content of the depreciated portion of the capital 
stock per dollar of output not at its historical value but at its replacement value. To do 
this, we assumed that new capital stock by component was added in each sector in 
propordon to the sectoral components of the gross private capital formadon vector in 
final demand. A new coefficient column was therefore added to the A matrix corre- 
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energy requirements and the total energy requirements in the man¬ 
ufacturing sectors in the input-output table. The (weighted) average 
ratio DjEi was 0.36 (where the subscript i refers to sector *), meaning 
that such a calculation, on the average, omits some 64 percent of the 
total energy cost. The individual ratios ranged from 0.09 in the 
apparel industry to 0.62 in chemicals. Next, we looked at the ratio of 
the direct plus first-round energy requirements (D + DA) to the total 
energy requirements, which averaged 0.66 and ranged from 0.30 in 
apparel to 0.83 in chemicals. Finally, the ratio of direct plus first- 
round plus second-round energy requirements {D -I- DA + DA^) to 
total energy requirements averaged 0.78 and began from as low a 
figure as 0.53. Even using the direct plus first-round plus second- 
round estimates of energy requirements, we find an average error of 
the order of 28 percent, which can severely bias standard estimates of 
net energy yield. This means that if a process-analysis calculation 
indicates that a given process will offer a positive net energy yield as 
high as, say, 20 percent, it is in fact likely to be a net energy loser. 

But the omission of some indirect energy uses by net-energy studies 
is not the only significant source of overoptimism in their estimates of 
the net yield of new energy .source.s, for the calculation still does not 
take into account the potential energy-saving powers of the labor, the 
tnctals, and the other primary resources (aside from energy) used 
directly and itidirectly in the production of energy. We will provide a 
formal discussion of this point, whose implication will be that any 
unprofitable energy source is a proper candidate for suspicion that it 
will be a net energy loser, and in later sections still stronger proposi¬ 
tions will be of fered. But first a simple example may help to indicate 
the nature of the i.ssue. 

Suppose that an economy has two outputs, manufactured energy 
and bread, and two primary resources, energy and labor; that bread 
can be produced by either of two processes, one labor intensive and 
one energy intensive; and that both processes are in fact currently in 
use. Then, other things being equal, if employment is constant, every 
labor hour used in the manufacture of energy increases the amount 
of bread that must be produced by the energy-intensive rather than 
the labor-intensive process. 'I'he corresponding increase in energy use 


s[x>nding to the dcprecialitm row which was equal to the percentage distribution of 
gross private c apilal lormauon over its components. I'he second adjustment was to add 
some measure of the energy cttiitenl of the (intermediate) imports used in production. 
Since we are not really concerned with the foreign energy content of the imports but of 
the domesuc energy opportunity cost, the relevant measure is the energy content of the 
exports traded in exchange lor these imports. To calculate this, we added a vector of 
export coefficients corresponding to the import coefficient row. This procedure im¬ 
plicitly values the energy conteiii of the imports at the energy content of a correspond¬ 
ing dollar’s worth of exports. 
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is the energy-opportunity cost of the hour of labor.* Obviously, this 
should be included in a full calculation of net energy. If the man¬ 
ufacture of 1,000 Btu of energy uses up 900 Btu directly and indi¬ 
rectly and requires labor time which would instead have been em¬ 
ployed as a substitute for another 200 Btu, the process in question 
clearly depletes society’s stock of energy on balance. A calculation 
which omits this opportunity cost is simply incorrect and misleading. 


II. Profit as an Indicator of Energy Opportunity Cost 

Direct estimates of the magnitudes of such opportunity costs may not 
be easy to obtain, but we will show that the profits of the energy 
manufacturing firm can tell us a great deal about the magnitude of 
the energy opportunity cost of its operations. Let the economy’s 
production function for manufactured energy satisfy 

Piyi, ■ ■ • . . . , r„) = 0, (1) 

wherey, is the quantity of energy output, r,, is the quantity of primary 
inputy used in energy productioti, and rj is the quantity of primary 
resource^ used as input, directly and/or indirectly in all other outputs. 
Let ri, and r, represent the quantities of energy input, for the moment 
taking all energy to be homogeneous.^ 


* The reality and significance of ihe indirect energy cost of energy produi uon and of 
the energy opportunity cost of its other inputs is undeiscored by recent newspap>er 
a<counts. A fall in demand for nuclear electricity has led to a decrease in production of 
enriched uranium sufficient, according to the editor i>{' ElecCnral World magarine, to 
reduce electricity use in the enrichineni process by an amount equal to 1 pci cent of U.S. 
electricity consumption {New York Times 1980). And a pagc-1 story (Shabecoff 1980) 
indicates that rising energy prices are leading to a long-term shift toward the use of 
more labor in place of energy (e.g., use of additional workers to insulate homes and 
factories and even the hiring of more cowboys to drive cattle overland rather than 
transport them by truck), and it reports an estimate by Dale Jorgenson and E. A 
Hudson which suggests that a doubling of energy prices can iriciease employment bv 5 
percent. Althcjugh some may think that these estimates exaggerate the magntiudes of 
the indirect energy costs and the energy opporiuiiity costs, they clearly show that these 
are real and tangible phenomena which tan be expected to Ijecome increasingly obvious 
if energy prices continue to rise. 

^ Here, the y/s refer to net output—that is. final output—instead of total output 
I'hus the vector of final demand is 



An intermediate output will, essentially, be valued in terms of iLs primary-resource 
content. More specifically, using a standard inpul-ouipui striKture, define 
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Then the production of energy will be profitable if® 

/>iyi > (2) 

where p, is the price of both the energy output and the energy input 


qt, ■ . -7im 


^■1 • • • ^nm 

to be the n by m coefficient matrix., showing the amount of each primary input I 
through n r cqiiirecJ per unit of output for each of the m sectors. We will let the first row 
of contain the direct energy inputs in each of the sectors per unit of output. 
Therefore, 

d, ...d„ s 
Vai ^2m I ■ 

‘/m - qnmf 

rhe sum of the clircd and indiro< t primary inputs retjuired per unit of final demand is 
then given by matrix f 





where the ftrst tow of t indicates the diret t plus indirect requiretnents of the primary 
energy source. I heii, for a given vector of final demand, y, the total primary input 
ic-c|iurcmc‘ms ate given by vector w: 


U' = 


We can use the expressions 
fii =>1^1 
r,2 = J.tji 


+ >.«!+ • +y„e» 

>»s?l + >jjj. + • • . + y^im 

>15.1 +yi-'.2 + • ■+y^nm 

r, = yzcj + .. . + y^m 
r, = y^2i yurt*!. 

and 



Cl. — y [C.i r, — y^^HZ + . . . yii.tn„ 

to define the symbols used in ecj. (1). Note, morecivcr, that the same calculation can be 
reperformed using several alternative tec hniques for eac h of the sectors—i.e., alterna¬ 
tive column values lor a and q. This can yield a continuous production function F in the 
limit. 

' Here it Is assumed that energy is produced in a process that is completely separate 
from other outputs so that r,, can be measured unambiguously because there are no 
common input costs. 
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used by it.* Here and throughout the remainder of the paper, we will 
takepi to be a constant; that is, we will assume that the United States is 
a price taker on the world energy market. This permits us to avoid the 
complications of analysis of strategic price behavior that the United 
States might want to consider. However, a bit will be said on this 
subject later. 

Assume that input prices either are given exogenously or are for 
some other reason beyond the control of the firms that use them. 
Moreover, whether or not input-using firms are competitive, assume 
that all, or at least a substantial number, of them are cost minimizers, 
that is, that they seek to produce whatever output vector they select at 
minimum total cost. Then, as is well known, they will purchase those 
input quantities that satisfy 

pilp, = F,j/F„ = dr,./dr„ = dr./dr^, (3) 

where F^ = dFIdr^j, and so forth. Consequently, the profitability 
criterion, (2), can be rewritten as 

<1^ " 

>1 ^ ''ll + X = >'11 + X 

'r (4) 

= »'1I + X 

III. Tendency of Total Profit to Underestimate Total Net Energy 
Yield 

The equivalence of (2) and (4) immediately suggests the relation 
between unprofitability and net energy loss, for 

r,jar,,/ar,j = r,jar,/arj (5) 

appears to be an indicator of the energy opportunity cost of input j 
used in the energy manufacturing process; It is the quantity of input j 
used multiplied by the marginal quantity of energy input for which a 
unit ofj can be substituted, ceteris paribus; that is, it is multiplied 
by the marginal rate of substitution ofj for energy. Unfortunately, (5) 

• One should note that here a crucial feature of energy production enters, namely, the 
fact that both the output and one of the inputs are reaJly the same commodity. This is 
what permits us to assign the same price, p,. to both the energy input, r,,. and the 
energy output, y,. This is rather reminiscent of Quesnay’s Tableau economujue, where the 
output of agriculture was precisely the same product as its input—corn. The fact that 
energy is both an input and the output in this process permits us to divide (Si), through 
by the product price.pi, to obtain a ratio of input prices,^ which can be interpreted 
with the aid of efficiency condition (3). For noneconomist readers we may note that the 
standard relation (3) is obtained directly by minimizadon of total cost, Xp/,,. subject to 
the production funcdon constraint F(-) = 0. 
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is not the opportunity cost we are seeking, and we will show now that 
the correct figure is likely to tie lower than (5) in many plausible 
circumstances. 

The figure we really want corre.sponds to the saving in energy 
which would be permitted not by a small reduction in the use of; in 
activity 1 but by total elimination of the use ofj for this purpose. To 
permit us to describe the issue with the aid of a two-dimensional 
diagram first, a.ssume there are only two inputs, energy, r,, and 
another input, r.^. The upper part of hgure 1 represents the man¬ 
ufacture of all goods other than energy, andyy' is the isoproduct locus 
corresponding to the initial equilibrium, with point E representing 
the initial input combination. 

Immediately below point E we have drawn the r„ axis of a graph 
representing the manufacture of energy, with £, its initial input 
vector, .so that is the initial quantity of input 2 used in the process. 
If all firms are cost minimizers, dr,/dr.i, the slope ofyy' at£, will equal 
<fr,,Wr, 2 , the slope of y,y! at £|. 

Transfer of all of r ,2 from production ot energy to production of 
other goods, holding the output of the latter constant, will move the 
economy from points toE' on curveyv' in the upper figure, thereby 
reducing the use of input 1 by S/t' = Ar, = ri^ArJAr^. This is the true 
energy opportunity cost of the use of input 2 in production of energy. 

In general terms, to determine the energy opportunity cost of input 
;, we must take the quantity of this input used in energy production to 
fall from r,j all the way to zero. Simultaneously, we must take that 
amount of inputj, Ar j = r,^, to be transferred into activities elsewhere 
in the economy and determine the quantity of energy that Ar j can 
replace in those other industries, holding all those outputs constant. 
The opportunity-cost figure we are seeking, then, is rijAr,/Arj, the 
decrement in use of energy permitted by the incremental u.se of Arj = 
r ,2 elsewhere in the economy. Thus our net energy criterion becomes, 
instead of (4), 

n 

yi ^ + X (6) 

This relationship constitutes our definition of the net energy yield of 
an activity: It measures whether the energy output of the activity is 
greater than the direct and indirect use of energy inputs plus the sum 
of the energy opportunity costs of all other inputs used in energy 
production if all other activities in the economy remain constant. 

Let us now .see the relation of net energy criterion (6) to the 
profitability criterion (4) and, in particular, to the terms r,j5r,/drj 
which it employs. In figure 1 draw the tangent fT toyy' at£. Then, 
clearly, 

rj2dr,/drj = ST > SE' = Ti^ArJAr^. (7) 
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Fir.. 1.—Calculation of an input’s potential energy opportunity cost 

This inequality will hold whenever the production surface is quasi¬ 
concave so that the isoproduct curve will be convex to the origin, as is 
yy'. For then, with the slope oiyy' diminishing in absolute value as one 
moves to the right, Arj/Arj must always be smaller, absolutely, than 
drildr^. Intuitively, this is so because as more and more is used to 
replace energy in general production, one can be expected to be left 
with energy uses which become ever increasingly difficult to replace, 
for efficiency in the substitution process requires the easy energy- 
replacement opportunities to be used up first. 

Thus, where (as usually assumed) the isoproduct curve has the 
shape ofyy' the relationship equivalent to (7) can be expected to hold 
for all inputs,/ It then follows that the RHS (right-hand side) of (6) 
can generally be expected to be smaller than that of (4). Since (4) is the 
test of profitability, while (6) is the test of net energy yield, it follows 
that the latter may frequently be easier to pass than the former. That 
is, a new energy manufacturing process may be unprofitable and yet, 
nevertheless, yield a positive net energy output. However, the dif¬ 
ference can be expected to be small. The reason is that what is at i.ssue 
is the difference between marginal and average opportunity cost, not 
in the energy-producing industry but in the remainder of the econ¬ 
omy taken as a whole. Since the quantities of input involved are likely 
to be small relative to the economy’s total use of these inputs, the 
range of the isoproduct hypersurface traversed can be expected to be 
negligible. Thus the change in slope and, consequently, the difference 
between (4) and (6) should normally be small. 

It must be reemphasized that the calculation is all predicated on the assumption 
that all outputs other than domestic energy production remain constant. It also ignores 
any price effects that can be expected to follow from a large change in an energy- 
producing activity. 
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*^CONOVv 


fVe may then arrive at: 

Proposition I: While unprohtabiJity does constitute grounds fotsusp,. 
cion that a new energy process will be a net energy loser, it cannot be 
taken necessarily to reject that process on net-energy grounds. 

However, even this guarded conclusion suggests a judgment more 
san^fuine lhan the situation warrants, as will be shown in the next 
section. 


IV. Marginal Unprofitability Implies Negative Marginal Yield of 
Net Energy 

We will show now: 

Proposition 2: If firms in the economy minimize the cost of produc¬ 
ing their output vectors, a marginal increase in energy output from 
any process that has a negative profit yield must also have a negative 
potential marginal yield of net energy when other outputs are held 
constant. 

I'his conclusion has policy implications which are extremely dis¬ 
turbing: 

Corollary: If a firm that invests in an energy-producing process can 
capture all of its profit yield (i.e., if there are no externalities or similar 
market failures), any expansion in the manufacture of energy (be¬ 
yond the quantity that maximizes profits) that is induced by subsidy 
unaccompanied by a specially designed program of taxes on energy 
use elsewhere in the economy must be excessive in the sense that (at 
least) a small reduction in that energy output can increase the economy’s 
energy stocks if all other activities in the economy are unaffected," 

Put alternatively, since such an expansion in the energy-producing 
process may well lead indirectly to a reduction in other activities in the 
economy, some energy may well be saved as a consequence. But any 
saving in energy would have been greater if those other activities had 
been reduced in the same way without the expansion in the energy- 
production process that is subsidized. 

In other words, whenever a subsidy to an energy-producing activity 
leads to its expansion beyond the level to which it would be brought by 
market forces, it can be expected to have a negative (marginal) net 
energy yield in this sense. That is, under the circumstances postulated 
any subsidy may make matters worse, not just in cost-benefit terms, as 
we might normally expect such a subsidy to do, but in terms of 
physical energy supplies. 

" This assumed implicitly that the profit surface is concave (hill shaped) and that in 
the absence of subsidy the firm will operate at the point of maximum profit or beyond it 
(as it will, e.g., under sales maximization). Then the subsidy-free output must corre¬ 
spond to a downward-sloping portion of the relevant cross section of the profit hyper¬ 
surface, and the result applies. 
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The proof of this disturbing proposition is straightforward. Nega¬ 
tive marginal profitability (on our parametric price assumption) is 
described by** 

pidy^ < tPsdttj, ( 8 ) 

but by (3), (8) can be rewritten as 

R 

dyi < dr II + ^ dr,,dr,/dr(9) 

i“2 

which tells us that the marginal net energy yield of output y^ is 
negative. Q.E.D. 

One remarkable feature of this proof is that it does not rest on a 
restrictive premise such as the presence of perfect competition. The 
key assumption which underlies (3) is that firms minimize the total 
cost of whatever output they produce. But this holds for a wide 
variety of market forms—tor example, profit-maximizing oligopolists, 
monopolists or monopolistic comfjetitors, and firms which maximize 
total revenue (sales)** or growth rather than profits.** 

Nor is it necessary that this premise be satisfied throughout the 
economy, as long as it holds for the sectors where the inputs will go if 
they are not used in energy production, and, in particular, (3) does 
not have to be satisfied in the energy-producing sector itself. That is, 
we do not have to assume that the economy is initially operating 
efficiently, for the opportunity cost, as we have seen, is calculated not 
in the energy-producing sector but in the remainder of the economy 
where input j can be substituted for energy use. Thus, if somewhere 
else in the economy least-cost production occurs and satisfies= 
dr,/drj, a trade-off on the terms indicated is indeed possible.** And if 


This assumes thal energy price, is determined on ihc world market and i.s 
therefore uninfluenced by the output of the domesuc industry that manufactures 
energy. If this is not so,R„ = pi(dy, — dr„) should be replaced by /tj = d(piVi ~ ~ 

tVi ~ ’'ii)dpi + ptidyi - dr^i). With a negatively sloping demand curve, whenrfvi > 0 then 
Rt<R,. This means that when margin^ proht is zero, the marginal net energy yield will 
be positive so that some subsidy to the output of a profit-maximizing producer of energy 
will in this case contribute to the economy's energy stocks. However, since it seems 
unlikely that the magnitude of the output of any one producer of synthetic fuels, e.g., 
will be sufficient to have a significant effect on internauonal oil prices, this point does 
not seem particularly important for policy. 

However, our result does have special implicauons for firms with objectives such as 
sales maximization which drive them beyond the point of maximum profits even in the 
absence of subsidy. If energy production is in the hands ofsuch a firm, our result shows 
that even without subsidy, its energy output will be carried to a point where its marginal 
net energy yield is negative, so that a restriction in its output will actually increase the 
economy’s energy stocks. Thus a lax on such an activity, rather than just a zero subsidy, 
is called for, at least in theory. 

Thus this differs from the usual conclusion that subsidy will reduce welfare in that 
our current conclusion is expressed in physical rather than psychological (uulity) terms 
and, unlike the usual result, does not require universal perfect competition. 

Thus the neoclassical assumption of substitutability at the margin, as implied in 
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this relationship holds in those sectors where the resources will actu¬ 
ally be used, such a trade-off will in fact occur. 

V. On the Reality of the Energy Opportunity Costs 

We must be careful not to exaggerate the strength of our result. An 
energy opportunity cost clearly does represent a potential energy loss, 
and in that sen.se an expansion of an energy-production process doei, 
repre.sent an energy loss relative to what would have been achieved if 
its resources had instead been used in an optimal program to increase 
energy stocks. But in reality, if the energy-producing activity were 
reduced from y,,, its magnitude under a subsidy, to its profit- 
maximizing level, the inputs released as a result might very well not be 
u.sed in a way that maximizes their contribution to the energy pro¬ 
gram. First, they might not be used efficiently and, second, they 
would probably be used in part for other purposes. They would be 
used in an efficient manner if (3) were to hold everywhere at either 
level of >(, that is, if all firms in the economy were to produce 
whatever output vectors they choose to supply at the minimum cost 
for that vector. But if that is not so, the actual energy loss resulting 
from the increa.sed use of other inputs needed to raise y, toy,, may 
well be less than our calculations indicate. In addition, whether or not 
they were reallocated efficiently, in practice, only some portion of 
those reallocated resources would be used as a substitute for oil. I'hai 
is why the corollary to proposition 2 has been put rather guardedly. 

Nevertheless, those losses will almost certainly be real (and very 
possibly substantial, as is suggested by the material reported in n. 6 
above). That is, if the energy-producing activity is in fact reduced and 
the corresponding input quantities are thiown on the market, the 
forces of supply and demand can be expected to cause a readjustment 
in relative input prices which induce the employment of at least some 
of those inputs elsewhere in the economy. 7’hose other inputs will then 
in fact serve (in part) as energy substitutes, fhus the marginal energy 
opportunity cost of the labor, materials, and other primary inputs 
u.sed in energy production will represent some actual loss in energy 
stocks and not a remote possibility of substitution which is perhaps 
unlikely to occur.'® Interpreted in this sense, our propositions no 


pjp I = drjdr^, need not hold thioughout the economy. For the weaker interpretation 
of our result (see next section) to be valid, all that is required is that somewhere in the 
economy the resources dr,, absorbed by the energy .sector when its output is increased 
byrfvi, he substitutable for energy. It .seems hard to imagine that this weak premise will 
ever fail to be satisfied. 

'* .See ti. 2 alxive. We may inquire briefly about the implications of violations of some 
of the premises underlying propiosiuon 2 In particular, we will considei (a) the effects 
of the presence of specialized inputs in the energy-using industry which are not usable 
elsewhere. (A) monopsony power in the input-using industry, and (c) cases where 
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longer guarantee that a subsidy will always cause an incremental net 
energy loss, but they nevertheless leave it under a dark shadow of 
suspicion. 

We can put the matter another way by recognizing explicitly that 
there are two different opportunity-cost concepts, which we may refer 
to as the potential and the actual opportunity losses. In our case, the 
former is the maximal amount of petroleum that can be saved by 
substitution of some quantity of another resource, Arj, while the latter 
refers to the amount of oil that will actually be saved by the use to 
which Azj will in fact be put when it is released from a subsidized 
activity. Our formal propositions evaluate the opportunity cost in 
terms of potential opportunity loss, but there is good reason for 
policymakers to be concerned with the actual opportunity loss, whose 
magnitude, while undoubtedly substantial, is certainly lower than the 
potential-loss figure. 

We can now use a diagram to suggest another way of looking at the 
analysis, which may help to remove any air of sleight of hand that may 
characterize our results.'^ In figure 2, two production-possibility 
fi ontiers are represented for the economy, showing the variables, net 
energy outputs and other outputs, on the axes. Here yy® represents 
the production-possibility frontier relevant without the subsidy I'or 
the energy supply technique, while Y'Y” represents the production- 
possibility frontier relevant with the energy supply subsidy. This latter 
frontier is drawn below and to the left of the former since the subsidy 
reduces economic efficiency. Here the reduction in economic effi¬ 
ciency can be approximated by the following integral; 

f 's(E)dE, 

J fCo 

where S(E) represents the subsidy offered per unit of supply, E 
represents the quantity of energy produced using the technique in 

management derives utility from the hiring of a quantity of input / greater than that 
winch minimizes cost, t he first of the.se works to undermine proposition 2 , since such 
specialized inputs enter the iru rernental profit expiession as a money cost though they 
incut no energy opportunity cost. I'hus they mav cause failure of the profit test even 
when the net energy test is passed However, the next type of violation of our premises 
can be expected to work the other way, slieiigthening proposition 2. Monopsony power 
over inputy means that in equilibrium drjdi, = mejp, > pjlp„ sime we expect the 
supply price ol j to be rising (here me, is the maiginal cost of purchases of 7 to the firm 
that u.ses it as an input). Hence the true opportunity cost of drjdr,, will be greater 
than/)//),, the corresponding figure in the profit equation. Finally, exce.ssive purchases 
of 7 , because they provide utility to management, need not affect our result. With a 
given stock of resource 7 it is impossible for this to be a universal phenomenon. But if it 
occurs only in a limited sector of the economy, elsewhere ; will Ixcome scarcer, its 
relative marginal yield will ri.se, but its price will rise correspontlmglv, and pj/pi = 
will continue to be satisfied by firms in the remainder of the economy (i.e., those 
which do not buy excessive amounts of;). 

*’ We are grateful to James Sweeney and Robert Willig for suggesting this way ol 
looking at the matter. The next several paragraphs follow a draft by Sweeney. 
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Fic;. 2.—Prcxluclion-possiliility frontiers with and without energy supply subsidy 

question, £0 represents its initial output, and£, represents the output 
under the subsidy. In figure 2, A represents the initial output combi¬ 
nation in the absence of any subsidy, and B represents the output 
combination with the subsidy. These points have been drawn under 
the a.ssumption that the subsidy in fact increases net energy outputs 
and decreases the outputs of other goods and services. Point C repre¬ 
sents a combination of net energy outputs and other outputs available 
if a perfectly efficient mechanism, such as a tax on the imports of all 
energy, had been instituted. The difference in net energy outputs 
between C and B is what we call the potential net energy loss asso¬ 
ciated with this subsidy. Note that, although the subsidy program can 
increase the actual net energy output, there is a loss in potential net 
energy output relative to what could be obtained with a perfectly 
efficient program. 

Alternatively, the distance fromy4 toD can be viewed as the poten¬ 
tial net energy output loss asstx;iated with a subsidy program. This is 
the net energy loss if the economy is managed in such a way that all 
other outputs remain constant. 

Thus we see that even though a subsidy program may increase 
actual net energy output, it can be expected to lead to a decrease in 
potential net energy output. 


VI. Promise of Future Profitability: A Dangerous Fallacy 

Before we turn to our overall conclusion, it is important to deal with a 
critical error in reasoning which the preceding analysis may well elicit. 
Undoubtedly it will suggest the response that many new energy 
sources, while admittedly unprofitable today (both at the margin and 
overall), should become viable in the foreseeable future if enough R & 
D is devoted to the task. This is probably true, but it weakens neither 
the analysis nor its policy implications, because the issue is not 
whether they will ultimately begin to yield positive profits, but 
whether those profits can be expected to be sufficient to compensate 
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for the initial losses. To drive home the point, consider each of the 
two possibilities explicitly. 

Case a: If future profits are insufficient to make up for earlier losses 
at the margin, our analysis still shows that over its lifetime the invest¬ 
ment will be a potential net energy loser. It will yield some net energy 
gains in later years, but not enough to make up for the resource 
depletion it caused earlier. To see how this is implied by our previous 
analysis, we need merely interpret the quantities in our equations to 
represent discounted present values of the corresponding future 
streams, and the result follows at once: The present value of the 
stream of future profits serves as the required indicator of the present 
value of the stream of values of the future net energy yields, and 
propositions 1 and 2 continue to apply.'® 

Case b: Where expected future profits do make up for earlier losses, 
the corresponding net-energy tests will, of course, be passed. But in 
the absence of capital market imperfections and externality problems, 
there should then be no need for government subsidies. Private 
enterprises should, in these circumstances, be expected to invest ade¬ 
quately in the project. After all, any firm that invests in an innovation 
normally expects to lose money initially and to make it up later. There 
is simply nothing unusual in such cases. 

Thus we conclude that the likelihood that with sufficient research 
outlays a new energy source can be made viable financially at some 
point in the future does not weaken the presumption that subsidies to 
such projects will lead to a net loss in the economy’s potential net 
energy stocks. 

Here it should be noted, incidentally, that the re.search process itself 
has its energy costs—direct, indirect, and those that take the form of 
energy opportunity costs. (One fusion-research project, e.g., is re¬ 
ported to use as much electricity as a medium-sized American city.) It 
may also be appropriate at this point to take note of another poten¬ 
tially misleading argument on our subject. It is tempting to assert that, 
because the energy issue is not exclusively a matter of growing scarcity 
of a resource but involves freedom of the nation’s foreign policy from 
the threat of international blackmail, it may well pay society to invest 
in energy-production projects to an extent that is unprofitable for the 
private sector. By itself, this assertion is legitimate enough, but un¬ 
fortunately there is no way such investments will help to deal with the 
international blackmail problem if the projects undertaken are net 
energy losers. In short, we are by no means advocating adherence to a 
profitability criterion for its own sake. We are merely suggesting that 

" It can, however, be argued (and, indeed, it has been) that it is inappropriate to 
discount a flow of benefits to future generations. This is not the place to go into the 
complexities of this difficult issue—the proper standards of justice in intergcnerational 
distribution. 
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profit can often serve as a useful indicator of the net energy contribu¬ 
tion promised by such an undertaking. If its net energy yield over its 
lifetime will be negative, there is no way it can be justified on 
geopolitical grounds any more than it can be rationalized in economic 
terms. 

VII. Heterogeneity of Energy Sources 

There is another assumption underlying our analysis which can easily 
lead to a second erroneous conclusion on policy. We have assumed all 
forms of energy to be homogeneous, but in fact they are not. Coal is 
very different from oil in terms of the uses to which it can easily be 
put, in transportability, in terms of environmental effects, and so 
forth. Such differences may sometimes be able to justify the adoption 
of projects which are (marginal) net energy losers. Suppose one type 
of energy (petroleum) is in short supply while another (coal) is very 
abundant. An effective process to transform coal into oil may then be 
worthwhile even though the nation’s inventory of British thermal 
units is reduced in the process. 

Heterogeneity may take yet another form—that of differences in 
the dates at which ditferent energy supplies become available. Sup¬ 
pose we were to believe that within a century, fusion will supply all our 
energy needs safely, abundantly, and cheaply. The energy supplies 
which wilt not be u.sed before the twenty-second century may become 
valueless or nearly so. If this is, for example, true of much of our coat 
reserves, a net energy loss involved in the trade of coal tomorrow for 
liquid fuel today may well be Justifiable. 

But the conclusion is emphatically not that the net energy consider¬ 
ations emphasized in this paper can be safely ignored. Indeed, it is 
remarkable that the robustness of our basic theorem (proposition 2) 
extends it to cover the is.sue of heterogeneity of energy sources. That 
is, we will see now that in the absence of externalities, a subsidy to a 
particular form of energy (such as current output of petroleum 
from coal) must actually reduce the potential net output of that form 
of energy (it must reduce net petroleum stocks even if other forms of 
energy are used as an input in the process). In showing this result, we 
can easily deal with an economy having any arbitrary number of 
forms of energy. But the logic of the argument as well as the nature of 
the formal analysis can be brought out most clearly by assuming that 
there are only two forms of energy, one of which (call it “petroleum") 
is directly substitutable for imported energy, while the other (call it 
“coal”) is not directly substitutable for imported fuel and is available in 
large amounts which may never be used up.'* We then have: 


'• Obviously, proposition 3 will not hold if coal is a homogeneous and totally free 
economic good which can be substituted for petroleum without cost. But coal does cost 
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Proposition 3: Under the conditions of proposition 2, the expansion 
of any petroleum-producing activity which involves a negative margi¬ 
nal profit yield will involve a net reduction in petroleum stocks at the 
margin, even if substantial amounts of coal are employed in the 
process, if output of coal and other outputs are held constant. 

That is, such an unprofitable increase in petroleum output may well 
exacerbate the economy’s energy importation problem, involving in¬ 
creased use of coal as a substitute for petroleum! 

At first glance this result may be even more implausible than propo¬ 
sition 2 itself, but its proof is simply a matter of trivial reinterpretation 
of the steps of the proof of proposition 2. For this purpose we need 
merely consider yj to be manufactured petroleum (e.g., oil extracted 
from shale or from coal), r, and r,, to be the inputs of petroleum alone 
(rather than total energy input), and r.^ and r ,2 to be coal inputs. Then 
(10) immediately constitutes our result, for it tells us about the net 
marginal yield of commodity 1 (petroleum) alone, regardless of the 
quantities of input 2 (coal) or other resources used to produce it. 

One may well argue vis-a-vis this proposition that it is unreasonable 
to assume that a subsidy, say, to conversion of coal to oil will leave 
other outputs unchanged, for it will surely affect at least the gross 
output of coal and perhaps the gross outputs of other energy sources. 
Nevertheless, our interpretation of proposition 2 in relation to this 
issue remains valid here. That is, the petroleum saved by a reduction 
in other activities in the economy may offset the oil lost by the 
expansion of the subsidized activity, but in that case (he gain in oil 
inventories would have been still greater if those other outputs had 
contracted in the same way without the subsidy-induced increase in 
conversion of coal into oil. 

However, in this case the behavior of substitute energy sources is 
actually apt to exacerbate the problem rather than relieve it: (a) The 
output of other forms of energy used as inputs in the subsidized 
process (the coal in our example) will expand and therefore increa.se 
their own use of oil as an input, thus tending to reduce the overall 
petroleum yield;^® (h) the outputs of other forms of energy (say, 
electricity) may be reduced because subsidized oil becomes .somewhat 
cheaper, but if this is so it will only occur because oil consumption is 
increased and substituted for other types of energy. On both scores, 


money to extract, and it vanes in quality and ease ol extraction Hence, n has a ^on^e!o 
price per Btu which can be expected to rise as coal suxiks are used up. 

It is, of course, (rue that in this case the available quantity of coal may be increased, 
but there are two reasons why it will not help to solve the problem’ First, by hy^thesis 
“coal” IS used here to represent an input which, unlike oil, is not in short supply. 
Otherwise, there is nothing gained by sacrificing 10 Btu of coal for ft Btu of oil. the i.wue 
we are now discussing. Second, by proposition 2, the subsidy must still have a negative 
potential net energy yield of coal and oil together as well as in oil atone (proposition 3). 
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then, the net energy loss in the subsidized production of oil is likely to 
be aggravated by the response in output of other types of energy. 

VIII. Oil Tariff as Stimulus to Supply 

We began our discussion by contrasting the net energy consequences 
of a targeted subsidy program with those of a tariff on imported oil. It 
is time to turn to the latter. Obviously, a tariff will tend to reduce oil 
use by both consumers and industry as well as stimulate energy 
supplies from both conventional and innovative sources. 

Standard economic analysis also suggests that such a tariff will 
accomplish all this in an efficient manner.** If the magnitude of the 
tariff corresponds to the size of the marginal detrimental externality 
represented by increased dependence on OPEC, in a competitive 
world it will move the economy to a Pareto optimal point on the 
efficiency frontier. This point will involve larger oil stocks and smaller 
outputs of at least some other commodities than would otherwise have 
been produced. All this would, of course, be accompanied by some 
reduction in the consumption of those other goods by households and 
other consumers. 

Nevertheless, our analysis may well appear to have shown that this 
benefit of the tariff is likely to be offset in part by its supply conse¬ 
quences. To the extent that it serves, in effect, as a subsidy to domestic 
production, it seems likely to lead to changes in output whose net 
energy yields are negative, with the tariff protecting producers from 
the reductions in profit that would otherwise follow. This view of the 
matter is, however, incorrect. Since a tariff raises the cost of imported 
energy everywhere, it will not only increase the gross output of 
domestic production of energy in export-substitutable forms; it will 
also cause substitution of other inputs for use of such types of energy 
elsewhere in the economy. This will cause adjustments in the energy 
opportunity costs of those inputs, which will mean that any profitable 
expansion in domestic energy production resulting from the tariff 
will in fact have a positive net energy yield. This is because the 
increased substitution, throughout the economy, of other inputs for 
oil will cause the marginal rate of substitution (MRS) to fall; that is, an 
additional unit of such an input will be able to replace less oil than it 
could before. In this way the MRS between oil and any other input 
elsewhere in the economy will remain equal to its marginal product in 
innovative petroleum-producing activities. 

In terms of figure 2 the point is that the tariff will move the 
economy from a point such as A to one such as C, with an increase in 
oil output and a loss in efficiency, while the subsidy will, instead, move 
us to a point such as B, causing the potential net energy loss BC. 

For an excellent discussion along these lines, see Schmalensee (1979). 
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Somewhat more formally, we see that the pretariff output of an 
innovative energy source will be smaller than the amount that 
maximizes profits after the imposition of the tariff. As we have seen, 
an increase in output that offers a positive marginal profit must also 
have positive marginal net yield of energy. It follows that the net 
energy yield of the expansion in output of manufactured energy after 
imposition of the tariff must be positive. 

Nevertheless, there are .some reservations which must be expressed 
about the tariff approach. First, if a tariff induces an increase in 
consumption from domestic sources, it may simply hasten the process 
of depletion of our oil inventories and speed the economy’s growth in 
vulnerability to the cartel! 

One can also think of a case in which subsidy of a net energy losing 
project with high sunk costs (e.g., high research costs but low variable 
costs) may conceivably be superior to a tariff. If, once the resources 
are exptended, it constitutes a credible threat which prevents the cartel 
from raising its price in the future, it may be worth the outlay. It may 
be superior to a tariff which will also lead to domestic investment 
outlays on energy production and, in addition, result in higher prices 
to consumers.®* 

IX. Concluding Comment: Subsidies Which Do Increase Energy 
Supplies 

We began by denying that we were opposed to all programs involving 
subsidies to new energy sources. Our objective, rather, is to make 
absolutely clear how easily an ill-considered subsidy program can 
backfire and may actually serve to deplete the economy’s energy 
supplies. But we may now seem to have proved too much—to have 
shown that any subsidy can be presumed to cause energy losses. In 
fact, we have not done so. Rather, properly interpreted, our results 
indicate the conditions under which such outlays can contribute 
energy resources. This is made possible by the role of externalities 
that may produce cases in which a project’s net energy yield offers no 
corresponding net profits to the firm that undertakes it. 

There is no point in attempting to provide an exhaustive list of the 
ways in which such sources of market failure may arise. We will only 
mention two which, we suspect, are of prime importance here and 
which should illustrate the sort of projects for which subsidies can be 
justified. The two externalities which will be discussed are those re¬ 
sulting from differences between private and social risk and those 
which arise from the free-rider problems engendered by investment 
in research. 

” We are indebted to Robert Willig for pointing out this pns.sibility to us. 
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There are at least two reasons for which the social risks of an 
investment may be smaller than those incurred by a private firm 
which undertakes it, so that the investment may be desirable for 
society even though it is unattractive to private enterprises. The first is 
the well-known argument of Arrow and others which points out that 
since at any given time the economy contains a substantial number of 
investments which incur risks that are to a considerable degree inde¬ 
pendent, the “law of large numbers” reduces the risk to society of an 
additional project below what it is to a private entity which has a 
considerably smaller number of projects under its control. 

Second, financial risk is likely to be more threatening to the indi¬ 
vidual owner of an enterprise than it is to society. If the project goes 
bankrupt but continues to operate in receivership, the investment is 
by no means lost to the community, though it is lost completely to the 
initial investor. So, on these grounds, too, new energy projects, which 
are indeed risky, may incur social costs and, hence, energy costs lower 
than their energy yield even if they are unattractive to private inves¬ 
tors. 

In practice, perhaps the most important justification for subsidy to 
particular energy production projects is the free-rider problem which 
accompanies all applied research. Research almost always generates 
information which is u.seful to others besides those who have con¬ 
ducted or financed it, and there is often no way to make those others 
bear an appropriate share of the cost. Thus, taking all beneficiaries 
together, the total yield of the research may well exceed its costs, and 
our analysis can be shown to suggest that an energy project which 
does so can be presumed to have a positive net energy yield.*^ Never¬ 
theless, because some of the beneficiaries do not make a correspond¬ 
ing fitiaticial contribution toward the cost of the research, such a 
project may well be unattractive to any private entrepreneur. 

The patent system is, of course, intended to deal with this free-rider 
problem, and to some extent it does. However, some research results, 
particularly those that cjualify as products of “basic research,” are not 
patentable. Others are early steps in a development proce.ss which will 
occupy many years—so many that the period of patent protection is 
exceeded. And even results which are eligible for patent protection 
may nevertheless throw off substantial indirect benefits to free riders 
who will be helped by them to produce substitutes for the patented 
item. This is frequently sugge.sted as the explanation for the fact that 

This is fairly obvious if the free riders use their unpaid-for knowledge to prorfuce 
more eneigy; their energy output must be added to that of those who conducted the 
research. However, the argument applies even if the free riders use the knowledge in 
other economic activities, since the prtxluci of those attivilics will also have an energy 
opportunity value. For example, if the new kntiwledge reduces the inputs needed to 
produce a unit ot some other product, the inputs saved can be used as a substitute for 
energy elsewhere in the economy, in the manner already discussed. 
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many firms choose not to patent the results of their R & D activities, 
thereby avoiding the disclosure required for a patent application. 

These are probably the prime reasons no private firms will under¬ 
take the research outlays required for a protracted and risky de¬ 
velopment process such as that required for very innovative energy 
sources like nuclear fusion, which is generally not expected to yield a 
financially viable product until we are well into the twenty-first cen¬ 
tury. 

In conclusion, let us emphasize again that even if a benign energy 
source such as .solar energy does not pass the test that qualifies it for 
subsidies under our criterion, it may nevertheless be socially desirable 
to encourage its production. Bui a tariff on oil importation or a tax on 
the consumption of oil would appear to be the appropriate way to do 
so. 
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In a general equilibrium model of a labor economy, the size of 
government, measured by the share of income redistributed, is de¬ 
termined by majority rule. Voters rationally anticipate the disincen¬ 
tive effects of taxation on the labor-leisure choices of their fellow 
citizens and take the effect into account when voting. The share of 
earned income redistributed depends on the voting rule and on the 
distribution of productivity in the economy. Under majority rule, the 
equilibrium tax share balances the budget and pays for the voters’ 
choices. The principal reasons for increased size of government 
implied by the model are extensions of the franchise that change the 
position of the decisive voter in the income distribution and changes 
in relative productivity. An increase in mean income relative to the 
income of the decisive voter increases the size of government. 


I. Introduction 

The share of income allocated by government differs from country to 
country, but the share has increased in all countries of western 
Europe and North America during the past 25 years (Nutter 1978). 
In the United States, in Britain, and perhaps elsewhere, the rise in tax 
payments relative to income has persisted for more than a century 
(Peacock and Wiseman 1961; Meltzer and Richard 1978). There is, as 


We are indebted to Karl Brunner, Dennis Epple, Peter Ordeshook, and Tom Romer 
for many helpful discussions and to the participants in the Carnegie-Mellon Public 
Economics Workshop, an anonymous referee, the editor, and the Interlaken Seminar 
for constructive comments on an earlier version. 
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yet, no generally accepted explanation of the increase and no single 
accepted measure of the size of government. 

In this paper, the budget is balanced.* We use the share of income 
redistributed by government, in cash and in services, as our measure 
of the relative size of government and develop a theory in which the 
government’s share is set by the rational choices of utility-maximizing 
individuals who are fully informed about the state of the economy 
and the consequences of taxation and income redistribution.* 

The issues we address have a long intellectual history. Wicksell 
(1958) joined the theory of taxation to the theory of individual choice. 
His conclusion, that individual maximization requires government 
spending and taxes to be set by unanimous consent, reflects the 
absence of a mechanism for grouping individual choices to reach a 
collective decision. Following Downs (1957), economists turned their 
attention to the determination of an equilibrium choice of public 
goods, redistribution, and other outcomes under voting rules that do 
not require unanimity. 

Several recent surveys of the voluminous literature on the size or 
growth of government are now available (see Brunner 1978; Peacock 
1979; Aranson and Ordeshook 1980; and Larkey, Stolp, and Winer 
1980).^ Many of the hypotheses advanced in this literature emphasize 
the incentives for bureaucrats, politicians, and interest groups to 
increase their incomes and power by increasing spending and the 
control of resources or rely on specific institutional details of the 
budget, taxing, and legislative processes. Although such studies con¬ 
tribute to an understanding of the processes by which particular 
programs are chosen, they often neglect general equilibrium aspects. 
Of particular importance is the frequent failure to close many of the 
models by balancing the budget in real terms and considering the 
effect on voters of the taxes that pay for spending and redistribution 
(see, e.g., Olson 1965; Niskanen 1971; and Hayek 1979). A recent 
empirical study by Cameron (1978) suggests that decisions about the 
size of the budget are not the result of “fiscal illusion,” so the neglect 
of budget balance cannot be dismissed readily. 

We differ from much of the recent literature in three main ways. 


' All variables are real. There is no inflation. Budget balance means that redistribu¬ 
tion uses real resources. Public goods are neglected. 

* Ideally the size of government would be measured by the net burden impo.sed (or 
removed) by government programs. 

’ Larkey et al. (1980) include a survey of previous surveys. Recent surveys bv Mueller 
(1976) and Sahota (1978) summarize recent contributions by Downs (1957), Musgrave 
(1959), Olson (1965), Niskanen (1971). Buchanan and I'ullock (1972), Riker and 
Ordeshook (1973), and others to such related topics as the determinauon of equilib¬ 
rium collective decisions and the effects of government poliacs on the distribution of 
income. 
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First, voters do not suffer from “fiscal illusion” and are not myopic. 
They know that the government must extract resources to pay for 
redistribution. Second, we concentrate on the demand for redistri¬ 
bution and neglect any “public goods” provided by government (see 
also Peltzman 1979). Third, we return to the earlier tradition of de 
Tocqueville ([1835] 1965) who associated the size of government, 
measured by taxes and spending, with two factors: the spread of 
the franchise and the distribution of wealth (property).^ 

Our hypothesis implies that the size of government depends on the 
relation of mean income to the income of the decisive voter. With 
universal suffrage and majority rule, the median voter is the decisive 
voter as shown by Roberts (1977) in an extension of the well-known 
work of Hotelling (1929) and Downs (1957). Studies of the distribu¬ 
tion of income show that the distribution is skewed to the right, so the 
mean income lies above the median income. Any voting rule that 
concentrates votes below the mean provides an incentive for redis¬ 
tribution of income financed by (net) taxes on incomes that are (rela¬ 
tively) high. Extensions of the franchise to include more voters below 
mean income increase votes for redistribution and, thus, increase this 
measure of the size of government. 

The problem with this version of the de Tocqueville hypothesis is 
that it explains too much. Nothing limits the amount of redistribution 
or prevents the decisive voter from equalizing incomes or, at a 
minimum, eliminating any difference between his disposable income 
and the disposable income of those who earn higher incomes. Incen¬ 
tives have been ignored. Higher taxes and redistribution reduce the 
incentive to work and thereby lower earned income. Once we take 
account of incentives, there is a limit to the size of government. To 
bring together the effect of incentives, the tlesire for redistribution, 
and the absence of fiscal illusion or myopia, we develop a general 
equilibrium model. 

Section II sets out a static model. Individuals who differ in produc¬ 
tivity, and therefore in earned income, choose their preferred combi¬ 
nation of consumption and leisure. Not all individuals work, but those 
who do pay a portion of their income in taxes. The choice between 
labor and leisure, and the amount of earned income and taxes, de¬ 
pend on the tax rate and on the size of transfer payments. 

I'he tax rate and the amount of income redistributed depend on 
the voting rule and the distribution of income. Section III shows how 
income redistribution, taxes, and the size of the government budget 


* We are indebted to Larkey et al. (1980) for pointing out the similarity between de 
'I'ocqucville and the conclusion we reached in an earlier version and in Melucr and 
Richard (1978). De TtKqueville's distribution of property finds an echo in the concerns 
about “mob rule” by the writers of the Constitution. 
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change with the voting rule and the distribution of productivity. A 
conclusion summarizes the findings and main implications. 

II. The Economic Environment 

The economy we consider has relatively standard features. There are 
a large number of individuals. Each treats prices, wages, and tax rates 
as givens, determined in the markets for goods and labor and by the 
political process, respectively. Differences in the choice of labor, lei¬ 
sure, and consumption and differences in wages arise solely because 
of differences in endowments which reflect differences in produc¬ 
tivity. In this section, we extend this standard model to capture the 
salient features of the process by which individuals choose (o work or 
subsist on welfare payments and show the conditions under which 
these choices are uniquely determined by the tax rate. 

The utility function is assumed to be a strictly concave function, 
m(c, /), for consumption, c, and leisure, /. Consumption and leisure 
are normal goods, and the marginal utility of consumption or lei¬ 
sure is infinite when the level of consumption or leisure is zero, re¬ 
spectively. There is no capital and no uncertainly. 

The individual’s endowment con.si.sts of ability to produce, or pro¬ 
ductivity, and a unit of time that he allocates to labor, n, or leisure, / = 
1 — n. Individual incomes reflect the differences in individual pro¬ 
ductivity and the use of a common, constant-returns-to-scale technol¬ 
ogy to produce consumption goods. An individual with productivity x 
earns pretax income, y: 

y(x)=xn(x). (1) 

Income is measured in units of consumption. 

Tax revenues finance lump-sum redistribution of r units of con¬ 
sumption per capita. Individual productivity cannot be observed di¬ 
rectly, so taxes are levied against earned income. The tax rate, t, is a 
constant fraction of earned income but a declining fraction of dispos¬ 
able income. The fraction of income paid in taxes net of transfers, 
however, rises with income.* There is no saving; consumption equals 


Relianre on a linear tax follow, a well-established tradition. Ronier (1975) analvzetl 
problems of uniituxlality using a linear tax and piedeicrminerl government spending. 
Rolierts {1977), using a linear tax and a predetermined budget, .showed that the median 
voter dominates the solution if incomes are ordered by productivity. Linear tax func¬ 
tions are used also when the social welfare f unction is used to determine the opumal tax 
(see .Sheshinski I97U). The degree to which actual taxes differ from linear taxes ha.s 
generated a large literature. Pcchman and Okner (1974) find tliat the tax rate is ap¬ 
proximately constant. King (1980) writes that most redistribution in the United States 
and the United Kingdom comes from the transfer system, not from the ux system. 
Browning and Johnson (1979) show that conclustons about pro|rortionalitv of the tax 
rate depend heavily on assumptions used to allocate the burden of indirect business 
taxes. 
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disposable income as shown in (2): 

c(x) = (1 — <)«x + r, c ^ 0. (2) 

If there are individuals without any ability to produce, * = 0, their 
consumption is r ^ 0. 

Each individual is a price taker in the labor market, takes t and r as 
givens, and chooses n to maximize utility. The maximization problem 

is: 

max u(c,l) = max w[r + nx(l — t), 1 — n]. (3) 

ree[0,I] ne[0.1] 

'I'he first-order condition. 


0 = ■ = Ue[r + m:(l — t), 1 — n]x(l — t) 

art 

— Uj[r + nx(l — t), 1 — n]. 


( 4 ) 


determines the optimal labor choice, n(r, x(l — <)], for those who 
choose to work. The choice depends only on the size of the welfare 
payment, r, and the after-tax wage, x(l — f).* 

Some people subsist on welfare payments. From (4) we know that 
the productivity level at which n = 0 is the optimal choice is 


= “<(f. 1) 

Ur(r, 1)(1 -O' 


( 5 ) 


Individuals with productivity below Xo subsist on welfare payments 
and choose not to work; n = 0 for x « Xq. 

Increases in redi.stribution increase consumption. For those who 
subsist on welfare, c = r, so dc/dr = 1. Those who work must consider 
not only the direct effect on consumption but also the effect of 
redistribution on their labor-leisure choice. The assumption that con¬ 
sumption is a normal good means that dc/dr > 0. Differentiating (4) 
and using the second-order condition, D < 0, in footnote 6 restricts 


dc _ Urfx( 1 - 0 - u„ ^ „ 
dr -D 


( 6 ) 


Consumption increases with r for both workers and nonworkers pro¬ 
vided consumption is a normal good. 

The positive response of c to r takes one step toward establishing 
conditions under which we find a unique value of r that determines 


* By assumption, u is strictly concave, so the second-order condition is negative and 
(4) d^nes a maximum. The second-order condition is = D = Uc,x*(l — t)* - 

2u,.ix(l - <) + Uu < 0. 
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the amount of earned income and amount of redistribution for each 
tax rate. The next step is to show that normality of consumption is 
sufficient to establish that earned income (income before taxes) in¬ 
creases with productivity. 

Pretax income is 

y{r, t. x) = xn[r, x(l - <)]- O) 


People who do not work, x ^ Xot have y = 0 and dy/dx = 0. For all 
others, 


ax 


= n 


+ X 


dn 

dx 


( 8 ) 


The first-order condition (eq. [4]) yields 

dn _ Mc(l — t) ~ 0^ ~ Mc/«(1 — t) 

dx —D 


The sign of dnidx is indeterminate; as productivity increases, the 
supply of labor can be backward bending. Pretax income, y = nx, does 
not decline, however, even if n falls. Substituting (9) into (8) and 
rearranging terms shows that the bracketed term in (10) is the 
numerator of dc/dr in (6). Hence, dy/dx is positive for all x > Xo 
provided that consumption is a normal good; 

dy _ Uo(l - t)x w[Ur(X(l - 0 - M|(] ^ 
dx -D 

The final step in establishing that there is a unique equilibrium 
solution for any tax rate uses our assumption that leisure is a normal 
good. The government budget is balanced and all government 
spending is for redistribution of income. If per capiu income is y, 
then 

ty = r. (11) 

LetF’(-) denote the distribution function for individual productivity, 
so thatF'(x) is the fraction of the population with productivity less than 
X. Per capita income is obtained by integrating: 


y = 



XTi[r, (1 - t)x]dF(x). 


( 12 ) 


F-quation (12) shows that per capita income, and therefore total 
earned income, is determined once we know Xq, /, and r. From (5). we 
know that Xo depends only on t and r, and from (11) we know that, lor 
any tax rate, there is at least one value of r that balances the budget.^ 


’ The left side of (11) is nonnegativc and is a continuous function of r thai is bounded 
hy (X, where x is the average of x. 
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If leisure is a normal good, the value of r that satisfies (11) for each t is 
unique.® 

Once r or I is chosen, the other is determined. The individual’s 
choices of consumption and the distribution of his time between labor 
and leisure are determined also. The choice of r or t uniquely deter¬ 
mines each individual’s welfare and sets the size of government. 

III. The Size of Government 

The political process determines the share of national income taxed 
and redistributed, fhe many ways to make this choice range from 
dictatorship to unanimous consent, and each produces a different 
outcome. We call each political process that determines the tax rate a 
voting rule. 

In this section, we consider any voting rule that allows a decisive 
individual to choose the tax rate. Two examples are dictatorship and 
universal suffrage with majority rule. A dictator is concerned about 
the effect of his decisions on the population’s decisions to work and 
consume, but he alone makes the decision about the tax rate. Under 
majority rule, the voter with median income is decisive as we show 
below. We then show that changes in the voting rules and changes in 
productivity change the tax rate and the size of government. 

1 he decisive voter chooses the tax rate that maximizes his utility. In 
making his choice, he is aware that his choice affects everyone’s 
decision to work and consume. Increases in the tax rate have two 
effects. Each dollar of earned income raises more revenue but earned 
income declines; everyone chooses more leisure, and more people 
choose to subsist on redistribution. “High” and “low” tax rates have 
opposite effects on the choice of labor or leisure and, therefore, on 
earned income. 

Formally, the individual is constrained to find a tax rate that bal¬ 
ances the government budget, equation (11), and maximizes utility 
subject to his own budget constraint, equation (Vj. I'he first-order 
condition for the decisive voter is solved to find his preferred tax rate: 

(13) 

where is the income of the decisive voter. 

" The normality of lei.sure iiiean.s that itlldr > 0 and, therefore, drildr = ~dl/dr < 
0. Sintc 

■S' 

the left side of (11) is a strictly decreasing, continuous function of r. The right side of 
(11) strictly increases with r. This implies that there is a unique value of r that satisfie.s 
( 11 ). 
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Roberts (1977) showed that if the ordering of individual incomes is 
independent of the choice of r and /, individual choice of the tax rate 
is inversely ordered by income. This implies that with universal suf¬ 
frage the voter with median income is decisive, and the higher one’s 
income, the lower the preferred tax rate. By making the additional 
assumption that consumption is a normal good, we have shown that 
incomes are ordered by productivity for all r and t. Combining 
Roberts’s lemma 1 (1977, p. 334) with our results, we can order the 
choice of tax rate by the productivity of the decisive voter.® The 
higher an individual’s productivity, the lower is his preferred lax rate. 

Figure 1 illustrates the proposition and shows the effect on the lax 
rate of changing the voting rule. Fhe negatively sloped line is the 
relation between individual productivity, x, and the individual’s pre¬ 
ferred tax rate. This line need not be linear. 

The maximum tax rate, <max. 's chosen if the decisive voter does not 
work. An example is x = x^,. In this case, x Xo; the decisive voter 
consumes only r, so he chooses the tax rate (<max) that maximizes r. 
Any higher tax rate reduces aggregate earned income, tax collections, 
and the amount available for redistribution. From equation (5), we see 
that the maximum tax rate must be less than t = 1. 

As productivity rises from Xo to x, the lax rate declines from to 
0. At X(t = 5c, the decisive voter is endowed with average productivity 
and cannot gain from lump-sum redistribution, so he votes for no 
redistribution by choosing t = 0.’® From equation (5) and 1/^(0. 
we see that everyone works when r = 0. If the decisive voter’s produc¬ 
tivity exceeds 5c, t and r remain at zero and aggregate earned income 
remains at society’s maximum. 

Changes in the voting rule that spread the franchise up or down the 
productivity distribution change the decisive voter and raise or lower 
the tax rate. Our hypothesis implies that changing the position of the 
decisive voter in the distribution of productivity changes the size of 
government provided Xo < Xd < X. Major changes in x^ have occurred 
in two ways. Wealth and income requirements for voting were re¬ 
duced or eliminated, gradually broadening the franchise and lower¬ 
ing the income of the decisive voter. Social security retirement systems 
grew in most countries after the franchise was extended. By increas- 


* The formal statemeni of the result is- Consider any two pairs and (tj, tj). If I, 

> t,, then foi all x: x is indiffeient between (r,, (,) and (r^, tj) implies that x' weakly 
prefers (r^, tj) to (r,, t|) for all x' < x and x" weakiv prefers (r„ /,) to (r„ /,) for all x" > x; x 
strictly prefers (r,. /,) to (r^, /,) implies thalx" striclJy prefers (r,. (,) to (i,. (j) for all x" > v; 
X strictly prefers (rj.tj) to (r,, /,) implies that x' slrirdv prefers (r^, <,) to (r,./,) foe all x' < 
X. Note that this result does not reciuire uninitKlalily of voter preferences for tax rates. 

We have omitted public gtnxls. In an carlici version we showed that under care¬ 
fully specified conditions, public gotxfs can be included without changing the result for 
redistribution. 
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Fig. I 

ing the number of retired persons, social security systems increase the 
number of voters who favor increased redistribution financed by 
taxes on wages. Some of the retired who favor redistribution also 
favor low taxes on capital, property, and the income from capital. 

The size of government changes also if there are changes in relative 
income, as shown by equation (13), or relative productivity. Conclu¬ 
sions about the precise effect of changes of this kind are difficult to 
draw. We cannot observe productivity directly and can only infer 
changes in the distribution of productivity, F('), by observing changes 
in relative income. Recent literature makes clear that these effects are 
disputed (see Sahota 1978; King 1980; and others). Further, we 
cannot deduce the effect of changes in productivity on t directly from 
equation (13). The reason is that y depends on t, so finding the effect 
of changes in relative productivity requires the solution to a nonlinear 
equation in t. Instead, we rewrite (13) in a form which involves the 
(partial) elasticities of per capita income (y) with respect to redistribu¬ 
tion (r) and the wage rate (Af[l — <]). 

Let T = 1 — t be the fraction of earned income retained. From (12), 
y depends on r and r only. The total derivative 

where Jr and y, are the two partial derivatives. Substituting (14) into 
(13), we solve for f. 

I = m - I + 7)(y, r) 

m — 1 -t- 7}(J, r) -f rm}{y, t) ’ 


( 15 ) 
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where m is the ratio of mean income to the income of the decisive 
voter, ylydt and the tj’s are partial elasticities. Using the common 
economic assumption that the elasticities are constant, the tax rate 
rises as mean income rises relative to the income of the decisive voter, 
and taxes fall as m falls: 


dt ^ V(y» t)[1 - -giy, r)] 

dm [m - 1 + -qiy, r) + m7](y, r)]* 


( 16 ) 


Relaxing the assumption of constant elasticities weakens the conclu¬ 
sion, but we expect the sign of (16) to remain positive provided the 
change in the elasticities is small. 

One of the oldest and most frequently tested explanations of the 
growth of government is known as Wagner’s law. This law has been 
interpreted in two ways. The traditional interpretation is that gov¬ 
ernment is a luxury good so that there is a positive relation between 
the relative size of government and the level of real income. Recently 
Alt (1980) has questioned this interpretation of Wagner’s idea. Alt 
(1980, p. 4) notes that Wagner argued that there is “a proportion 
between public expenditure and national income which may not be 
permanently overstepped.” This suggests an equilibrium relative size 
of government rather than an ever-growing government sector. 

The traditional statement of Wagner’s law—that government 
grows more rapidly than income—has been tested many times, but 
with mixed results. Peacock and Wiseman (1961), Cameron (1978), 
and Larkey et al. (1980) discuss these tests. Our hypothesis suggests 
that the results are ambiguous because Wagner’s law is incomplete. 
The effect of absolute income on the size of government is condi¬ 
tional on relative income. Average or absolute income affects the 
elasticities in equation (15), and the relative income effect is given by 


To make our hypothesis testable, we must identify the decisive 
voter. The applicable voting rule in the United States is universal 
franchise and majority rule. Under this rule, the voter with median 
income is decisive in single-issue elections, as we argued above. Hence 
the median voter is decisive in elections to choose the tax rate, so m is 
the ratio of mean to median income." 


" The many tests of the median-voter hypothesis using regression analyses are 
incondusive. One reason is that many of the tests do not discriminate between the 
median and any other fractile of the income distribution (see Romcr and Rosenthal 
1979). Cooler and Helpman (1974) use income before and after uxes net of transfers 
to estimate the shape of the social welfare function implicit in U.S. data. They conclude 
that “the assumption that ability is distributed as wages per hour . . . —perhaps the best 
assumption on distribution of ability—vindicates the median voter rule." 
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Government spending and taxes have grown relative to output in 
most countries with elected governments for the past 30 years or 
longer. Increased relative size of government appears to be inde¬ 
pendent of budget and tax systems, federal or national governments, 
the size of the bureaucracy, and other frequently mentioned institu¬ 
tional arrangements, although the relative rates of change in different 
countries may depend on these arrangements. 

Our explanation of the size of government emphasizes voter de¬ 
mand for redistribution. Using a parsimonious, general equilibrium 
model in which the only government activities are redistribution and 
fixation, the real budget is balanced, and voters are fully informed, 
we show that the size of government is determined by the welfare- 
maximizing choice of a decisive individual. The decisive individual 
may be a dictator, absolute monarch, or marginal member of a junta. 

With majority rule the voter with median income among tbe en- 
f ranchised citizens is decisive. Voters with income below the income of 
the decisive voter choose candidates who favor higher taxes and more 
redistribution; voters with income above the decisive voter desire 
lower taxes and less redistribution. The decisive voter chooses the tax 
share. When the mean income rises relative to the income of the 
decisive voter, taxes rise, and vice versa. The spread of the f ranchise 
in the nineteenth and twentieth centuries increased the number of 
voters with relatively low income. The position of the decisive voter 
shifted down the distribution of income, so tax rates rose. In recent 
years, the proportion of voters receiving social security has increased, 
raising the number of voters favoring taxes on wage and salary in¬ 
come to finance redistribution. A rational social security recipient with 
large property income supports taxes on labor income to finance 
redistribution but opposes taxes on inettme from property. In our 
analysis, there is neither capital nor taxes on property, so the increase 
in social security recipients has an ef fect similar to an extension of the 
franchise. 

Our assumption that voters are fully informed about the .size of 
government differs from much recent literature. There, taxpayers 
are portrayed as the prey sought by many predators who conspire to 
raise taxes relative to income by diffusing costs and concentrating 
benefits, or in other ways (Buchanan and Tullock 1962; Olson 1965; 
Niskanen 1971; Hayek 1979). We acknowledge that voters are ill 
informed about the costs of particular projects when, as is often the 
case, it is rational to avoid learning details. Knowledge of detail is not 
required to learn that the size of government has increased and that 
taxes have increased relative to output or income. Long ago it became 
rational for voters to anticipate this outcome of the political process. 
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Wagner’s law, relating taxation to income, has generated a large 
literature and has been tested in various ways. Our analysis shows that 
Wagner’s law should be amended to include the effect of relative 
income in addition to absolute income. 

Kuznets {1955) observed that economic growth raises the incomes of 
skilled individuals relative to the incomes of the unskilled. In this way, 
economic growth can lead to rising inequality and, if our hypothesis is 
correct, to votes for redistribution. The rising relative size of govern¬ 
ment slows when the relative changes come to an end and reverses if 
the relative changes reverse in a mature stationary economy. 

The distinctive feature of our analysis is not the voting rule but the 
relation between individual and collective choice. Each person 
chooses consumption and leisure by maximizing in the usual way. 
Anyone who works receives a wage equal to his marginal product. 
Taxes on labor income provide revenues for redistribution, however, 
so everyone benefits from decisions to work and incurs a cost when 
leisure increases. 

The analysis explains why the size of government and the tax rate 
can remain constant yet be criticized by an overwhelming majority of 
citizens. The reason is that at the voting equilibrium nearly everyone 
prefers a different outcome. If unconstrained by the voting rule, 
everyone but the decisive voter would choose a different outcome. 
But only the decisive voter can assure a majority. 

An extension of our argument may suggest why real government 
debt per capita, as measured in the budget, has increased more than 
20-fold in this century. The decisive voter has as much incentive to tax 
the future rich as the current rich. An optimal distribution of the cost 
of redistribution would not lax only the current generation because, 
with economic growth, the future generation will be richer than the 
current generation. By shifting the burden of taxation toward the 
future, income is redistributed intertemporally. 

'l o pursue these questions more fully and to analyze any effect of 
defense and public goods, it seems necessary to embed the analysis in 
a model with saving, capital accumulation, and public goods and to 
explore the effect of permitting relative shares to change as income 
changes. From an analysis of a growing economy, we ran exj>ect lo 
develop a rational theory of the growth of government to comple¬ 
ment our analysis of the government’s size. 
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The Family, Inheritance, and the 
Intergenerational Transmission of Inequality 


Nigel Tomes 

of Wei>tern Ontario 


llticc|ual inheritance of material wealth is commonly considered 
a major tau.se of inequality in consumption. However, theoreti¬ 
cal models of the intergenerational iransinission of inequality by 
Becker, Blinder, and Ishikawa imply that unetjual inheritance may 
either inciea.se oi icdute tonsiimption inetpialily. Difterenies in 
inherited wealth resulting (tom uuc<)uai parental incomes increase 
me(|uahty iu re<ipicnts’ con.stiinplion. However, unetpial heque.sls 
causetl by (iill'eientcs among families in the endowed ability of 
thildren or the costs of producing humait capital are equali/.iiig. 
b.mpirical re.sulls cotifirin llie.se picdicttons: I’he iiiberilaucc re¬ 
ceived by children is inversely related to both c bildren's income anti 
parental education. l ints betjuesis ate “coiiqiensatoi y" in that 
(ceteris paiibus) low-uitorne thildren inherit more than their ad¬ 
vantaged contemporaries. 


The iiltiniale difficulties . . . center around the problem 
of social continuity in a world vv'here individuals are 
born naked, de.stitute, helple.ss, ignorant and untrained. 
. . . The fundamental fact about society ... is that it is 
made up ot individuals who are born and die and give 


An carliei version of this paper, ■■Inheritance anti the friteigenerational Tran.sniis- 
sion of Inequality, Theory and Empirical Results," was presented in workshops at the 
University of Chicago, Queen’s Uiiiversiiy, and the University of Western Ontario 
Workshop participants provided helpful comments. Dorothy Worth provided pio- 
gramming .Ts.si.stante. 1 would like to thank Marvin B. .Su.ssinan and Judith N. Cate.s fo 
permission to use their data. John A. Brittain provided a copy of the data. The rcfeiec 
and an editor provided constructive comments. Responsibility for any errors is my own. 

{Jtnnnal Poltit£al htonomy, ]9HI, vo] 89, m> 5J 
© 1981 by Th<* IJniverjicy of Chicago 0022-5«08/H1/8905-0002|01,50 
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place to others; and . . . that it is dependent upon the 
utilization of three great accumulating funds of inheri¬ 
tance from the past, material goods and appliances, 
knowledge and skill, and morale [which] . . . must in 
some manner be carried forward to new individuals 
devoid of all these things as older individuals pass out. 
. . . [T]he institutions of the private family and private 
property, inheritance and bequest and parental res|Rm- 
sibility, afford one way for securing more or less tolera¬ 
ble results in grappling with this problem. [Knight 
1921, pp. 374-75] 


I. Introduction 

Inequality in economic status and the reproduction of inequality in 
succeeding generations—“social mobility”—continue to l>e objects of 
social concern and the subject of investigation by social scientists. It 
has long been suspected that the institution of the family plays a major 
role in the generation and transmission of inequality and may severely 
limit the ability of public policy to alter the existing social order 
(Knight 1921). 

The role of the family has been examined in a large number of 
empirical studies conducted within the human capital framework 
(e.g.. Parsons 1975a). In contrast to this literature, there exists a 
research tradition which has examined the role of inherited material 
wealth in producing and perpietuaiing inequality (Stamp 1926; 
Wedgwood 1929; Blinder 1973, 1976a; Harbury and Hitchens 1979; 
Menchik 1979). 

Empirical research on the intergenerational transmission of in¬ 
equality has, for the most part, been conducted within the context of 
these two mutually exclusive approaches. However, models of family 
behavior recently proposed by Becker (1974), Ishikawa (1975), and 
Blindei (1976ft) strongly suggest the pos.sibility of substitution be¬ 
tween alternative m<xles of tiansfer and their dependence on a com¬ 
mon set of ability and opportunity variables. Thus, the "ability" of 
children and the education of parents—important determinants of 
children’s schooling—should also influence parent-child transfers 
oI material wealth. 

Recent theoreucal models suggest that the familv may not contribute 
as much to inequality as is commonly thought. Since parents are 
predicted to adjust the magnitude of intergenerational transfers in 
response to differences in the endowments of children, the insutution 



930 JOURNAL OF POLITICAL ECONOMY 

of the family permits a redistribution of resources which tends to 
equalize consumption, both between parents and children (Stiglitz 
1978; Becker and Tomes 1979)‘ and among disadvantaged and ad¬ 
vantaged siblings (Becker and Tomes 1976; Griliches 1979). Because 
of the equalizing role of the family, government redistribution mea¬ 
sures may have little, zero, or even a perverse effect on the distribu¬ 
tion of income and consumption (Barro 1974; Becker and Tomes 
1979). 

The purpose of this paper is to empirically test the altruism model 
of the intergenerational transmission of inequality proposed by 
Becker, Ishikawa, and Blinder. In this model, parental bequests of 
material wealth and human capital investments represent substitute 
forms of intergenerational transfer and are simultaneously deter¬ 
mined. The present paper extends previous work in a number of 
directions. First, previous studies of inheritance (Menchik 1979; 
Adams 1980) have been concerned with estimating the income elas¬ 
ticity of bequests^—and hence the extent to which unequal parental 
incomes are responsible for inequality in subsequent generations. In 
contrast, this paper emphasizes the response of parental bequests to 
variations in children’s incomes resulting from differences in child 
ability or in the ability of parents to produce human capital in their 
childten. Since intergenerational transfers serve in part to redistrib¬ 
ute re.sources among family members, when parental income is held 
constant, the amount inherited is predicted to be inversely related to 
the recipient’s income. Thus, within parental income strata, bequests 
of material wealth are predicted to be eqicalizing. Further, if more 
educated parents are more efficient at producing learning or earning 
skills in their children, the progeny of more educated parents will 
receive higher lifetime incomes from human capital. As a result, the 
equalizing role of the family will also be reflected in an inverse re- 
lation.ship between inherited wealth and parental education. 

Second, the choice of family size, a variable emphasized in simula¬ 
tion models (Pryor 1973) and empirical studies (Lebergott 1976; 


' In the model of Becker and Tomes (1979). it can be shown that, if the marginal 
propensity of parents to transfer re.sources to children exceeds the inheritability of 
endowments, the relative inequality in the steady-state distribution of income (as mea¬ 
sured by the coefficient of variation) is less when parents choose the optimum level of 
transfers than if there were no intergenerational transfers. 

* A limitation of these studies is that estimates of the income (wealth) elasticity of 
bequests do not distinguish between alternative models of bequest behavior. Both 
altruistic models and models of "egoistic” life-cycle accumulation in the presence ol 
uncertain lifetimes and imperfect capital (annuity) markets are consistent with a high 
income elasticity of bequests (see Tomes 1979). 
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Blomquist 1979), is treated as endogenous,® as suggested by the “new 
microeconomic” approach to fertility. 

Third, the observation that 55—60 percent of individuals do not 
receive an inheritance of material wealth (Blinder 1973; Blomquist 
1979) suggests that there are two categories of households: those 
making transfers exclusively in the form of human capital who are at 
a corner solution making no transfers of material wealth; and second, 
households making transfers in both forms. These two groups (re¬ 
gimes) are predicted to respond differently to variations in parental 
income and education (Ishikawa 1975; Blinder 1976^; Edwards and 
Grossman 1977). In the absence of data on material wealth transfers, 
previous investigators (Edwards and Grossman 1977; Leibowitz and 
Friedman 1979) were restricted to indirect tests of this model. The 
present paper reports direct empirical tests of the two-regime model. 

Empirical tests of the model are f>erformed using a unique data set 
which contains information on the inheritance, income, and schooling 
of children and the occupation, education, and family size of parents. 

The empirical results offer strong support for the Becker, Blinder, 
and Ishikawa models of family intergenerational transfers. The most 
striking empirical finding is that the inheritance received by children 
is inversely related to the recipient’s income. This result supjxirts the 
prediction that, within parental income strata, inheritance is 
equalizing—low-income children are “compensated" in the form of 
larger bequests, and high-income children are “taxed” in that they 
receive smaller bequests. Alternative explanations for the inverse 
relationship between inheritance and recipient’s income are that it is 
the result of (i) the work-disincentive effect of nonlabor income on 
earnings, (ii) the payment by donors for greater old-age care ren¬ 
dered by low-income children, or (iii) the reallocation among siblings 
of shares in a given total bequest. These alternative explanations are 
not supported by the data, leading us to accept the hypothesis implied 
by the Becker, Blinder, and Ishikawa models: differences in inherited 
material wealth p>erform a role of equalizing the distribution of re¬ 
sources between successive generations of the family. 

The plan of the remainder of the pajier is as follows: Section 11 
briefly outlines a model of the simultaneous determination of inheri¬ 
tance, human capital investment, and family size which captures the 
essential features of the Becker, Blinder, and Ishikawa models of 

’Blinder suggests that "the taste for bet]uests . . . would rise with the number of 
children” (19766, p. 90). Easterlin (1976) presenu a “bequest” model of fertility in 
which parents choose the number of children consistent with achieving a given target 
bequest to each child. In the model outlined below, the bequest per child is simultane¬ 
ously determined together with parental fertility. 
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intergeneratiooal transfer behavior by the altruistic family. In Section 
111 the data are described, and the model is specified empirically. 
Empirical results are reported in Section IV. Finally, Section V pre¬ 
sents a summary and conclusions. 

II. A Model of Inheritance, Human Capital Investment, 
and Family Size 

Let us abstract from uncertainty and life-cycle factors'* and consider a 
static, certain world in which individuals, endowed with perfect 
foresight, live for two periods—initially as a “child” and subsequently 
as a “parent." Generations of the family are assumed to be interde¬ 
pendent and overlapping. Parental fertility and the transmission of 
resources to children are assumed to be motivated by altruism, in that 
the number of children and their per capita consumption enter the 
parent's utility function together with the parent’s own consumption. 
This utility function may be written as 

U - ( 1 ) 

where N is the number of children and is the per capita consump¬ 
tion of children during their subsequent tenure as parents. For 
simplicity, per capita consumption is a.ssumed to be the same for all N 
children within a given family. The quantity ip represents the aggre¬ 
gate consumption of parents.^ 

This formulation captures the essence of the Becker, Blinder, and 
Ishikawa models® in which parents derive satisfaction from a gross 
measure of the well-being of children—their lifetime utility, income, 
or consumption, fhe donor’s utility dttes not depend on the composi¬ 
tion of recipient's consumption (i.e., there are no “merit” goods), on 
direct services (e.g., old-age care), or on the sources of recipient’s 
lifetime income—the division between human wealth and financial 
wealth. 

Individuals are assumed to enter the world with an endowment of 
income determined by such factors as genetic and cultural inheri¬ 
tance, government fi.scal and social policy, exogenous economic 


'• This assumption rules out "tontingenl bequests" when life-cycic savings for tuture 
consumption become bequests due to the uncertain incidence of mortality. Bevan and 
Stiglitz (1978) and Tomes (1979) discuss such models. In the present model, all be¬ 
quests are planned and are motivated by parental altiuLsm. 

’ All households are assumed to be composed of two parents plus children. 1 do not 
distinguish between the consumption of individual parents. 

* The differences in the formations of Becker. Ishikawa. and Blinder may be of 
importance for some issues, such as the wealth effects of bond-Hnanced fiscal policy. Of 
greater importance in the present context arc the aspects that these theoretical models 
have in common. 
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growth, and random “luck.” For simplicity, I assume that all children 
within a given family possess equal endowments. Thus, endowed 
income may vary across families but not within families.^ 

Motivated by their altruism, parents may augment the consumption 
of their children either by investing in their human capital or via 
direct transfers of material wealth." The relationship between the 
child’s consumption and parent-child transfers is determined by the 
income-expenditure constraint of the child generation and the tech¬ 
nology described by the income-generating function, 

Zr = /r = « + /3x'“V’' ■\-a = e + k+a, (2) 

where k = is the constant elasticity human capital prixluction 

function, 0 < y, tj < 1. 

The left-hand side of (2) represents the child’s income-expenditure 
constraint which indicates that, in the ab.sence of transfers by the child 
to subsequent generations, consumption equals income.® The right- 
hand side of (2) defines income in terms of its proximate determi¬ 
nants: e represents the endowed income of the child; k, income from 
human capital; and a, the child’s income from material wealth.*® 
Financial asset markets are assumed perfect, so that the rate of return 
on material wealth is constant and independent of the individual’s 
stocks of human and nonhuman capital. 

In general, both market-purchased goods (e.g., tuition) and 
home-produced inputs (e.g., the time of parents) enter the produc¬ 
tion function for human capital. For simplicity, we assume that 
market-purchased goods and home-produced inputs are perfect sub¬ 
stitutes, so that a single aggregate input (.v) enters the production 
function. Investment in the human capital of a given individual is 
assumed subject to diminishing returns with respect to x (i.e., 1 > tj > 
0). The return to (home and market) inputs in the pnxluction of 
children’s earning skills is also permitted to depend on the child's 


’’ Becker and l omes (1976) consider differences in iransterc to children within the 
Sicme family W'hen children diffet in endowed ability. Evidence ielalinj{ to intrafamilv 
inec]uality m heqiiesis of financial wealth is presented in .Sec. IV. 

“We assume that all parents .ire at the intenor solution where nei parent-child 
transfers are positive—i.e.. altruistically motivated transfers are nonzero Even with the 
secular ^lowth of incomes over the generations, regiession toward the mean in en¬ 
dowed income and reveise (child-parent) transfers effected bv bond-hnanced fiscal 
policies and .scKial security piograins would tend u* make jwisilive privale parent-child 
transfers optimal for at least some households. Given the observed prevalence of 
“lorwaid" transfeis in forms such as schcMiling. this assumption seems plausible. 

“Becker and Tomes (1979) considei intcrgeiierational translers iii the ccjiiteM of 
many generations. 

'“For simplicity, stcxks of both human capital and material wealth aie measured in 
units of the child's income. Note also that in general r would represent a vector of 
earning and learning abilities. 
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characteristics—in the form of the child’s endowments. To the extent 
that the child’s endowment represents the ability to learn, greater 
child endowments will be associated with greater income for given 
inputs (i.e., y > 0). 

Families are permitted to differ in their ability to produce income- 
earning skills in their children (Leibowitz 1974). The parameter /3 in 
equation (2) represents the level of parental efficiency, which is as¬ 
sumed to operate in a manner equivalent to technological progress, 
increasing the next generation’s income for given inputs and child 
endowments. 

In the absence of intrafamily public goods and fixed cost comjx)- 
nents, and given identical children, the full income constraint of 
parents can be written as” 

Ip = Zp+ pNx + paNa, (3) 

where parental consumption (Zp) is the numeraire. Ip is the full in¬ 
come of parents, p is the unit price of human capital inputs, pa is the 
cost of increasing the asset income of each child by $1.00,'^ and pNx 
znd paNa represent (respectively) the total expenditures by parents on 
the human capital and material wealth of the next generation. 


The Commodity Demand Functions 

The allocation problem of parents is to maximize utility (1) subject to 
(2) and (3). The first-order conditions for an interior .solution are; 

^ = 1 - ( 4 ) 


■ 4 ^ 


pN 

)3x->’'(l - 1?) 


= paN — 'TTc, 


= px + Pan = p\-^j +paa^7ry, 


(5) 

( 6 ) 


where ^ is the marginal utility of parental income, is the marginal 
utility of consumption by the ith generation, and Un is the marginal 
utility of family size. 

The marginal rate of substitution between each commodity and 
parental income is equated to the marginal cost defined by the first- 
order conditions (ztj for the jth commodity). The interaction of 


" Since all children have equal endowments and the same income-generating func¬ 
tion (2), human capital investment and material wealth transfers will be equal for all 
children within a given family. 

'* In the absence of estate and income taxes, /»„ = (I -t- r)'‘, where r equals the 
intergenerational rate of return on assets. 
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“quality and quantity” dimensions of choice is reflected in the fact that 
the marginal costs of heir’s consumption and family size (ttc and 
respectively) depend not only on prices but also on the level of 
commodity outputs. Because of this interdependence, we predict an 
inverse relationship between family size and parent-child transfers. 

The opdmum scale and mode of intergenerational transfers are 
determined by the first-order conditions (4) and (5). We assume that 
the (increasing) marginal cost of human capital investment is initially 
below the (constant) marginal cost of asset transfers. If loans from 
outside the family or intrafamily loans from parents are available, all 
households would invest in human capital up to the point where the 
marginal costs of both forms of transfer were equalized. Under these 
circumstances, human capital investment would be independent of 
the “opportunities" defined by parental income and family size and 
would be determined solely by parental efficiency and the child’s 
endowed ability. 

Observation suggests that such equality of opportunity among 
households does not exist as regards human capital investment be¬ 
cause of the absence of collateral and the high cost of enforcing 
implicit or explicit loan contracts. In the present model, this imper¬ 
fection in the capital market is modeled in an extreme form by 
assuming that alt human capital investment is financed internally 
within the family and neither intergenerational loans nor bequests of 
debt are permitted (i.e., a s 0). Under this financial liability con¬ 
straint, all human capital investment constitutes an intergenerational 
transfer. In this context there will exist two categories of families— 
those making transfers on a scale sufficient to make positive asset 
transfers for whom the financial liability constraint is ineffective, and 
those households who would like to transfer debt to the next genera¬ 
tion but for whom the financial liability constraint is binding. 

For households for whom the financial liability constraint (a s 0) is 
binding, the first-order conditions (5) and (6) require modification to 
become 


'V 


pN 


/3x-V(l - v) 


= TTr < paN, 




= TT.v. 


(5') 

( 6 ') 


This latter category of households will make all transfers in the 
form of human capital, and material wealth transfers will be, zero. 
Since, for these households, marginal transfers are in the form of 
human capital investment, the level of such transfers will depend on 
parental income and family size—determinants of opportunities—in 
addition to parental efficiency and child endowments. 
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For both categories of households, the solutions to the appropriate 
first-order conditions can be represented as a set of demand functions 
for family size and the consumption of both generations in terms of 
“family resources” and commodity marginal costs (wj’s) (Becker and 
Lewis 1973). The demand for the marginal form of transfer repre¬ 
sents an excess demand function for children’s consumption. 

If the parental utility function (1) is homothetic CES with equal 
elasticities of substitution between all arguments, and if we assume 
that the next generation’s consumption (in present value terms) rep¬ 
resents no more than half of family resources,’* the following sign 
predictions are obtained; 

For a > 0: 

(+) I?) (~> (-> 

a = j3. A, N], (7a) 

( + ) (7) (7) (-> 

/V = §(/„, /3, A, a], (7b) 

For rt = 0; 

(+» (•') (-) <-) 

P(rt > 0) = f>[/p, J3, A, N], (8a) 

Ar = /L[/„. H, A], (8b) 

where equation (8a) predicts the probability of positive material 
wealth transfers. 

The major predictions of the model can now be summarized.’^ 
When asset transfers are positive, greater parental income is pre¬ 
dicted to increase both a.sset transfers and family size. Since the 
demand for material wealth transfers represents an excess demand 
for child consumption, the ratio of asset transfers to family size will 
increase with parental income, and further, the income elasticity of 
material wealth transfers is predicted to be decreasing (Becker and 
T omes 1976). The inverse relationship between quality and quantity 
dimensions of choice is reflected in the negative signs on family size 
and asset transfers in equations (7a)—(7b). 

Material wealth transfers are predicted to be inversely related to 
children’s income frtim human capital, reflecting the substitution 
between the two modes of transfer. When parental efficiency is held 


t his IdUer coiidilion is siif hcicnl to sign some ot the comparative static results. This 
condition will hold it the rale of timc/gciieraiion prelerente—refleiting both "impa¬ 
tient e” and the fact that consumption on the part of one's heirs is not the same as own 
consumption—exceeds the rate tif return net of the (endogenous) rate ol population 
growth, t.c., 6 > r - g > 0, where g - N — I. 

The compai ative .statics and predictums ol the model are di.scusscd in detail in an 
earlier version of this paper. See al.so Tonics 1981o, I98I4. 
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constant, variations in human capital reflect differences in child en¬ 
dowments. An increase in child endowments raises the next genera¬ 
tion’s income for given parental expenditures, which increases family 
resources and therefore the consumption of both generations. The 
increase in parental consumption is financed by reduced parental 
expenditures on children. Since expenditures on children’s human 
capital are an increasing function of child endowments, this requires 
a more than offsetting reducuon in material wealth transfers.'^ In this 
way, the reduction in intergenerational asset transfers redistributes 
resources away from children with high endowment incomes and 
reduces the inequality in consumption between members of the next 
generation, as compared with the case when transfers aie inde¬ 
pendent of endowments. 

Equations (7a) and (8a) indicate that the ef feet of parental efficiency 
on both the magnitude and probability of receiving positive material 
wealth transfers is ambiguous when the recipient’s human capital is 
held constant. However, when the scale of human capital investment 
is permitted to adjust (i.e., k is not held constant), greater parental 
efficiency is predicted to reduce both the magnitude and probability 
of receiving asset transfers.'® Greater parental efficiency in the pro¬ 
duction of children’s human capital reduces the cost of this mode of 
transfer and increases family resources. This produces both a sub¬ 
stitution effect, increasing human capital investment and reducing 
material wealth transfers, and an income effect, increasing parental 
consumption, financed by reduced asset transfers. 

A second set of predictions concerns the determinants of human 
capital investment in the two regimes—families making positive and 
zero material wealth transfers. Assuming all variables are measured in 


'' 111 the simple case in which y = tj, i e , the human capital proctuctinn liinction (2) is 
CRT.S in X and c. the elasticity ol material wealth transfers with respet i to human capital 
tran.sfers: k (holding p, /,„ and iV constant) is ^iven by A (rf In aid In *) = -(frjl - h^) + 

{1 ~ *r[ I + (1 ~ t))<rj}), where A = fr„<l — > 0. the ft.'s aie expenditure shares in 

lamily lesources S = /„ + ’’’rlc - be = trri,/S.h„ = jr,a/5.Ar = n,r/S, and A* = p.\x/{ \ — 
'f)S. Since 0 < 1 — r? < 1 aiul ihe elasticity ol substitution cr < I (by ihe second-order 
loiidilions), a sufficient condition for the almvc expression to be ncjrativc is that the 
share of children’s consumption (in present value terms) in family resources, h^. not 
exceed 'A. 

'* In the ca.se considered in the previous fiKilnote, the elasticity of material wealth 
transfets with resjiect to parental efficiency P. when human capital transfers are 
permitted to vary (holding e, and N constant) is given by: ,\(d In aid In p) = 
— Akiy 1 — A,,[ 1 -(- (1 — T))tr)}. This expression is negative if A, ‘/s. Kurther. it can be 
shown that in the reduced-form txjuation in which N is permitted to vary (obtained by 
substituting eq. [9a] into L7a]-17b) and solving tor <?), greater parental efheienev re¬ 
duces a.sset transfers, provided that the returns to parental expenditures on human 
capital do not exceed !4 (Tomes 1981a). Heckinairs estimates ol the returns to scale in 
the prcxluction of human capital do not differ significantly from '■i (Heckman I976o. 
table 3A). 
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logs and given the assumptions concerning technology and the udlity 
function, these predictions can be summarized as: 

For o > 0: 

(-0) <+) (+) l-O) 

A = ro + yJp + yif + ya/? + y*N. (9a) 

For a = 0: 

(+1 (?) (+) (-) 

k = p„ + p,Ip + pse + pa/i + P4N, (9b) 

Pi > y, = 0, yi > Pi, ya > pa, p4 < y4 = 0. (9c) 

Since the financial liability constraint is not binding on families 
making positive asset transfers, the scale of human capital investment 
is independent of opportunity variables—implying the zero restric¬ 
tions in (9a). For households at a corner solution making zero material 
wealth transfers on whom the financial constraint is binding, marginal 
transfers—in human form—depend both on parental income and 
family size. 

An additional testable implication is that human capital investment 
is more responsive to parental efficiency among households making 
positive asset transfers. This follows from the prediction that, for 
hou.seholds making positive asset transfers, the substitution effect on 
human capital transfers resulting from greater parental efficiency 
exceeds the sum of income and substitution effects experienced by 
families making no material wealth transfers (see Edwards and 
Grossman 1977; Tomes 1978). The predictions summarized in 
equations (7)-(9) are subject to empirical test in Section IV. 

III. Empirical Specification 

When we move from the theory to the empirical specification of the 
model, two points require discussion. First, the theoretical model 
assumed both perfect foresight and certainty. Under these conditions, 
individuals face a single allocation problem at the beginning of their 
parent status and have neither the desire nor the need to revise their 
decisions in the course of their lifetimes. However, in the real world 
both certainty and perfect foresight are lacking. Therefore, based on 
their future expectations, parents determine their fertility, the level of 
human capital investment, and material wealth transfers, with the 
possibility of revision if realized outcomes diverge from their expec¬ 
tations. Given that in the typical family life cycle completed fertility is 
determined prior to the completion of investment in the human 
capital of children and that material wealth transfers may occur at any 
lime until the decease of the parents, these decisions are subject to 
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revision to differing degrees. In addition, human capital investment 
may be completed before the child’s income-earning ability becomes 
known to either generation, since that ability may be subject to 
stochastic influences such as the occurrence of ill health and disability. 
Given that the parents’ fertility is predetermined and given the pre- 
commitment of human capital investment, no adjustment of these 
choice variables may be possible or desirable. However, the level of 
material wealth transfers may be adjusted in the light of such events. 
In the empirical specification of the model I view the dependent 
variables as Jointly determined. However, uncertainty and the tem¬ 
poral ordering of events suggest the introduction of variables which 
measure the realization of events over the life cycle. 

Second, the theory derives predictions concerning the total trans¬ 
fers of material wealth to the next generation which may take the 
form of gifts during the parental lifetime or bequests at the decease 
of parents. Data on gifts are notoriously difficult to obtain, and the 
data set I use does not contain this information. However, the omis¬ 
sion of information on gifts presents less of a problem for the present 
study which employs data on small wealth leavers. In these data there 
is little tax incentive in favor of gifts over bequests.” In addition, the 
prospect of unanticipated changes in the lifetime incomes of children 
and parents creates an incentive to defer material wealth transfers 
until the parents’ decease. The inheritance received by the child is, 
therefore, used as a measure of material wealth transfers. 


The Data 

The data used in this study derive from a 5 percent random sample of 
estates probated in the Cleveland, Ohio, area in 1964-65, yielding 
659 estates (Sussman, Cates, and Smith 1970). Surviving kin and 
other heirs were interviewed, including 657 sons and daughters of the 
decedents. Information was obtained on the total estate, usual occu¬ 
pation, education, and other characteristics of the deceased and on 
the inheritance, income, education, and other characteristics of the 
surviving kin. The principal variables are briefly described here and 
defined, together with other variables, in table 1. 

” The mean gross estate of decedents was $12,000, and less than 5 percent of 
decedents’ estates in this sample exceeded $60,000 alter deductions and were liable for 
Federal F.state Tax (Sussman et al. 1970, p. 188). Ohio had (m 1964-65) an inheritance 
Ux which commenced at 1 percent on the inheritance of children in excess of 
$7,000—almost double the mean inheritance in the data 1 use. Information gleaned 
from Barlow, Brazer, and Morgan (1966) suggests that the omission of data on gifts 
may not be a serious problem in terms of omitted variable bias and that the income 
elasticity of gifts exceeds that of bequests (for details, see the earlier version of this 
paper). Menchik's study of large Connecticut estates supports this latter conclusion 
(Menchik 1979, table 4). 
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The inheritance received by the child is used as a measure of 
material wealth transfers. Theory suggests that inheritances will be 
zero for families where financial constraints are binding. In accor¬ 
dance with this prediction, the observed distribution of inheritances is 
truncated—41..5 percent of children (251 cases) received the lower- 
limit value of $250'* from the decedent’s estate. The natural spec¬ 
ification of the inheritance equation for the complete sample of 
children is a Tobit scheme: 

a = ^X-u if pX-u>.250, 

a = .250 if f3X ~u^ .250, 


TABLF. 1 

Dehniiion ok Variabi.ks 



.SAMin.K OK Al.t. t'llli riRKN 


T'hc'orctKal 

Variable 

Empiiical Measuie 

Mnemonic 

Material wealth 

Inhfnramc re<^ived by .von/daughier 

INHR 

transfers 

($00()s)« 


Human capital 

Annual family income of son/daughter 

IN COMERS 

investment 

(OOOs)t 



Years of schooling of son/daughter 

SC HR 

Family size 

Total nunilier of kin of decedent 

ENKIN 

Paiental income 

Constructed measure of decedent’s “perma¬ 
nent" income (00()s)§ 

INCOMED" 

Parental 

Years of schixiling of decedent 

SCHD 

efficiency 

Additional 

Race of dcredeiu (white = 1, nonwhile * ()) 

RACE 

paixTit 

■Sex of decedent (female = 1, male = 0) 

SEXD 

charat icrisiics 

Age c»f decedent 

ACiED 


Age ot decedent squared 

AGED.SQ 


interaction txrtwcen age and schooling of 
decedent 

AGED SCHD 


Marital statusol dccedcnt at death; maiiied 
(omuied category: widowed, divt)rted, ov 
single) 

MARRIEDD 


Religion of decedent: Jewish, Piotestaiit, 

rJEWD 


Caathoiic (omitted category: no religion. 

PROTD 


Eastern Orthodox) 

L GATHD 


Origin of decedent: birthplace in U,S., 

r ORfGINUS 


Western Europe (omitted category: birth¬ 
place Eastern Europe or other) 

lORlGINWE 


Number of kin of decedent, other than chil-. 
dren 

OTHERKIN 


"* Since the original data on inheritances arc coded in $1,000 intervals, we were 
unable to distingui.th between small mheiitances (< $500) and zero inheritances (see 
table 1, §). In ihe discussion Itelow we use the term "zero inheritance" to refer to 
children receiving an inheritance equal to the lower-limit value ccxled as $250 and 
“positive inheritance" to refer to an inheritance in excess of $250. 
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TABLE 1 (Continued) 



Sami-le of All Oiii.ohen 


Theoretical 

Variable 

Empirical Measure 

Mnemonic 

Additional 

Sex of recipient (female = 1, male — 0) 

SEXR 

recipient 

Age of recipient 

ACER 

characteristics 

Age of recipient squared 

AGER.SQ 


Marita] status of recipient* single, divorced, 
o! separated (omitted category, married) 
Number of household members employed 
Dummy variable for the presence of non- 
labiir income (1: positive nonlabor in¬ 
come; 0: zero nonlabor income) 

Sex of breadwinner in recipient’s household 
Freejuency of contact between the recipient 
and decedent" 

f SINGLER 

1 DIV/SEPR 
VEMPLR 
NLABINC: 

SEX 

VISITS 


* In the oripnal data the child's inhentamc is (oded in $000 intervals An inheriuime of Ir&s than $500 is 
iberefore rtxied as $250 (i e , 250), following Brittain (1970, p. 42). Kviimatrs of the model using a Jimii value of zero 
were virtually identical with those repotted. 

t The repot ted monthly family imome, whic h was onginally coded m categories, was recoded on an annual basis, 
using interval initlpoints for the closed intervals and using the (estimated) mean of a Pareto distnbuuon for the 
open-ended interval 

t * designates a variable entered in natural l<tgariihm 

ft Mie intorne of recipients INCOMtR (for those leporiing a p*«iiive family income) was regressed on a set of 
■permanent" characteiistus of reafn/nLf fhe estmiaied coefficicnis were (f?* » 216. .V * 60ft) 

INCOMER' -S9I5 - S25SCHR t OIACiER SCHR I04A<;tR 

r 0(X)2A(>KRS<^266 (K:c:R 502SEMPI R - IW5.SKX 
+ .627RACE + 4940RIGINWE+ 6580RICINLS + 320OR1CINEE 
■r (kiKPROI -t- SSfK'.ATH + I 27HJFW. 

where OK Id NEK * 1 it ret ipteiil i ongiii is Eastern E.urope and U otherwise. (X.C.R is a seven-category occupatiix) 
of lireadwinncr cfwle (coded from I unskilled, to 7 executive) fhesecorfhcienM, together with ihc lortr^pimding 
<haracterisitcs of the deci'dciit. w-ere used to construct the income variable IN(X>MFI) (NFMPl. w*s set equal to 
unity) 

“ I'he original data on the lres|ueiicy of contact were c«>ded in categories daily, weekly, monthly. Ie»» often, yanrs 
These categories were reoKfed in terms of the number of sisius per year ( • 100) The category "less often" was cixied 
as 03, 1 e . ihiee visits prr year t he <aieg<>ry "vane-s" and nttssing values were < ixled at the mean of the nonnussing 
observations 


where X is the vector of right-hand-side variables and u is assumed 
normally distributed, independent of'X, with zero expectation and 
variance cr^. In addition, we also present estimates of the determi¬ 
nants of the probability that an heir will inherit an amount in excess of 
the lower-limit value. 

The level of .schooling is the most frequently used measure of 
human capital investment. However, this variable corresponds to an 
input measure of human capital investment rather than the output 
measure suggested by the theory, which is the income of the child 
attributable to parental transfers. Also, schooling represents a deci¬ 
sion variable determined early in the life cycle, whereas income repre¬ 
sents the return to skills and other sources of income realized later in 
the life cycle. Life-cycle considerations suggest that family size may 
depend on children’.s schooling, whereas the inheritance received by 
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the child may be more closely related to the child’s realized income 
than the child’s schooling. The empirical model incorporates these 
considerations and includes an equation relating the recipient’s in¬ 
come to schooling and other characteristics. 

The reported total number of kin of the decedent was used as a 
measure of family size. Since this measure includes kin in addition to 
children, the number of these more distant kin was included as a 
control variable. 

No direct measure of the annual or lifetime income of the decedent 
is available. However, information on the “permanent characteristics” 
of the decedent is available. Given supplementary data on income and 
the same set of characteristics from another source, one can construct 
a predicted income variable for decedents. The subsample of the 
surviving kin of these estates was used for this purpose. First, the log 
of recipient’s family income was regressed on his permanent charac¬ 
teristics; then the estimated coefficients, together with the corre¬ 
sponding permanent characteristics of the decedent, were used to 
construct a measure of the decedent’s “permanent” income. The 
education of the decedent was used as a measure of parental effi¬ 
ciency. When parental income (and hence market efficiency) is held 
constant, a higher level of parental education is interpreted as imply¬ 
ing a greater level of nonmarket efficiency. Additional characteris¬ 
tics of the decedent (age, sex, race, religion, origin [birthplace], and 
marital status) and of the recipient (age, sex, marital status, and 
number of individuals employed in recipient’s family) were included. 
Some of these variables hold constant life-cycle factors (e.g., age), 
while others may be interpreted as crude measures of some dimen¬ 
sions of the child’s endowment. 

Several writers have proposed that parental bequests may represent 
a payment by parents in return for services, such as care during old 
age, provided by children'* (e.g., Sussman et al. 1970; Parsons 19756). 
If low-wage children possess a comparative advantage in the provision 
of such services, as seems plausible, an inverse relationship between 
inheritance and the recipient’s income could be observed even in the 
absence of altruism. In order to account for this {xissibility and in 
an effort to test this alternative hypothesis, an additional variable— 
the frequency of contact between the recipient and the decedent 
(VISITS)—was included in the inheritance and income equations. If 
some children sacrifice market income in order to care for their aged 

E.g., Sussman et al. (1970) state: “Testators will their estates to designated children 
. . . according to their perception of their needs, emotional ties with them, and services 
exchanged among family members over the years." Children "generally accept the 
notion that the sibling who has rendered the greatest amount of service to an aged 
parent should receive a major portion of the inheritance” (p. 118). 
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parents in return for a larger inheritance, the VISITS variable should 
enter with a negative coefficient in the income equation and a piositive 
coefficient in the inheritance equation. 

The empirical model consists of four equations in which the de¬ 
pendent variables are the inheritance, income, and schooling of the 
child and the total number of kin of the decedent. Since the determi¬ 
nants of human capital investment are expected to differ according to 
whether financial constraints are binding, estimates of the complete 
model are presented separately for the two subsamples according to 
whether the recipient inherited in excess of the limit value or the 
minimum amount. 

Finally, in accordance with the previous discussion, life-cycle factors 
suggest certain identifying restrictions. Thus the marital status of the 
decedent at death—the presence or absence of a surviving spouse—is 
assumed to influence only the inheritance and to have no inde¬ 
pendent effect on the remaining endogenous variables. Also, the 
marital status of the recipient (divorced or separated, married or 
single) enters the schooling, income, and inheritance equations but is 
excluded from the family size equation. 


IV. Empirical Results 

Table 2 reports estimates for the sample as a whole. Row 1 presents 
the Tobit estimate of the inheritance equation, while row 2 presents 
the corresponding two-stage least-squares (2SLS) estimate. The im¬ 
portant qualitative results are the same, but since the latter does not 
take account of the truncation of the dependent variable, I shall 
concentrate on the Tobit estimate. 

The important finding is that the amount inherited is inversely 
related to the recipient’s income, with an elasticity of —0.92 when 
evaluated at the mean inheritance.*® This result contrasts with the 
findings of previous studies using these data.*' Before accepting this 
result as confirmation of the theoretical model, several alternative 
interpretations should be considered. First, since the right-hand-side 

The regression in row I omits parental education. When this variable is included, n 
IS insignihcanl (I = 0.628), while the recipient’s income is on the margin of significance 
at the 5 percent level (( = 1.932). 

Brittain (1978), in an analysis of these data designed to detect differences in 
inheritance by the sex of the recipient, reported that '“need' for bequests indicated by 
the prebequest economic status of the son and daughter was tested, but found to have 
no influence. This is . . . consi.stcnt with equal division among siblings" (p. 45, n. 47), 
However, Brittain's regression (pp. 42-45) included only the number of children and 
the gross estate of the decedent as regressors in addition to the recipient’s .sex. Adams 
(1980) also analyzed these same data to estimate the income elasticity of bequests, but 
since the income of children was used as a measure of the decedent’s income, he did not 
separate the effects of child's income and parent's income on material wealth transfers. 
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variable is the recipient’s income rather than wage rate, it may reflect 
the labor supply of family members. Since the receipt of an inheri¬ 
tance is predicted to increase the demand for leisure, the work- 
disincentive effects of inheritance also imply an inverse relationship 
between inheritance and family earnings^ (Heckman 1976a). In 
order to evaluate this possibility, the inheritance and income 
equations were estimated simultaneously in rows 2 and 3, with the 
recipient’s income permitted to depend on the amount inherited.^® 
The inheritance enters the income equation with a significant piositive 
coefficient, suggesting that the income variable (INCOMER) may 
represent a better measure of income gross of the inheritance rather 
than earnings (see n. 22). The income equation reported in row 3 
constrains the family background variables (family size, decedent’s 
income and schooling) to enter recursively via the recipient’s school¬ 
ing. This assumption is relaxed in row 4 to allow for the direct 
influence of these variables on recipient’s income via lil'etime gifts and 
other mechanisms. The conclusions are unchanged—inheritance 
continues to enter with a positive coefficient which is on the margin of 
significance at the 10 percent level. The available evidence does not 
support the work-disincentive hypothesis 
Second, as mentioned previously, an inverse relationship Ijetween 
children’s income and inheritance may also arise if some children 
sacrifice market earnings in order to provide care for their parents, in 
return for a larger ber|ue.st. In order to test this possibility, the VIS¬ 
ITS variable was introduced into both income and inheritance 
equations. I his variable enters the income equation with a significant 
negative coefficient, suggesting that the opportunity cost of child- 
parent interaction is positive. Surprisingly, the VISITS variable enters 
with a significant negative coefficient in the inheritance equation.®'* 

In a sinj^lc-peruxi labor/lcisure inode! die percent change in earnings (E) in 
response to n 1 percent change in nonlaljor income (E) is given bv (rf In E/d In F ) = 
l/Cv/(l - AV)K.Sz7}z - 1) = l-/C,/(l - A',.)]5 ,t,, <0irij, > 0. where ic,. = 17/, .V, = wl/E. 
S/ = PjZIF, whereE is lull income. / money income, / leisure, and Z consumption, and 
r?! IS the income cla.sticity of demand lor good j I he percent change in money income 
resulting from a I [icrcent change in V is(</ Inlld liiF) = = A', (I - 5 , 77 ,) > 0 if rjj 

> 0 . 

In the income equation leportcd m row 3. the inheritance vanable was predicted 
by using a I'obit regression to take accouni of die inincalion problem. The results do 
not differ substantially when this variable is entered m the usual manner, ignoring the 
truncation at the lower limit. The income equation also contains a dummy variable for 
the presence/abscnce of nonlabor income (NLABINC.) and the number of houselitild 
members employed (NF.MPLR), in order to hold constant differences in labor force 
participation Iretwcen households. The estimated coefht'ient on schooling in the income 
equation is “large” in comparison with estimates of the rate of return to Schooling. 
However, the dependent variable is (the log of) family income rather than individual 
earnings. If both the education levels and earnings of siKiuses are positively i orrclated, 
this could explain the large marginal effect of schooling. 

*■* This conclusion differs from that reached by Sussman et al. (1970) based on the 
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This result presents prima facie evidence against the pure “child- 
services” model of inheritance (i.e., with no altruism), since the dece¬ 
dent as a monopsony demander of child services would never be¬ 
queath an inheritance such that the recipient located on the 
backward-bending region of the child-services supply schedule.®* We 
therefore conclude that the negative relationship between the amount 
inherited and the recipient’s income confirms the existence of an 
operative altruistic bequest motive and is consistent with the predic¬ 
tion that inherited material wealth serves as a means of redistributing 
resources among family members. 

Other findings may be briefly summarized. The decedent parent’s 
income enters the Tobit inheritance equation (row 1) with a 
significant positive coefficient, implying an income elasticity of 0.84 
when evaluated at the mean inheritance. Last, the total number of kin 
(TNKIN)—the empirical measure of family size—enters the inheri¬ 
tance regression with a significant negative coefficient, consistent with 
the existence of substitution between family size (quantity) and the 
per capita bequest (quality). 

Row 5 reports a logit regression in which the dependent variable is 
the probability of receiving an inheritance in excess of the lower-limit 
value.®** This equation represents a test of the ability of the model to 


study of various cases: "The data indicate general agreement that the child who 
performs the greatest amount of service should receive the greatest part of the esute" 
(p. 119). The results reported here do not invalidate this explanation of estate distribu¬ 
tion in specific ca.scs. Post hoc explanations of this finding are (i) that contact with aged 
parents is a time-intensive and inferior commixlity or (ii) that children reporting the 
greatest frequency of contact are compensated during the parental lifetime (c.g., by 
living rent free in the parental home). A more complete model would include VISITS 
as an endogenous variable. Along these line.s a simple OIS “supply of visits" equation 
was estimated (t-statistics in parentheses; t = In); 

VISITS = 2.593 - .097INCOMERt - .004INHR .356NIJVBINC 
(2.831) (1.021) (2.743) 

+ .790SINGLF.R + .560SEXR + .314NEMPLR 

(3.950) (4.786) (3.528) 

- OOSINCOMEDt - .025.SCHD - .138TNK1N 

(.078) (.481) (4.059) 

- .561RACE - .282MARRIEDD .700JEWD, 

(1.670) (2.169) (2.456) 

«‘ = .I62. A( = 605. 

Tile recipient's income enters with a significant negative coefficient, while the inheri¬ 
tance coefficient is negative but not significant. These results are subject to possible 
simultaneous-equations bias. 

“ However, this finding u consistent with a child-service mode) which incorporates 
parental altruism. 

•‘The logit procedure assumes the distribution of u is Sech’ rather than normal. 
However, the cumulative distribution of the logistic function closely approximates the 
normal except in the tails (Ncriove and Press 1973). As expected, a probit regression 
produced similar results. 
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assign individuals to the alternative regimes*’ (see eq. [8a]). As pre¬ 
dicted by the theory, the recipient’s income and the number of kin 
enter with significant negative coefficients.** In contrast to the Tobit 
results (row 1), neither the decedent’s income nor the VISITS vari¬ 
able is significant in the logit regression, while the decedent’s school¬ 
ing enters with a significant positive coefficient. 

The final row of table 2 reports a reduced-form Tobit inheritance 
equation. This regression allows us to determine the effects of pa¬ 
rental income and education on inheritance when family size and the 
recipient’s income are permitted to vary. In this equation, the income 
elasticity of inheritance is 1.71, evaluated at the mean of the de¬ 
pendent variable—a value significantly different from unity at the 6 
percent level. This estimate is considerably larger than the estimate in 
row 1, a result consistent with the prediction of Becker and Tomes 
(1976) that, because of the effects of quality-quantity interaction, the 
“observed” income elasticity of child quality will exceed the underly¬ 
ing “true” elasticity. The decedent’s schooling enters the reduced- 
form inheritance regression with a significant negative coefficient. 
This confirms the prediction that more educated parents invest more 
in their children’s human capital (see table 3) and as a consequence 
make smaller bequests of material wealth.*® The parameter estimate 
implies that each additional year of decedent’s schooling reduces the 
child’s inheritance by almost $2,000—a sizable effect in relation to the 
average inheritance of $4,257. 


Estimates of the Complete Model 

Table 3 presents estimates of the four-equation model. In these re¬ 
gressions the sample is partitioned into two categories according to 
whether children inherit in excess of the limit value or not. Since the 
self-selection of households into either category results in the possibil¬ 
ity of sample-selection bias, the correction procedure outlined by 
Heckman (1976fr, 1979) was implemented. This procedure involves 
initially computing Mills ratios from the probit equation of the selec¬ 
tion rule and including these “omitted” variables—designated “ko” 


” As an intuitive test of ability of the logit equation to assign individuals to regimes, 
recipients were assigned to 0,1 categories according to the predicted P S .5. This 
resulted in the “correct" assignment of 73.12 percent of recipients as compared with a 
50 percent "correct" assignment based on random assignments using the populauon 
proportions. 

“This first result confirms the finding of Sussman et al. (1970) that “those most likely 
to inherit were persons with the smallest monthly incomes" (p. 155). However, those 
authors also state that "the percentage of survivors who inherited did not vary greativ 
among [survivor] income categories" (ibid.). 

“Additional empirical evidence in support of this prediction is reported in Tomes 
(1981ft), where it is found that the total estate of the decedent is inversely related to the 
decedent's education, holding the decedent’s income and other characteristics constant. 
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and “Xft” in table 3—in subsequent regressions using the selected 
sample.*® Table 3 reports the results obtained using SSLS, which 
allows the error terms of the equations to be correlated. 

Rows 1-8 report results for children who received a bequest in 
excess of the limit value. The estimates of the inheritance and income 
equations duplicate quite closely those obtained for the whole sample 
(table 2, rows 1-3) and require little additional discussion. Recip¬ 
ient’s income and parents’ income enter the inheritance equation with 
the predicted negative and positive .signs, with elasticities (at the 
mean) of — 0.74 and -I- 0.82, respectively.*' In contrast to the results in 
table 2, inheritance is not significant in the income equation (row 2), 
and the VISITS variable is not significant in line 1 or 2.** 

Although the semilog specification of the inheritance equation may 
be preferred on the a priori grounds that it embodies the prediction 
of a decrea.sing income elasticity, the constant elasticity formulation 
was also estimated (row 5). The estimates show that the major qualita¬ 
tive results are not dependent on the use of the semilog functional 
form. 

The results reported in table 3 represent a direct test of the two- 
regime model. The major question to be addressed is whether the 


” U' (10) represents the inheritance equation, the expected values of variables Zj 
(or the censored samples can be shown (see Willis and Rosen 1979) to be 

E{Z,\a > .250) = yjX + £(«,! >ji) = y^X 4- 

(Ti, 

£(Zj|a = .250) = y,X + £(ej|/3X <s u) = y^X + 

where 

A, JL < = -/(;8X/cr.)/£(/3X/(T.) 0, 

\ CTjj ' II ^ u ' 

- £(— -!L a = f(fiX/cr.)/l 1 - £(j0X/o-.)) * 0. 

where £ is the cumulative normal density and/ is its probability density function. The 
estimation procedure employs the estimated values A„ and from a probit regression 
including all the exogenous vanables except VISITS—because of the dubious 
exogeneity of this variable (see n. 24). The present e of the selection rule introduces 
heteroscedastinty into the equation residuals, resulting in upward-biased <-statistics. 
However, in the present context, there is a presumption that the bias is small. Applying 
the appropriate correction procedure to the l-statistics from the 2SLS estimates resulted 
in only marginal changes in the t-statistics. 

The significant positive coefficient on in row 9 indicates that individuals who 
inherit zero receive higher incomes than individuals with the same income-earning 
characteristics in the whole sample. This also supports the compensation hypothesis. 

“The income equation (row 2) constrains the background variables (family size, 
decedent's income and schooling) to enter recursively via the recipient’s schooling. 
When these variables are entered directly (results not reported here), the major conclu¬ 
sions are unaffected. 
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determinants of human capital investment differ among households 
according to whether material wealth transfers are positive or zero. A 
comparison of the schooling regressions for the two subsamples (rows 
3 and 10) indicates differences which are in the predicted direction. 
For children receiving an inheritance in excess of the limit value, the 
level of schooling is unrelated to either parental income or family size 
but is positively related to parental education. In contrast, for chil¬ 
dren who received a zero inheritance, the level of schooling depends 
on parental income (with an income elasticity of 0.07) but is unrelated 
to parental education. Thus opportunities (parental income) are im¬ 
portant for households making zero material wealth transfers, while 
parental education (one determinant of ability) is more important for 
households making positive material wealth transfers. However, con¬ 
trary to the theoretical prediction, the family size variable—a deter¬ 
minant of oppjortunities—is not significant in the school equation for 
children who inherit zero (row 10) upon whom financial constraints 
are expected to be binding. 

The family size regressions also differ between the two subsamples 
(rows 4 and 11), although concerning these differences we have no 
predictions. For families making asset transfers in excess of the limit 
value (row 4), parental income has a positive and significant co¬ 
efficient—the only evidence in these data of a positive income elas¬ 
ticity of family size. Parental education enters with a negative coeffi¬ 
cient which is significant at the 10 percent level. In contrast, for the 
sample receiving a zero inheritance (row 11), the parental income 
variable is negative and not significant, while parental education en¬ 
ters with a significant positive coefficient. These results underline the 
complexity and ambiguity of the relationships among family size and 
parental income and education. The one result common to both 
subsamples is that measures of child quality (the child’s inheritance 
and/or schooling) enter with significant negative coefficients in the 
family size equations, consistent with the existence of a quality- 
quantity trade-off. 


AdditioTial Results 

A major finding of the foregoing analysis is that, when parental 
income and education are held constant, children with higher in¬ 
comes receive a smaller inheritance of material wealth. This is consis¬ 
tent with the theoretical prediction that parents respond to dif- 

** The significant positive coefficients on X, in rows 4 and 8 imply that those who 
receive a positive inheritance are more likely to come from a smaller family than 
random individuals with the same characteristics. This also implies a quality/quantity 
trade-off. 
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ferences in the ability of’children by reducing transfers to childr 
order to finance greater parental consumption. In this way, pa 
share in the returns to greater child endowments. However, an • 
native interpretation is that the observed inverse relationship bet 
inheritance and children’s income reflects (pardy or entirely) the 
of inheritance in compensating for differences among siblings r; 
than between parents and children. Thus, if parents bequet 
larger share of the estate to the child with the lowest income i 
smaller share to his advantaged sibling, this also implies an in 
relationship between inheritance and the recipient’s income. 

This issue can be addressed using data on 137 families for vv 
information is available on more than one child. In this subsar 
41.6 percent of heirs (.siblings) received exactly equal shares, 
ever, this figure includes cases in which all heirs received the 
value of zero and therefore, by definition, inherited equally. Ex 
ing such ca.ses, 21.1 percent of heirs inherited exactly ecjual sh 
and the vast majority of heirs (78.9 percent) received unequal s 
in the parental estate. However, some of these between-siblinj. 
ferences are small: 50.4 percent of heirs received approximately ■ 
shares—within $500 of the average inheritance received by chili 
Nevertheless, the incidence of equal division is dramatically 1 
than that found by Menchik (1980) in his sample of large Connei 
estates. Menchik found that “equal sharing among children i 
rule" (p. 299). For example, among two-child families, in 62.5 pe 
of cases both children received exactly equal bequests and in 
percent of cases approximately equal bequests (Menchik 198 
310). The corresponding figures for two-child families in the pr 
sample are 22.2 percent and 44.4 percent.^'' The fact that st 
equal division is not the rule in this sample suggests that the possi 
that inheritance is equalizing between siblings warrants investiga 

Fable 4 reports the results of Tobit regressions for various sub 
pies of children from families with information on multiple heirs 
availability of multiple observations per family also permits the i 
sion of an unobserved family component in the income predi 
equation. 'Fhe constructed measure of the decedent’s income 


** t wo possible explanations ol this difference are: (i) In larj^e estates donor 
make sizable transfers in the form of inter vivos gifts in order to avoid estal 
inheritance taxes. Thus, unequal lifetime gifts by wealthy donors may serve the 
functions as unequal bequests by less wealthy donors, (ii) Apparent equal divis 
probate records is consistent with de facto unequal division when estates are rec 
uted among heirs. .Sussman el al. (1970) report that, in 27 percent of intestate 
(i.e., the decedent left no will) for which the law prescribed equal division : 
siblings, ex post redistribution among siblings occurred resulting in unequal shar< 
143-44). Thus probate records may underestimate the incidence of unequal div 
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ployed in table 4 includes this family component.®* In these regres¬ 
sions, both the mean income of children and the deviadon of the 
child’s income from the family mean are introduced as determinants 
of inheritance. The latter variable is predicted to enter with a negative 
coefficient if disadvantaged children receive larger shares in the pa¬ 
rental estate. This variable enters with the predicted negative sign but 
never attains statistical significance. In contrast, the mean income of 
children enters consistently with a highly significant negative sign. 
Estimates of the elasticity of inheritance with respect to the mean 
income of children cluster closely around — 1.8, evaluated at the mean 
of the dependent variable. Thus, when we hold income inequality 
among children constant, the higher the average income of children, 
the lower the inheritance received by children. These results suggest 
that the inverse relationship between recipient’s income and inheri¬ 
tance reflects for the most part a reallocation of resources between the 
generations of the family rather than simply the redistribution among 
siblings of a given total bequest to children. 

These results for the subsample for which information is available 
on multiple heirs and also those for the subsample with information 
on both the decedent and surviving spouse®* (not reported here) 
confirm the results reported earlier for the whole sample. 


V. Summary and Conclusions 

I have presented an empirical model of the iniergenerational trans¬ 
mission of inequality within the altruistic family. This paper adds 


“ Adams (1980) employed a conceptually similar, but le.ss appealing, technique using 
the mean income of recipients in a family to measure the decedent’s income. The 
procedure employed here was to estimate an income prediction equation similar to that 
in table 1, §, for the subsample for whom information is available for two or more 
survivors—either children or the surviving spouse {N = 431). The mean residual was 
then computed for each family and entered in a subsequent income equation. The 
estimated coefficients from this second regression, together with the dec^ent’s char- 
artenstics and the family mean residual, were used to construct the measure of the 
decedent’s income. The weight attached to the family residual depends on the extent to 
which the unobserved family component is correlated between spouses and transmitted 
from parents to children. In the absence of information on this parameter, weight.s 
were used in the umt interval (0.0, 0.1,..., 1.0). The reported results employ the weight 
which minimized the standard error of the regression (see note to tabic 4). In practice, 
the measure of the decedent’s income used in tables 2-3 pierformed almost as well a.s 
the measure used in table 4. (See the results reported in the earlier version of this 
paper.) The estimated income elasticity of inheritance did not differ in a systematic 
fashion using the two alternative income measures. 

“For this subsample (,N — 191), the effects of mother’s and father’s education, ages, 
and incomes were analyzed; also, a measure of the prediaed bequest from the surviv¬ 
ing spouse to children was constructed and used in the subsequent recessions. The 
results, which confirm the basic conclusions of this study, were reported in the earlier 
version of this paper. 
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empirical content to the theoretical models of Becker, Blinder, and 
Ishikawa in which bequests of material wealth and investment in 
children’s human capital are simultaneously determined and repre¬ 
sent substitute modes of parent-child transfers. 

Empirical tests of the model were performed using a unique data 
set containing information on the inheritance, income, and schooling 
of the surviving kin of a random sample of estates probated in the 
Cleveland, Ohio, area in 1964-65. 

The empirical results strongly confirm the equalizing role of in¬ 
heritance and the existence of substitution between human capital 
investment and inherited material wealth. The inheritance received 
by the child was found to be inversely related to the child’s income. 
Material wealth transfers are therefore “compensatory”—in that 
(other things being equal) children with low incomes receive greater 
bequests of material wealth than their better-endowed contem¬ 
poraries. Our estimates imply that, for individuals in the sample who 
received a positive inheritance (table 3, row 1), the comp>ensatory 
component of bequests reduced the variance of inherited wealth by 
30 percent and reduced the correlation between inheritance and 
income from 0.53 to 0.12. This result confirms the prediction of 
the altruism model of intergenerational transfers that an increase 
in the income of recipients leads the donor to reduce transfers in 
order to redistribute family resources in favor of his own consump¬ 
tion. The data do not support the alternative explanations of this in¬ 
verse relationship—that the receipt of an inheritance has a work- 
disincentive effect, that low-income children inherit more in return 
for the greater child services they provide for their aged parents, or 
that the observed relationship reflects the reallocation among siblings 
of shares in a given parental estate. Differences across families in 
parental education were found to result in variations in inherited 
wealth. Greater parental education is associated with more schooling 
and higher incomes of children and lower bequests of material 
wealth—results consistent with the theoretical predictions of the al¬ 
truism model. 

There is evidence that the financial constraints faced by families 
differ across households. Families who make positive material wealth 
transfers appiear less constrained financially when it comes to invest¬ 
ing in their children's human capital—and ability variables are the 
niajor determinants of children’s schooling. Conversely, families 
making zero bequests to children appear more constrained finan¬ 
cially, so that opportunity variables are the major determinants of 
children’s schooling. For both categories of households, there is evi¬ 
dence of a trade-off between family size (quantity) and per capita 
parent-child expenditures (child quality). 
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The first result is important for a number of reasons. Previous 
empirical work (Brittain 1978; Menchik 1980) has failed to find any 
evidence of compensatory bequests among siblings. In contrast, the 
present study provides .strong evidence that within the family— 
between parents and children—and among families in the same 
income/education stratum, the distribution of inherited wealth is 
equalizing. This supports the recent suggestion of Griliches (1979), 
that the family contributes le.ss to the inlergenerational transmission 
of inequality than is commonly supposed, since it operates as a force 
for equality between successive generations (parents and children), 


Appendix 


M[.A^s OF Va«iabi.f.s fob Various .Sampi.es 


Vanable 

All Sons/Daughters 
Rcfxrrting Positive 
Family income 
(N = 60.5) 

(1) 

S< >ns / Dau g h te rs 
ill col 1 Receiving 
an Inheritance 
in Excc.ss of 
the Limit 
Value 
(N = 354) 

(2) 

Sons/Daughters 
in col. 1 
Receiving 
an Inheritance 
Eiyual to the 
Limit Value 
(iV = 251) 

(3) 

INHR (SOOOs) 

4.257 

7,099 

.250 

INC:OMKR (lOOOs) 

ll.SIfl 

11 922 

10,973 

SCUR 

12.570 

12.517 

12.645 

AGKR 

42.828 

44,384 

40.633 

SF.XR {% lemale) 

49 5 

48,6 

50.2 

■NFMf'LR 

1.190 

1,184 

1.199 

Nt.ABlNt; 

29.8 

31,4 

27,5 

(% reporting the 
retcipl of nonlahor 
income) 

SINGLER (%) 

9.6 

9,0 

10.4 

DIV/SF.PR (9c) 

.3 

.3 

.4 

[NCOMF.D ($()00s) 

6.085 

5.559 

6.906 

SCHD 

9.102 

9.054 

9.171 

AGED 

70.630 

72,0.56 

68 618 

SEXD (% female) 

41.7 

52.5 

26.3 

MARRIEDD (%) 

.58.8 

39,5 

86.1 

TNKIN 

4.134 

3.791 

4.618 

OTH ERKIN 

1.020 

.743 

1,410 

RAGE (% noiiwhite) 

3.3 

2.7 

5.6 

CATHD (%) 

44.1 

44.9 

43.0 

PROTD (%) 

43.6 

41,5 

46.6 

JEWD (%) 

5.1 

6.2 

3.6 

ORIGINUS (%) 

51.6 

45.2 

60.6 

ORIGINWE (%) 

16.2 

18.1 

13.5 

VISITS 

1.443 

1.550 

1.292 

K 

- .696 

-.506 

-.965 


1.068 

1.319 

.713 
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Intraindustry Specialization and 
the Gains from Trade 


Paul R. Krugman 

Massachusetts Institute of Technology 


Several recent empirical studies of trade suggest that tnlerindustry 
specialization and trade, which reflect the conventional forces of 
comparative advantage, are also accompanied by intraindustry 
specialization, which reflects scale economies and consumers' taste 
for a diversity of products. This paper develops a simple model 
which illustrates this argument. Two main results are developed. 
First, the nature of trade depends on how similar countries are in 
their factor endowments. As countries become more similar, the 
trade between them will increasingly become intraindustry in 
character. Second, the effects of opening trade depend on its type. If 
intraindustry trade is sufficiently dominant, the advantages of ex¬ 
tending the market will outweigh the distributional effects, and the 
owners of scarce as well as of abundant factors will be better off. 


Over the years, many empirical students of international trade have 
argued that trade among the industrial countries cannot adequately 
be explained by conventional theories of comparative advantage. One 
might summarize this empirical critique by pointing to three as¬ 
pects of world trade which seem to contradict received theory. First, 
much of world trade is between countries with similar factor endow¬ 
ments. Second, a large part of trade is intraindustry in character—that 
is, it consists of two-way trade in similar products. Finally, much of the 
expansion of trade in the postwar period has taken place without 

This research was supported by a grant from the National Science Foundation. An 
earlier version of this paper, "International Trade and Income Distribution. Recon¬ 
sideration," was presented at the NBER Summer Institute in International Studies, 
Cambridge, Massachusetts, July 1979. 
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si/atjle reallocation of resources or income-distribution effects. 'I'his 
last point is particularly noticeable in the cases of the EEC and the 
North American automobile pact. 

The puipose of this paper is to formalize one possible explanation 
of these seeming paradoxes. I'he explanation is not a new one; It is 
essentially the same as that put forward by Balassa (1967), Crubel 
(1970), and Kravis (1971), among others. What this paper does is put 
the argument in terms of a foi mal model, a step which may be of 
some help in clarifying and disseminating ideas which have been “in 
the air” for .some time. 

Briefly, the argument of the.se empirical workers, a very clear ex¬ 
position of which is given by Kravis (1971), luns as follows. I'he 
conventional forces of comparative advantage operate on grou/is tif 
products (“industries”) and thus give rise to intennilustry specialization 
.ind trade. Ec<jnomie.s at' scale in protluciion, however, lead each 
country to produce only a subset of the products within each group, 
so that there is also tttlmindmlty specialization and irarle. This pro¬ 
vides a simple explanation of two of our empirical ostensible 
jiaradoxes. Countries with similar factor endowments will still trade 
because of scale economies, and their trade will he largely intraindus- 
ti y in (haiacter. Hie third seeming paiadox—the apparent painless¬ 
ness of some Hade liberalization—can also be resolved if we argue 
that incotne-distributioti effects are outweighctl by the gains from ,i 
laiger market when countries are sufficiently similar. 

While this is a sim|)lc and stiaightfbi w.iid explanation, however, it 
IS not so easy to fortnali/e. Scale economies are crucial to the argu¬ 
ment, and thc*y ate notoriously .iwkwaicl to handle In general ec)uilih- 
rium models. In this paper I follow an eat her paper (Krugman 1979) 
and use the device of Chamberlinian monoj^olisiic competition. As iti 
the earliet papet, this ptoves to be a vet y convetiietit approach, yielcl- 
itig a sitnple atid tracutble tncHlel. 'The sttucture of this mcxlel atid the 
detet tninatioti of this tnodcl’s ecpiilibi iutn iti a closed economy are set 
forth it! Sectioti 1. Section 11 shows how the paltetn of trade hcTwcett 
two coutilnes is deteimined in the model, developitig the basic re- 
1,itionship between differences in factoi endowtnents and the extent 
of intraindustry trade. Section III then examines the effects of trade 
on income distrthution and shows how the extent of inlraindustiy 
ttade detertnines whethet scarce factcirs of production gaitt or lose 
fiom ttade. Finally, .Section IV summarizes the results and discusses 
some im[)lications for theory and policy. 

It must be emphasized that the model presented here is in no sense 
a general one. In addition to making strong assumptions afxiul func¬ 
tional forms of cost and utility functions, I impose a great deal of 
symmetry on the model to simplify the analysis and give a natural 
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meaning to the concept of “similarity” in factor propcitiions. Thus the 
results of the analysis are at best suggestive. Nonetheless, they seem 
intuitively plausible and also seem to have something to do with actual 
experience. 

I. The Model in a Closed Economy 

Intraindustry trade depends on the existence of unexhausted econo¬ 
mies of scale in production. The main problem in modeling this kind 
of trade is how to handle these scale eccjiiornies, which must lead to a 
hreakdown of perfect competition (unless they are wholly extettial to 
firms). In this paper, as in an earlier paper (Krugman 1979), 1 will use 
the device of Chamheilinian monopolistic competition, basing the 
model on recent work by Dixit and Slight/ (1977). An “industry” will 
consist of a large number of firms, all producing somewhat differ¬ 
entiated products, all operating on the downward-sloping parts of 
their average cost cui ves. There will be two-way intei national trade 
within an industry hecau.se firms m different countries will produce 
different differentiated products. What prevents countries from pro¬ 
ducing a complete range of {iroducts domestically is the existence of 
fixed costs in production; thus scale economies are the basic cause of 
intiaindustty trade. 

We should note at the outset that the concept of an industry used in 
this paper is a somewh.it special one. One might w'ant to define an 
industry either as a group of products which are close substitutes on 
the supply side oi as a group of products which are close substitutes 
on the demand side. In the model of this pajier, I assume that there 
aie two groups of proditcls which fit both definitions. Products within 
each group are closet substitutes than products in dilfeient groups, 
while fattens of production are assumed mobile among products 
within each group yet immobile between groups. Ibis coinenient 
coincidence of the two |>ossif>le concepts of an indtisiiy m,iy or may 
not be empirically reasonable; it is certainly not theoretically necessary 
and should he regarded as one among main special assumptions. 

Another conceptual dilficiiliv concerns the notion of a “incyduct.” 
Ill the formulation belocv, all products seem to look alike, since they 
enter symmetrically into cost and utility functions. I bis may seem to 
iiudlye an illegitimate coin|3ai iscaii of physical (|iianiities of diffeieni 
goods. I show in the Appendix, hoyvevcr, that the formulation of 
many “identical” products c.ui be interpreted as a restriction on the 
paraineters of a model in yvhich products really do dilfer. 

Let us begin, then, with a tyvo-industry model of a closed economy. 
Each industry consists of a large number of products, .ill of yvhich 
enter symmetrically into cleniand, with the lyvo industries—industry 1 
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and industry 2—themselves playing symmetric roles. Alt individuals 
will have the convenient utility function. 

\i/« \iie 

f/ = ln^^r^.,j , (1) 

0 < 0 < I, 

where c,,, is consumption of the tth product of industry 1; is 
consumption of thejlh product of industry 2; and /V, and iVj are the 
(large) numhers of potential products in each industry. Not all poten¬ 
tial protlucls will necessarily be produced, and we will in fact assume 
that the actual numbers of products produced—n, and —while 
large, fall short of N, and 

The utility function (1) has .several useful properties. First, it en¬ 
sures that half of income will always be spent on industry I’s products. 
Seconcl, if the number of products in each industry is large, it implies 
that every producer faces a demand curve with elasticity 1/(1 — 6). 
Finally, (1) will allow us to repre.sent the gains and lo.sses from trade in 
a particularly simple way. 

On the demand side, then, an industry is assumed to consist of a 
number ol products which are imperfect substitutes for one another. 
On the supply side, however, they will be assumed to be perfect sub¬ 
stitutes. I'here will be only two factors of production, type 1 labor and 
type 2 laboi, each of which is wholly specific to an industry but 
nonspecific among products within an industry. Thus, type 1 labor 
will be used only in industry 1, type 2 only in industry 2. Within each 
industiy, the labor required to produce a particular product will 
consist of a fixed setup cost and a constant variable cost; 

/, . = O' J = 1, . . . , Ml, 

( 2 ) 

k.l = <x + j = 1, . . . , « 2 , 

where is labor u.sed in producing the tth product of industry, 1; x,,, 
is the output of that product; and so on. To go fiom these reejuired 
labor inputs to nominal costs, we must multiply by the wage rates ol 
the two types of labor, w, and w^. 

Fo close the model, we Ix’gin by noting that output of each produti, 
X, is the sum (tl individual consumptions of the product. At the same 
lime, total employment in each industry is the sum of employmem m 
jirodncing all the individual prcKlvuls. Assuming full employment. 
have 
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^/,.,=L, = 2-z 

I-l 


"2 



J=1 


0 < z < 1. 


(3) 


Thus the total labor force is set equal to 2, with the parameter z 
measuring factor proportions. As we will see below, z will assume 
crucial significance in determining the importance of intraindustry 
trade and the effect of trade on income distribution. 

We are now prepared to examine the determination of equilibrium 
in this model. This involves determining how many products are 
actually produced in each industry, the output of each product, the 
prices of products, and the relative wages of the two kinds of labor. 
We should note at the outset that it is indeterminate which products 
are produced—but it is also unimportant. 

Our first step is to determine the pricing policy of firms. We assume 
that producers can always costlessly differentiate their products. This 
means that each product will be produced by only one firm. If there 
are many products the elasticity of demand for each product will, as 
already noted, be 1/(1 — 6). (This is proved in the Appendix.) Thus, 
each fit in will face a demand curve of constant elasticity. We then 
have the fatniliar result that the profit-maximizing price will be mar¬ 
ginal cost plus a constant percentage markup: 


Pt = 0 

p2 = 6~'I3u’2, 

where pi and p.^ are the prices of any products in industrv 1 and 2, 
respectively, which are actually produced. 

Given the pricing policy of firms, actual profits depend on sales: 


rr, =p^, - {a + 

TTi = p/Ci — {a + )3v.i)u’2. 

whereX, and are sales of rcpre.sentative firms in the two industries. 

But in this model there will be free entiy of firms, driving each 
industry to Chamberlin’s "tangency solution" where profits arc zero. 
Thus we can use the condition of zeio profits in eejuilibrium to 
determine the equilibrium size mul number of firms. .Settirijr tr, = tt^ 
- 0 and using (4) and {fi), we have 


a 0 

—. . y - -- 


.V, -- vj -- 
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foi' the si/e of firrtis. 'The number of firms can then be determined 
from the fiifl-employment condition; 

Hi = (2 - z)/(a + I3x,), 

«2 = z/(a + pX2). 

I fie final step in determining equililjiium is to tletertnine relative 
wages, rtiis can be done very simply by noting that tlie industries 
receive equal shares of expenditure and that, since profits are zero in 
equilibrium, these receipts go entirely to the wages of the industry- 
specific labor fot ces. So w,]^, = implying 

w,lw 2 = z/(2 — z). (8) 

We now have a completely worked out etpiilibi ium for a two-sector, 
monopolistically competitive economy. It is indeteiminate which of 
the range of potential products within each industry are actually 
[>ioduced, but since all products appear symmetiically, this is of no 
welfare significance. The charatter of the etotiomy is detertnined by 
the two pat amc-ters z and 0. The value ofz determines l elative wages; 
If z is Ic.iw, type 2 lalvor will receive much higher wages than type 1 
lalroi. I'hc value of 6 mcasuics the degree of substitutability among 
pioducls withiti an industry. The Icrwer is 0, the more diffetentiated 
ate jrrodiKts, and the more impottant are unexploited scale econo¬ 
mies. From (4) we have 0 = fitoilpi - fiw-Jpz. But /3a/, and (iw-i are the 
m;itginal costs erf preaductiern, while in equilibrium price equals aver¬ 
age cost. Thus 0 is the ratio of marginal to aveiage cost (which is also 
the elasticity of cost witli respect to output). 

II. Factor Proportion.s and the Pattern of Trade 

In the last section we saw how et|uilibrium can be detcrininetl in a 
simple closed-econoinv model with scale economies and difler- 
entiated products. We can trow ex.imine what happens when two such 
economies trade. What we are |)rincipally concerned with is the prop¬ 
osition. atlv;tnced in the intiviductioii, that countiies with similar fac¬ 
tor eiiflowments will engage in intraindustry trade, while countries 
walh veiy fliffeient endowments will engage in Heckscher-Ohlm 
It ade. 

As a first step we neetl a working measure of the extent of intrain- 
dusiiy trade. I’he empirical literature on intraindustry trade (e.g., 
Hufbauer and (ihilas 1974; (iiubel and I.loyd 1975) generally con¬ 
centrates on an index of tiade overlap, that is. 
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wliereX^ is a country’s exports in industry A and Af* is imports in tiiat 
industry. This index has the property that, if trade is Ixilanccd indus¬ 
try by industry, it equals one, while if there is complete international 
specialization so that every industry is either an export or import 
industry, it equals zero. As we will see, this index fits in quite well with 
the model oi' this paper. 

riie other concept we need to make operational is that of sirnilanty 
in factor endowments. In general, this is not well defined. What I will 
do in this paper, however, is consider a special case in which the 
concept does have a natural tneaning without trying to arrive at a 
general definition. 

l.et us suppose, then, that there are two countries, the home coun¬ 
try and the foreign country. The home country will he just as de¬ 
scribed in Section I. I'he foreign country will be identical except for 
one thing: The relative sizes of the two indiistiies’ labor forces will be 
reversed. That is, the foreign counirv will be a mirror image of the 
home country. If we use <t star on a vai iable to indie ate that it refers to 
the foreign countty, we have 


Lf = 2 - z L-i = z 
Lt =2 Af = 2 - z. 


( 10 ) 


(Obviously, given this pattern of etidowments, we can regard z .is an 
itide.x of similarity in factor proportions. If z = 1, the countries have 
identical endowments. As z gets smaller, the factor proportions be¬ 
come increasinglv dilTerent. 

The miiror-image assumption can be given a geometric interpreta¬ 
tion. In figure 1, an Edgeworth box is vised to represent the interna¬ 
tional distribution of piochictive resources. The oi igiii O is used to 
measure home countiy endowments, O* to measure foreign endow¬ 
ments. The two di.tgon.ils of the Ixix can then be given economic 
interpretations; ()0* is a line along which f.ictor proportions ai e equal 
in the two countries, while the olhei diagon.tl is .t line .ilong whit h the 
countries are of ecjual economic size. I tie mii tor-image assumption is 
saying that the endowment point E lies on this diagonal. The 
p;uameterz then determines tlie position of A'; asz goes from 0 to 1. A 
moves ftom the coiner to the center of the box. 

Suppose, now, that these cotmtries .ire able to trade at zeio tr.ins- 
poi taiion cost. As belore, we can determine pricing hehavioi, the size 
and number of films, and relative w.iges. In .iddilion, we can deter¬ 
mine the volitme .ind pattern of tr.ide. 

Ehe first point to note is that the elasticitv of demand for any 
particular prcxluct is still 1/(1 - #). lliis gives us price ecju.itions 
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exactly the same as before; 


/>! = 

pi = e-'pWi, 

pt = 0~*fiwf, 


( 11 ) 


= 9 '/3wf. 

Now, however, the symmetry of the setup insures that all wages will 
be equal, both across intlusiries and internationally; 


re, = tef = w-i = wf. (12) 

1 he 7ero-proht crtndiiion will determine the equilibrium si^e of 
firm, X, which will be (he same for both intlustries in both countries: 


X ^ ae/fi(\ ~ 9). ( 13 ) 

Finally, lull employment determines the number of firms in each 
imlustiy in each country: 

«, = = (2 - z)/(a + /Jx), 

Uj = nf = z/(a + y3.x). 

What these results show is that trade will lead to factor price 
equalization while leaving the pattern of production unchanged. Our 
remaining task is to determine the volume and pattern of trade. We 
can do this by noting two points. First, everyone will devote equal 
shares of expenditure to the two industries. Second, everyone will 
spend an equal amount on each of the products within an industry. 
This means that the share of all individuals’ income falling on, say 
industry 1 products produced in the foreign country is V 2 • [nf/(n, + 
zif)]—that is, the industry share in expenditure times that country’; 
share of the industry. But the number of products is proportional u 
the labor force. Thus, if we let Y be the home country’s incom* 
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(equal to the foreign country’s), X, be exports of industry I products, 
X 2 be exports of industry 2 products, M, be imports of industry 1 
products, and Afj be imports of industry 2 products, we have 

X, = V 2 Y ■ 1(2 - z)l2], 

X 2 = ViY ■ (il2), 

( 15 ) 

M.^ViY ■ (z/2), 

M 2 = ‘/zy ■ 1(2 - z)/2]. 

Now, the relations (15) have two important implications. First, 
consider the volume of trade. l otal home country exports are X, + Xj 
= '/zK. Thus the ratio of trade to income is independent of z, the 
index of similarity in factor proportions. This can be regarded as an 
answer to the first ostensible empirical paradox mentioned in the 
introduction—the large volume of trade among similar countries. In 
this model, similar countries will trade just as much as dissimilar 
countries. 

The second seeming empirical paradox was the prevalence, in 
trade among similar countries, of two-way trade in similar products. 
If we substitute (15) into our expression for intraindustiy tiade (9), 
we get a simple, striking result: 

/ = z. (16) 

'/'he index of intraindustry trade equaLt the index of '>imilnrity in factor 
proportions. 

These results may appear to depend criit tally on the assumptions of 
this model, but in qualitative terms they can survive a gotxl deal of 
genet ali/alion. The persistetice of tratic between count ties with simi¬ 
lar factor endowtnents will occur in almost any model with economies 
of scale. The relationship between similarity of countries and the 
extent of intraindusti y trade can be shown to hold, for an appropriate 
definition of similarity, in a much more general model and has also 
been noted in a quite different context by Elhier (1979). Insofar as 
these insights are conceined, the virtue of this model is not m the 
dif ference of its conclusions but in the clarity with which they emerge. 

Where the special assumptions of this model become pai ticulai K 
useful, however, is in attempting to deal with the welfare conse- 
(luences of trade. These consequences are considered in the next 
section. 

III. Gains and Losses from Trade 

In this section we must again begin by delineating a concept which 1 
have been using loo.sely. This is the idea of the ‘‘seriousness" of 
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flistribution problems. VVhal we need is a clear way of formulating the 
notion that distribution problems from opening trade will not be 
serious, if countries are sufficiently similar in factor proportions that 
ihe trade which re.suh.s is primarily intraindustry trade. 

d'he criterion I will use to define nonscrious cli.slribution problems 
is the following; Distribution problems arising from trade will be held 
not to be serious if both factors gam from trade. This, of course, begs 
some questions, since there may be difficulties in getting groups to 
accept a relative decline in income even if they are absolutely better 
off. But this criterion is fairly reasonable and turns out to give- 
suggestive results. 

To find <nii whether factors gain from trade, we need to know how 
\itility depends on the vatiables of the model. .Siip]>ose an individual 
receivt's a wage ti' and has the utility function (1). He will theti spend 
<('/2 on the products of each industry atid divide his expenditure 
equally among the products within an industry. Thus his utility will 
tlepetid on his wage, the prices of tepie.settt.uise products in eac h 
inclustit, and the nuinber of products available; 

LI - In f- In 

“ 2 In 2 t- In tc//;, + In wtp., + -In tt, + —- ■ In 

file function (17) li.is the convenient property that all the effects 
enter atfditively. Utility cfepends on real wages in tet ms of represen¬ 
tative pioducts and on diversity. 

To analv/c the effects of tr;tde on welfare, it is useful to introduce 
some; tnoie tiotation; 

L\, tl., = utility of woikers in industries 1 and 2; 

.-e,,,;/',., - rc-al wage of industiy 1 workers in terms of products of 
industries 1 and 2; 

a'ii,'c,.^ = real wage of industiy 2 worket s in teims of products of 
ittdustiies f attd 2. 

fhen we can substitute into (17) to get (sujipressing the constant 
tet in); 

IJ, --- In le,, + In Wyi + -In + * „ - 

1 0 , ' 0 1 

In a',, * Inie,, + — - —lti«, * --^p-htn,. 

are now in .1 |h isit ion to ineasuie the well.tt e effc( t' ^ 

Sop[x.>-<- we Mall (iimii .1 fxiMiton of .illt.llkv. as 111 Sei ll'ia f 
move to Iter iiaifc, as m Section (i Ihrir will ilieii be nie li "!' 
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effects. First, there will be a distribution effect as fat tor prices are 
equalized. As one can easily verify, labor’s real waj^e remains the same 
in terms of the products of its own industry while rising; or falling in 
terms of the other industry’s products, depending on whether the 
factor is abundant or scarce. Thus, in the home country this effect 
benefits labor in industry 1 and luirts labor in intiustry 2. 

The second effect comes from the increase in the si/e of the market, 
which makes a greater variety of products available. I'his works to 
everyone’s benefit. 

Since both effects work in its favor, the abundant factor must be 
made better off. This leaves us with the problem of determining the 
change in utility of the scarce facioi'—indusiiy 2 labor in the home 
country and the syrnmettically placed industry 1 labor in the- foreign 
country. 

l.et a prime on a variable indicate its free-iiade value while un¬ 
marked variables refer to autarky, rhen, as we move from the au¬ 
tarky solution in Section I to the fiee-trade solniion in Section II, the 
chiinge in C/.j is 

U'l - Ui - In -t- ' ^ In n\/n, + ^ In 

U “ 

(19) 

= In z/{2 - z} + 2/(2 - 2) + 2 / 2 , 

where the first term is negative and represents the distribution loss; 
the remaining terms are positive and represent the gains from being 
part of a larger maiket. The cjiiesiion is under what conditions ihese 
terms will outweigh the first teinis. 

By collet titig terms, we can rewiitc (19) as 

f// - U, - - ^ ^ - ^ 2 - 2 -f- - In 2, (20) 

this gives us one immedi.ue lesiilt: If 9 < O.a. the scarce f.ictoi 
necessarily g.iins from tiade, since the Inst term will be positive ,ind 
the third tei m will outweigh the second, Kecall ih.ii 9 is a mc-asure of 
the substitutability of products within an industiv. What this result 
llien says is that // produds ore \u//inenlly tlil/ciftituiti'd, both g<iin 

jiotn iKttle. 

If ff > 0.5. whclhev lH)ih fM tors g.n/t (k-pciid.s on ihc extent to 
.. vthiih trade is inir.iindiisii v in tli.ii.itlei, which in uun dcjX'iuis on 
; wwA// //u’fo/u////rs .//r/n /.!(foi Wlund ■ o T. ,/,<• 

j finit tion (20) h.is (In ee pi o|>ei iles ( 1 1 .is ; .ippi o.ii be s J, I . I . !.;<k s 
j ''' 1(2 - 2fr)/fl| III 2 ' 0; (ii) .IS c goes to 'f'l o. f j f _• goc' t<> minus 

I '’’htiity; and (in) f.'i -- fis stru liv int le.ismg in I Inis, if cm-» ere lo 
[ k' -ipli (20), jt Would IcMik like hgiii e 2. I heie is .i i i me .il c .due <>/ .. 



970 


JOURNAL OF POLITICAL ECONOMY 



for which U’^ — — 0. If z '> z, both factors gain; if z < z, the scarce 

(actor loses. But z is our measure of similarity in factor proportions, 
rhus what wc have shown is that tf countries huve sufficiently similar 
factor endowments, both factors gain from trade. 

What is paiticularly nice about this result is that we have already 
seen that there is a one-for-one relationship l>elween similarity of 
(actor endowments and intraindustry trade. So this result can be 
taken as a vindication of the arguments of such authors as Kravis 
(1971) and Hutbauer and Chilas (1974) that intraintlustry trade poses 
fewer adjustment pioblems than interindustry tiade. 

We should note, however, that the critical value of interindustry 
ttade depends on the substitutability of products. The f unction (20) is 
(let t easing in 6-. diU^ — lnz(2 — z) < 0. So an increase in 0 

w'ill shift the (unction down. 7'his will increase z. The less differentiated 
are products, the more stmilnr countries must be if both factors are to gain from 
trade. In the limit, as 6 goes to 1, so doesT. 

I he results of this section are siiminarizcd in figure 3. On the axe.s 
aie the two parameleis 0 and z, both capable of taking on values 
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between zero and one. What we have shown is that the qualitative 
effects of trade depend on where we are in the unit square. In the 
southeastern part of the square—labeled “conflict of interest”—either 
.scale economies are unimportant or countries are very different in 
factor endowments, and scarce factors lo.se from trade. In the othei 
region—“mutual benefit”—the gains from intiaindustry specializa¬ 
tion outweigh the conventional distributional effects, and everyone 
gains from trade. 

IV. Summary and Conclusions 

fhis paper began with three “paradoxes” about international trade. 
•Since they do not seem paradoxical in the light f)f this model, perhaps 
we should state them as “stylized facts”: (i) Much of world trade is 
Ijetween countries with similar factor endowments. (li) fhe trade 
between similar countries is largely intraindustry in character; that is, 
it consists of two-way trade in similar products, (lii) The growth of 
intraindustry trade has not posed sciious income-distribution prob¬ 
lems. 

This paper offers a simple model which formalizes one possible 
explanation of these stylized facts. According to this view, the variety 
of products produced in any one country is limited bv the existence c.)l 
scale economies in production. Thus similar countries have an incen¬ 
tive to trade; their (rade will typically be in products produceil with 
similar factor proportions; and this trade will not involve the 
income-disii ibution effects characteristic of more conventional trade. 

In addition to helping make sense of .some puzzling empirical 
results, this paper is. 1 hope, of some interest from the standpoint of 
pure theory. The model dispenses with the ivvo most lundament.il 
assumptions «)f standard trade theory: perfect compeiiiion and con¬ 
stant returns to scale. Instead, I have tleall in this paper with a woild 
in which economies of scale are pervasive, and all linns have 
monopoly power. While the model depends on exiremelv lesirictive 
assumptions, it does show that it is possible for trade iheoi v to make at 
least some progress into this virtu.div unexplorefl teiiitoiv. 


Appendix 

I. The Concept of o Product 

In the formulation in .Section I. an indiisli v was assiiined to consist ol nianv 
products w'ilh the ".same” cost function anti entering in the "same" w,t\ into 
uulity. This may seem to involve a comparison of .tpples and oranges. How¬ 
ever, it can be justified as a restriction on the p.nameleis of a moie general 
model. 
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C’.oii*.i(lci thf utility ;in<l cost fiiiutioiis foi a one-industry model (the 
generalisation to two industries is obvious): 


U 


(S,C,)« .O<0<1, 


(Al) 


1 1 = «( + ) 3 (X,, 1 = 1. n. 


(A2) 


Heie we allow' goods to enter w/iih different weights into utility and to have 
diiteicnt cost liintlions; iliiis no assuinpiion is made about comparability of 
units, (iiveii (eil<iin leslrit lions on paiaineleis, however, it is possible to 
choose units so that a foi inulatton where all pioducis appear idenlital is valid. 
Let us suppose first that «( = « for all i. I he measurement of this cost is 
independent of the- choice of units, so this is a meaningful assum|)tion. Let us 
.ilso assume / 3 |/ 8 , =- fi foi all i. I'liis again does not deperul on units of 
measuieinent; lueasuiing pimkict 27 in hale lies of 10 instead of individual 
units will increase both (i.,-, aiul 8^7 Ity a l.icloi of 10 and leave the ratio 
u lu ha uged 

11 the ,>ssumpiions .ibout p.irameiers .ire granted — .ind they .ne special 
.1 ssu111ptioils, not genei .il pi opeiiics—-we i an jiisiif v the model in the text by .i 
(hoite ol units l et f.'i ~ lor all i. I lu'ii the iililiiv and lost inmiions 

become 


V - 



/, - (It + /J.V,, I - I, 


■ . n 


(a:1 ) 

(tX-l) 


II. Elnsluif^ III Dftnaiui for Ituiivulual I’todxuis 


The .in.ibsis in .Setlion I depends on the lesnli ih.il the elasliiilv ol dem.mil 
tor atn pat lit ul.ir pi odiict is l/( 1 0 ) 1 his Appendix gi\es ,i deinoiisli.iiion 

ol this. 

C’oiisider .in individu.il ni.iximi/ing his uliliu lunclion (1) subjed to .i 
htidgel tonsliainl. Ihe litsi-oidei (oiidilions lioiii th.ii iiiaMim/.ilion will 
have the toiin 


/h., 






AimCS.rn 


, 11, 


. 'Ij, 


wheie \ IS the sh.idow [>nce on the budget (oiisliaiiil. lh.it is, ihe m.ii giii.il 
nlililv of iiiconie. 

II tlieie .iic’ many pioduils, howcnei. the Inin prodiicmg a p.iiticiil.n 
product can t.ike the deriomin.itois <d these expic'ssions ,is given. I bus e.K li 
individu.irs demaiid (or a p.iilKul.ii (uodcKt. .tnd iheietoie .ilso m.iikd 
dem.ind, will have el.isiKiiv 1/(1 0 ). 
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The Adjustment of Consumption to Changing 
Expectations about Future Income 


Marjorie A. Flavin 

Umvnsity of yirgiina 


rtic paper aiialy7cs the role of curreni income in providing new 
inlotIllation aljuut (nture ineoiiie and thus signaling changes in 
periiiancm intonie. Using linie-scries analysis to c|uaniily the revi¬ 
sion in peinianent income induced by an innovation in the tuiTent 
inioine process, a siruciural econometric model of consumption is 
developed. The rejec tion of the joint i alional expeelations- 
petmaneiif income hypothe.sis is both statistically and <|uantiiatively 
significant. I'he paper also shows that the test of the rational 
expec tatiotis-permanetii income hypothesis proposed by Hall is 
based on the reduced form of this structural model and reconciles 
•Sat gent's consumptioti paper w-ith Hall's. 


Because the path of fttturc incotne is unceriaitj, the intlividual’s pei- 
inanent iiiconie anti, therefore, his consumption plans will be revi.sed 
[set iotl by |ieri()d as new inforinalion alxmt future income becomes 
available, liiuler ifie assumption that intlividutils hold rational ex¬ 
pectations of future income, the stocha.stic properties of these 
pet iod-by-period revisions in permanent incoitie can be exploited in 
order to formulate testable restrictions on the joint time-series be¬ 
havior c^f consumption antf other observaltle variables. 

1 his paper first reviews the econoinetric tests cif the permanent 
income-rational expectations tiypoihesis proposed by Hall (1978) and 
■Sargent (1978), then develops and executes an alternative approach 
to testing the hypothesis. Section 1 formally defines permanent in- 

1 his paper is part o( iiiy diKloial chsscitation at Mi l . 1 am gralcfui to Robert H.iH 
lot guidance, and to Andrew Alx-I, .Slaiilcy Kischer, Robert Klcxxl, Peter (larber, Rolx ii 
Lucas, Bennett Mct^alliiin, jetfrey Sachs. Keimelh Singleton, Richaid Start?, and 
several rcteiees for helphil suggestions. 

{Journal of Poinu-oi t-nmomy, IKHl, v«*( 8**. ii« 
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come and deiives the basic implication of the permanent inojtnc— 
rational expectations hypothesis: If consumption is proportional to 
permanent income in each period, and permanent income, in turn, 
represents the best estimate, given currently available infoi niation, f)t 
the individual s lifetime resources, then current consurnpticjn should 
diverge from consumption in the previous period by the amount ol 
the contemporaneous revision in permanent income. 

The published papers of Hall (1978) and Sargent (1978) both 
analyze this predicted relationship between consumption, laggerl con¬ 
sumption, and the revision in permanent income, conditional on 
lagged information sets.' The two papers reach conflicting empirical 
conclusions. Hall conhrming and .Sargent decisively rejecting the 
permanent-income hypothesis. Section H argues that an incorrect 
definition of permanent income used in .Sargent's formulation of the 
restrictions is the source of the discrepancy and shows that Sargent's 
analysis, once the dehnition of permanent income is corrected, leads 
to the same parameter restrictions tested by Hall. 

The restrictions tested by Hall and Sargent apply to regressions in 
which all of the l ight-hand-side variables are lagged at least 1 periofl 
with respect to the dependent variable, consumption. By analyzing 
the expectation conditional on lagged inf<si inaiion sets, their ap¬ 
proach makes use of the propeity that the revision in peimancnt 
income should be uricorrelaied with lagged variables. The Hall- 
.Sargent tests can be thought of as tests basetl on a reduced-form 
consumption equation, since a reduced foi m expresses the expecta¬ 
tion of an endogenous variable, conditional on predetermined vari¬ 
ables. 

This paper takes an alternative approach to testing the permanent 
iiit'oine-rational cxpecTations hypothesis by explicitly speiifving a 
structural model of consumption. The model developed in Section Ill 
analyzes the role of current income in pioviding new information 
about future income and thus signaling changes in permanent in¬ 
come. I'he model is intended to underscore the pcmit th.ii the ob¬ 
served sensitivity of consumption to current income should not be 
interpreted simply as the behavioral marginal piopensuv to consume 

' Bilson and taassin.in (l‘I79) loiiMiler soiiu- eviensioii', on ihc mo»tcl whuh I 
(levflop in tins paper, Bilsoii (1979) repoi ts loinpai.nivo csiiin.iies ol ilie nuxtel applied 
lo data toi che tinned .Slates, L'lnicd Kingdom, and tk'im.inv B.tiio (19781 has also 
•iddressed ihe ipiesiion of nhetlier ceilain vaiialiles alien toiisiiinptioii oiiK ihioiigh 
iheir lole in piediniiig future mcoiiie- Rcieni einpiiH.il noik on <onsmnption hv 
Bliiidci (1981) iiu()i|><)iales expliiii nealineiu of ihe expetiaiiinis ol liiiiiie iriioine in 
the perniancnt-incnnie iinxlcl Blinder's .inalvsis roneeiniales on nie.isiiring ihe ic- 
s|X)nse of consumer expenduiii c lo leiinjoi ai v < liaiiges in i.ises i .iihei ih.iii lesiing die 
pci inanent-incoinc Inpisdiesis iisell, Havashi (1979) also lesls die [Hnni.meni-imonie 
livpolhcsis and finds diat tonsiiinplion is more seiisiiisc lo tiiiieni inioine th.ni pre¬ 
dicted by the hypodiesis. 
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out ot current income. Becau-se income is a fairly highly serially 
correlatefl process, the lluctuaiions in current income will be corre¬ 
lated with fluctuations in permanent income. Formally, the model 
uses an autoiegressive-moving average (ARMA) representation of 
the income lime series in order to quantify the magnitude of the 
revision in permanent income implied by the contemporaneous ob¬ 
servation of tiirreiu income. I'he forecast error, or innovation, in the 
income lime-series process lepresents the new infbimation contained 
in the cui rent observation on income. Based on the reali/.ation of the 
cut rent itinovaiion in the intonte process, individuals tevise their 
expect,itions about future income. I'he magnitude t>f' the revisions in 
expectations ot future income in res|)onse to the realization of the 
eurient innovation depemfs on llie paiameiers of the ARMA rejne- 
sent.ttion of the income process, t herefore, the revision in perina- 
uenl income in a given peiiod will be propoiiional to the innovation 
in cuirent income, with the f.iclor ot proporiionaliiy determined by 
the p.uameteis of the income time-series process. Following this ap¬ 
proach, one tan specitv ,i structural et|uaiion rel.iting the change in 
consumption to the conieinporaneous revision in permanent income 
(modeled using the income innovation) and the change in cuireiit 
income. I'he coefficient of the change in iiicome is a ineasuie of the 
hi-h<icioial m.ugin.tl piopensiiv to consume out ol ciureni income, 
since ihe role of cuiient income in signaling th.inges in perm.mcnt 
I ri< ome li.is fiec'ii explic it 1 \ modelc'd. \ lu" pei m.uient ini ome- \ .Uion.il 
expel l.uions h\pi it hesis i an then Ih‘ tested bs testing sshelliei the 
m.ii gin.il pi < ipeiisil i (<»I onsiime I nil of i iii i cml iiu ome is shiiii/k .na/i 
dilfiiim fioiii ziiii If.isnig developed .1 lesi of llie piini.iiidl 
inioine i.nion.d expei t.iuoox lispothevis in the lonlexi ol i miii|) 1 i 
M l III IIII ,il ei onomell i< s\ stem, it is e.isv lo show 1 h.ill lie 11 ill S.n m 
lest Is the e(|Ui\ .del 11 li si f 1,1 si-if on die 1 edili ed |o| n 1 , >| 1 he ,i,, ,1 (1 
die Ml 111 tor ,il SV S cm (s |UM idenrdied, the lesi I i.ised oiii lie si 1 m ; iii, 1 
svsii m .mil die fl.ill N.ii gent lesi on dii leihiiid (oi ,11 >iiM 

Mumei II ,illv ldenlii.il v.ilues of die lesi si.nisiii (oi die pii" '■ 
inioine iallon,il exjcec i.nions fivpodiesis Wlule iln siiiiii 1 i p 
pioai h to lestiiig the In poihcsis is iioi si.iiisi n .1 llv nioie jji >10 1 ! il 1 i 
die 1 1 'l 111 ! er I f ol in .ippio.nii, 11 is 1 1 nisii lei .il <1 v nmii illn 1 le'o : 
siiii e II ^H'limls die leioveiv of die striiilin.ii p.ii iiiieii 1- 
SV stem 

I fie einpir ic ,il 1 esnhs incite ,iie ifi.ii die ohsei v ed sen site, 1 ' ■ 

MiiiiplioM to (iirit'in tiiccMne is gic-alei ih.iii is w.ni.niied I ' ' ' 
iminent inc otiic -life < >c le fiv ^Kitfiesis. eccTi w lien die 1 oli ot ' H ' 
iiifoinc- in signaling cfiaiiges m periiiaiicTii income n i.Ai e ' 
coiini I hr resliiciions irnplieci l>v tlie [K'I iii.inc-nt niion'i • ' ' ' 
rxjx'c latioiis hv|Kiific>is can lx- irjrcitcl si.nisiu ,illv .u o' 1 , 
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confidence levels. Further, the estimates of the marginal propensity to 
consume out of current income arc tjuite huge, indicating that die 
failure of the pet manent-income hypodiesis is quantitatively, as well 
as statistically, significant. 


I 


In its simplest foriii, the permanent-income hypothesis holds that 
individuals ought to gear their consumption to their lifetime re¬ 
sources rather than to their measured income in some arbitrary time 
period. To start with a particularly simple case, consider the con¬ 
sumption behavior ol an infinitely livc'cl individual in a world in which 
the retuin to teal wealth is certain and constant o\er time. I he total 
resources available to t bc‘ individual for cui rent and I utiii e ton sump¬ 
tion consist cif tut lent real wealth, ay, plus current and future labor 

income, x,+,, s — 0, 1,2.Labor income is assumed to be 

exogenously delei mined. 1 be individual's lifetime lesourtes can be 
represented in eithei stock or How form—the stock form being net 
wortfi, or the sum of real wealth and the present discounted value of 
current and future labor income, and the How form lieing permanent 
income, or tfie .miutity value of net worth. 

.Since i/ie path of future income is not known with cert.iinlv. the 
itulhidu.il imist iihikc his consn/nplion phnis on the hisis ot\uine set 

of espect.itioMs .iboui luiiiie income (ove n t/ie es/iec i,u;o/o .ihmii 
(iitiue iticonie field in pel lod f. die- iiidiMdii.ifs /leim.meni income 
(.111 be exjiiessed ,is 

if -ijrc, + “I /■('■(■'! <•' 

111 tilis discrete time lot mul.itioii. ay i> piesenis n.il \cimIiIi .u ifie 
iieginning ol peiiod /. v, is .issumed to be p.iid .it du end •>! du 
ix'iiod; .md the I cm! late ol ictuiii, i. is a-sumed coiol.ml 

IVi m.ment inc oine c .in be ihoiighi of .is the c oust.mi i csom cc ovv 
"hich, conclilion.il on evpections m peii.Kl t. c.iii be siisiaincd lot die 
U Miaindei of the individu.ifs time hon/on 1 bus. permanent income 
oiiglii to h.ive the juopeitv 


lo verifv that permanent income, as specified in ecpi.tlion (1). 
Hiis property, note th.il leal wealth will evolve iccoic mg to 

IC'Ml "' (• ' ^ 

'I die individual's c onsiimpnot' 'tv identic ally equ.il to his pi i ma 
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dis( <)iJnicd fuuirc inionu* is noi the appropriate definition of perma¬ 
nent income—is fn)t affected by relaxing the restriction that/f = f. In 
order to accoininodate Sargent’s more general consumption function, 
the analysis of Section I will be generalized to allow for an unre¬ 
stricted MPC. 

Based on the definition of permanent income as the annuity value 
of net worth, equation (1), antf llie consuirtption function with unre- 
stritted MPC, , will be related to c, as follows; 

V' / I ^ * 

c„, = fl + t(l -pn: (A'.m(15) 

~ (1 f r)u, + //,M- 

If/i / 1, consumption will he a raiulom walk with drift instead of a 
simple landom walk, faking the expectation in peiiod I — 1 gives; 

c,+ , = [1 r{l - - (1 -t- r)E,-^it, -I- (16) 

In comparing equation (16) with Stirgent's equation tor E,.,,c,+ ,, (14), 
remember that 1/a = 1 (- r. 

1 he pre.setKc of the tcrin/f(l — a)A',_,y( in Sargent's etjitation for 
ir,,n is ati artifact of lus definition of pet tnanenl iriti.)me as average 
(imtie iticotne. .Sargent’s income series, y,, is disposai^lc income and 
therefore includes futttre nonlahoi incotne as well as future hihor 
income. Moweset-, the present tliscounterl value of the future path of 
nonlahor ititome is not etjual to current nonhuman wealtli. Consiilci 
the tlefniifton ol [)et inanettt income as the antniity value of net worth; 

vf - r w, -h E,x,^., . (17) 

where is future lahoi earnings. 1 he imnhuman we.ilth term, rr,, 
totdd he expiessed as the present value of a fiitun- stream of earnings 
i.itlier than as a cutient stock. Nonhnman wealth w,, if niaint<uned at 
that level forever, would yield a stream of nontaboi'income ol nc, each 
periofl. rhus. 



Note, however, that the nonlahor income in the present value term is 
ru'i, the future nonlahor income the individual would receive eaih 
petiod if nonhuman wealth were maintained at uy. Kxpanding equa- 
tioti (18) to isolate actual nonlabor income in each future |Deiiod, 
gives 
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Kqiiatiori (19) distinguishes between two conceptually different future 
revenue Hows associated with re,. The secejud tei m is simply the 
present value of future nonlabor earnings, which ot cc)urse depends 
on llie path cjf nonhuman wealth over time. The first term reflects 
receipts from the sale <jf the wealth’s principal and payments to 
increase the principal. By defining permanent income as discejunted 
future income, Sargent includes the second teim but omits the first. 
Since the individual will systematically accumulate and decumulate 
nonhuman wealth over time in order to smejoth his planned con¬ 
sumption path, the first term in e(|Uation (19) will not be identically 
zero rir even constant over time. It is the omission ot the term Sj'Lq 
[ 1/(1 + r)]*^'r(Tc', — ?(»,,«) from his definitic)n of permanent income 
which is responsible for the disc repancy between .Sargent's version c>f 
the restrictions im|iliecl by the permanent-income hvpothesis (ec). 
[14]) and the coi rectcd version of the resliiclions (ec). [161)- 

Stirgent’s proceduic is to impose the restrictions across the 
parameters of thet vec tot autoregression of (c,. y,) implied by equa¬ 
tion (14). I he unrestricted autoiegiession ot (c,, yy) is: 

/'-'( -lO = c-,y,_, C2y,..2 f . . . -t- 

+ c/,r,-, + c/^c, 2 -I- . . . t- t20) 


and 

i>'r = «!>'(-1 + cCjV, .2 + . . . + ct„\,^„ 

+ byC, ^ + b 2<,-2 + ■ ■ ■ + b„r, „ (21) 

■Since /f,. iCm 1 -- + <<yt 1 + . • + c„y, ,, 4 , t- d,!--, ,c, 

+ d,(', I + . . . f c/„c, substituting out lot E, ,y, and E, |f, 
yields an ecjualion lor f.', iC,,, m terms oi y, , and r, ,, t - 1 , n. and 
then,, b,, r,.d, ittiiameters. Substituting these aiitoregressiye lepiesen- 
lalioits tor/y, a,, E,-,y,, and E, ,<-,n into ec|uaiion (14) giyes .i set ot 
leslrictions acioss the u,. b,. r,, d, patamelers, 1 hese resiiictioits .ne 
tested by estimating eciuations (20) and (21) ysith and yMihout the 
c ross-eqti.uion rest! ic tions imposed and computing a liWelihood ratio 
statistic. 

following .Sargent's procedure, but substituting the auioi egressiy e 
representations of /tf iff and ,r, 4 , into etjuation (lb), the "cor- 
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reeled” version of equation (14). provides the following set of resirie- 
tions on the parameters of the (f<, y,) vector autoregression: 

c,a, + Cj + d,c, = [I + r(l — B)]c, 

1 + c„ + d,c„.., = [1 + r(l - 

+ diC„ = [1 + r(l - (22) 

f,b, +t/i = [l +r(l 

c,b„^, + d,d„., + d, = [1 + r(l - B)]d„^, 
c,/>„ + d,d„ = [1 + r(l - B)]dn. 

This modified set of restrictions could be tested, following Sargent’s 
procedure, with a likelihood ratio test. However, rather than test the 
restrictions in their present form. 1 will argue tliat the time-series 
implications of the rational-expectations and permanent-income 
hypotheses can be stated and tested in a simpler form. 

The rcstiictions in e<{uation (22) are based on the projection of 
eciuation {15) on information known at time / — 1. Projec ting equation 
(15) on information known at time t instead gives 

fi'K'oi = [1 + r(l -B)K - (I + + A>(+,. (23) 

which is the time-series restriction tested in Hall’s paper. Hall assumes 
that transitory consumption, u„ is identically zero. Fherefore, as for¬ 
mulated by Hall, the testable hypothesis implied by (23) is that other 
variables dated I — 1 or earlier have no additional explanatory value 
in predicting r, if f(., is included in the regression. That is, equation 
(23) implies that 

d, = [I + r(I - B)]. 

= 0, t = 2, . . . , n (24) 

c, == (), t = \, ... ,n 

in the consumption autoregression (ccj. [20]), fhe coefficient of 
lagged consumption, d,. is estimated unrestricted. Presumably d, will 
be close to cjne. 

The parameter values given in (24) satisfy the modified versictn of 
Sargent’s cross-equation restrictions (cq. [22]): If the parameters of 
the consumption autoregression (cj, dj are set equal to the values 
specified in (24), (he ccjuaiion resiriciioiis Mated in (22) all reduce to 
the form 0 • « = 0 or 0 ■ = 0, so that the restrictions are satisfied for 

arbitrary values of the paratneters of the income process («,, h,). I'hai 

is, imposing the restrictions d( = 0, i = 2,. . . , n; and r, = 0, t = 1. 

w, and estimating the income autoregression unconstrained would 
satisfy the restrictions in equation (22). 
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Note hat Sargent s procedure yields resuict.ons in the form of a set 
of equations-one equation for each parameter of the consumpuon 
autoregression. In Hall’s paper, however, .he parameters of the co 
sumption autoregression are constrained to particular valties-lagecd 
variables other than the first lagged value of consumption are con¬ 
strained to enter with coefficients of ^ero. While correcting the 
definition of permanent income makes the two seis of restrictions 
consistent, in the sense that the set of parameter values in (24) satisfies 
the equation restrictions in (22). there remains the issue of why the 
two sets of restrictions are not identical in form. 

I he set of equation restriiaions in (22) is dei ived by modeling ^Cf 
as a general (unrestricted) autoregressive process, then solving for the 
testiictions across the parameters of the consumption and income 
autoregressions which are implied by e({uatiun (16). However, since 
equation (23) is also an implication of the rational-expectations and 
permanent-income hypotheses, no additional assumptions are re- 
(piired to restrict the consumption autoregression to a first-order 
process. If the parameters of the consiimpiion autoregression are 
assigned the values implied by equation (2.3). the equation restrictions 
in (22) all reduce to the form 0=0. That is. once the restrictions are 
imposed within the consumption autoregression, no further resiric- 
tiotis are implied by the set of restrictions in (22). Thus the difterence 
in form of the two sets of restrictions. (22) ainl (24). is aiti ibuiable to 
the laci that, in deriving the restrictions in (22). E, .,r, is modeled as an 
unnecessarily general .intoregressive process. 

At this point, the conflict between the Hall and .Sargent papeis 
concerning the restrictions on the systematic part of the consumption 
autoregressioti has been resolved in favor of Hall's formulation. 1 he 
two authors also differ in their execution of the test, .Sargent estimat¬ 
ing the autoregressions with qiiasi-dilfcrenced data and Hall es¬ 
timating the autoregressioti with the data in Icsels, 

Refei riiig to equation (15), note that the disturbance in the regres¬ 
sion of r, on c,_i and other lagged variables will include a term which 
repre.sents the revision in permanent income in period t as well as a 
moving average of transitory consumption. Using to represent the 
levision in permaneiU income, 

= rB ^ (E, - (25) 

the consumption autoregression, under the null hypothesis, is: 

c, = [1 + r(l - - (1 + r)u,. , f u,. (26) 

Hall assume.s that transitory consumption, it,, is idetiticalK zero. In his 
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analylical discussion, Sargent assumes u, is an ARl process, t/, = 

I -t- &»,, and quasi-differences (26): 

~ pCi 1 = U + k( 1 - 6)J(0-i ^ PO 2 ) 

+ ft ' Pfr-i + tor-(l + Ota, 

In the econonielrie specification of the test, however, Sargent assumes 
that the disturhance in the consumption autorcgressitjn is uiicorre- 
latetl with the regtessois (p. 689). Tlie disturhance will he uncorre¬ 
lated with the regressors only if transitory consumption is identitally 
zero, since 01 , is correlatctl with c,_,. Quasi-differencing the data in- 
imtluces a moving average of the revision in permanent income term. 
Since Sargent in effect assumes away liausiiory consumption hy 
asserting that the error is uncoi related with legressors, it seems more 
sensible to leave ilie iegression in levels atid thus preserve the white- 
noise propei tv of the error term. 

I hc lesults of tests of the cotrcttetl versioti of the testrictions aie 
presented iti table 1. Sargent's K, incoitte variable was construtted as 
K, (INI’ - ca|)ical consumption — state, local, and federal tax recetpts 
t ti.msfers; liis consumption variable (I was total iteisonal consump¬ 
tion expenditures, both series ate seasonally adjusted and measured 
in constant (1972) dollars. F*jr comparison, table 1 also intiudes simi- 
lai test-, iisttig Hall's iticome .tnd cttnsumption vaiialtles, whit h were 
per capita pet sonal disposable income and per capita expenditures on 
services and notidiiiable goods. Hall’s data weie also seastmally ad¬ 
justed and stated m 1972 doll.ir.s.^ Following Sargent's specification, 
all tegressiotis iticlitded a linear time tiend, denoted t. 

■Sargent conduc ted his tests with the tiata in hi st dillei elites, h.ivmg 
■issumed the autoregressive parameter in the transitoiy consumption 
l>iocess to be unity, l.'sitig differenced data, the test statislic lor the 
collected lestrictious was 19.192, which indicaies rejection ot the 
hypothesis at the 99 jtercent conlidence level but not at the 99..’) 
peicem level, l estuig the incorrect restriclions, .Sargent reported (p. 
696) a maigirial conlidence level of 1.1)00 when /I, the MPCi out ot 
(jeimaneiit incortie, was consliamed to ecpial .90. 1 he marginal 

conlicletice level of the terse was only .1)97 when H was unrc.-stricted: 
however, the lesulting point estimate was “highly impl.ausible" (6.06 
X 10'). 

For the reasons stated above, 1 think that it is more appropiiate to 
run the regressions in levels than in clilierences. With the data in 

^ W'liilc ihc (lal.i set used to ici uii I t.ill’s ic-gic-ssmiis was c oiisn lu led .10 ot ding to Ins 
Ij.isK definitions ot the v.ii i.cliles, it is not .111 es.ict lepiodiictioii ot Ins d.il.i set. I In' 
data SCI used in table I diflers from llall'.s because il me 01 pot ales siibsetjuent Nll’A 
tecislons ol ibe latest obseiv.ilioiis .iiul Iiec.insc- .1 diileient piue dellatoi was tised to 
dellHte the incimic- senes 
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levels, the test statistic drops to 17.054, which is signihcant at the 97.5 
percent level. The difference in sample periods does not affect the 
results much. 

The results are sensitive to the definition of consumption, however. 
Using Hall’s data, which omit expenditures on durable goods from 
the consumption variable, the test statistic drops to 12.04, which is 
barely significant at the 90 percent level. T hus the difference in the 
consumption variables used by the two authors accounts for a major 
part of the gap between Hall’s results and those reported by Sargent. 
Correcting the restrictions and omitting the autoregressive transfor- 
mati<}n <jf the data does have a noticeable effect, though, decreasing 
the marginal confidence level from the 1.000 reported by Sargent to 
97.5 percent. 


HI. The Role of Current Income in Signaling Changes in 
Permanent Income 

The model in this section is based on the notion that, in forming theii 
expectations ol future income, rational economic agents will exploit 
the fact that income is a sttK'hastic process which exhibits a high 
degree of serial correlation. Baseil on the observation that the current 
leali/ation of income is greater than anticipated, agents will revise 
their expectations of future income, anti therefore their permanent 
inctnne, iipwartl. Further, the magnitude of the revisitm in perma¬ 
nent income associated with the realization of a disturbance in current 
inettme will depend on the parameters of the time-sei ies representa¬ 
tion of cm rent income. In this section, the quantitative relationship 
between an innovation in current income and the revision in perma¬ 
nent income is derivetf for a general ARM A representation of the 
income process. 

This section uses the innovation in the income process to model the 
period-by-period revisions in permanent income and develop a sim¬ 
ple structural econometric model of consumption. The implications 
of the permanent income-rational expectations hypothesis are tested 
in the context of the structural model. The Hall-Sargent test of the 
permanent inctrrnc-rational expectations hypothesis is then shown ti) 
be a test based on the reduced form of the structural model developed 
here. 


The Income Process 

More formally, the time-.series behavior ol aggregate income is mod¬ 
eled as an ARM A process whicfi is driven by a series of independent 
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shocks or “innovations” in the process.® I hus, 

yi = Piyt-I + Piyt-2 + . . . + ppyt-p e, + 

+ 4>2ft-2 + .. . + (28) 


or ji( = [1 + M{L)\I[\ — A{L)]e, where 1. is the lag, or backshift, 
operator; A(L) = p,/. + + . . . + M(/,) = <f>^L + + . . . + 

and e is a while-noise disturbance. The income variable is ex¬ 
pressed in deviations from trend. 

An ARMA model of order {p, q) can be equivalently characterized 
as a pure moving average tnodel of infinite order. That is, the time- 
series representation 

-t- + ^ 3 ^ 1-3 + ■ ■ ■ (29) 

is numerically equivalent to the model 

yt = piyt-i + P0'<-2 + • • ■ + ppyip + 

■f" -2 -t" - - . "f" — q ('^O) 

for Ip, = cf>, + pA-j- 

Expre.s.sed in the pure moving average form, the time-series model 
summarizes the r elationship between the forecasted value tjf the vai i- 
able and the realization of the disturbance term: 


dyn, ^ dy, ^ 

d€, de,-. 


(31) 


Since the distitrbance terms are white noise by constiuction, the 
expectation of all future values of t is zero. When the current forecast 
error e, is realized, the predicted value of income at each date in the 
future is revised proportionally to 

- £»-■>’(+., == (32) 

rhe present value of the lesision, at time /, in the anticipated path 
uf tuiure income is 



^ It shouUi eniphaMiCtl that the paptM the ARMA U) (juanut\ ih(f 

ex lent to which income is serial I > < onclaicH oiih Ikh.uiso the lra?ncv\ni k is a sin^plc \s a\ 
of desenhing the time-series Ix'h.ivioi of iiuome on the aggieg.tie tesel 1 he 11 eat men i 
o( aggregate mcomc as a sialilc ARMA pixxess is iH>t meant to iniplv th.u imlniduLils 
view income as pure nine senes in foiinmg c\(>eciations alxmt then hnvire inioinc. (hi 
the individual level all that is assumed is that t<»nsninei s will exphiii the sei lal coi rela¬ 
tion that (haracicri/es ilieir individual iiicoine slieams foi insiame. a sindt;iM wIh» is 
unable to find a summer )ob realizes that sunimer |obs will piobahlv he scarce next \ear 
as well; the prcxhiclion worker who is ciirienllv lai<l off le.ili/es dial he iiia\ not Ix' 
fehired for some weeks; the stockbroker whose commissions base dcxlined in \ohime 
may expect his income to remain below noimal lor d-f> quaiters. 
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In the inhnite horizon lase, the value of [1/(1 "f" >')]“'/'« tan be 
calciilatecl directly IVoin the parameters <f)g, pj of the ARMA (f), q) 
lepresentation of ilie process. It is shown in Appendix I that 



rientment of Urtdnticifated Capital Cains 

In tlic" siinple,sl version of the modc'l distnssccl in Section 1. the return 
to real wealth, r, was assumed to he c constant. This .issumpiion im|>lies 
iliat there are no unanticipated capital ^ains or losses on nc^nhuman 
wealtli. In the absente ol unanticipated tapilal gains, future lahoi 
income is the only component of permanent income which is not 
peilectly piedictahle. In this case the petiod-hy-period revisions in 
jtennanent income would be modeled as propoitionai to the innova¬ 
tions in labor income alone. 

As an empitical matter, however, unanlici|>atcd capital gains and 
losses on rt(tnhuman wealth probably constittite a signilicant 1 1 action 
of the icwisions in peimanetit incortie winch this model is trying to 
capitne. I'he following clisctission de.sciibes the construclion of an 
appioximaie measure ol unanticipated capital gains on nonhuman 
we.ilth. 

Cionceptu.tlly, the un.inticipatetl capital gain on nonhuman wealtli 
could be delincd as the (treseni value of the revision in the expectetl 
e.linings assoti.ited with the current asset holdings: 

capital gain tn period / f ! - ~(■I-’') 

^ = 0 

wbeie r,,, is the rate of leluiit to c.ipii,il in jieriod / f s. 1 he actual 
path of nonlabor income, will lellet i both the “exogenous” 

movements in the return to capital. />),, wliich we want to captnie, 
ancl the c-nclogenous changes in earnings Hows which result from the 
individual's decisions to acctimulate oi dccumulate nonhtiman 
wealth, I .issume that changes in the rate of reiutn to c apital, On, 
are CjuaiUitatively more important (h.iii the endogenous changes in 
ic,,., in determining the tmie-seiies pioperiies o( the obsei ved path ol 
nonlabor income. With this assumption, unaniic ipated capital gains 
are approximated as the present value of the revision in expected 
nonlabor income. 



adjustmknt ok consumption 


Repealing ihe derivation oi the consumption equation as in Setluni 
1, the unanticipated capital gains in real wealth enter the eciuati(jn 
symmetrically with the present value of the revision in expected labor 
income: 


ft ^ <1 I + r 




l-rb) 




(1 + r)Uf , + u,. 


m) 


where is luture labor income and is luinte nonlabor income. 

Suhsliluting into this e(|natioii the iimc-sci ies t hai atteri/.alion of 
the revision in the jii edit ted path i>( future intonie \ields; 


c, = c, 


+ r 


X 

Vi 



br) N 


«, - (1 + r)u,^, f II,, 


(37) 


where i//, are the paiaineiers ot the moving aiei.tge iepresentation, 
and Cj the innovations, ol the disposable income series, y, = x, + yf. To 
simplilv the notation, let <P = Xr ,|l/(1 -t L'sing this nota¬ 

tion, the present value o( the ie\ ision, at time /, in the antic ip.tted path 
of luture intome is denotetl <!>€,: the ie\ision in permanent income is 
denoted by F.tptation (37) can be rewritten as: 

O - O I + ■ (1 + >')><i-i + 'b (3Ht 

I'inallv, it is necessai v to consider the efteci on aggregate consump- 
iion ol a trend in [)er capita income. While an mdiMdual's permanent 
income Can be tlehned so that its movement over lime is ti endless, the 
presence ot a trend in pet t.ipiia income will induce a trend in 
aggregate peiinaneiil income ,uid iherelore in .iggreg.ite consump¬ 
tion. It per t.qiiia income h,is a posiiiic- treini, due to mcre.ising 
|5roduciivity, lor example, later generations will base greatei 
lifetime we.illh ih,m e.ii heiones. .\s the older generations die and ,ire 
repLtced by the \ounger gener.ilions, ,iggreg,iie consumption will 
trend ujiward .it the same i.tie ol giowih ,ts pei t.q)ii,i income \o 
(liange in the expet taiion of luture income is insohed m iliis treinl 
movement ot ,iggregate ct)nsuin[)tion. not does .ms p.ntitul.u nuii- 
\ithial change his level ot coitsumption. .Since the model is mtentled to 
explain revisions in pl.iimed consiimpiitm whith .tie c.tusetl b\ 
ihanges in the expectations .iboiii tutuie income, et]u.ition (33) 
applies to the movement i>t consumption .iiouiul a trend ati i ibut.ible 
to the trend in per capil.i income. 
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A basic t-mpiriraJ implic ation of tlie model developed in Section ( 
that, even if' the behavioral marginal propensity to consume oui „} 
current income is ?ero, consumption should respond to innovations m 
current income because these innovations provide new informaiion 
about futine income and therefore induce revisiems in per/nanent 
income. K.vpressing tlie income forecast error in terms of current and 
lagged income yields the following .system of eejuations; 

yi = Ml + Piy<-t + PO'c-2 + . . . + p„y,-^ + e„. CM)) 

Ac, = fi-i + k<t>(y, - Ml - Pty,~i - ... - p„y, ,.) 

+ f^ciAy, + ^,Ay,_, + ^2^yt 2 + . . . + /3 p-.iAy,+ Cj,, 

w here 



Pi_P2 P^ 


L 1 + r (I + r)* ■ ■ ■ (1 + r)" J 

r is the quarterly interest rate, and k represents the annuity rate. (If 
the individual's lime horicon is linite. the annuity rate will not he 
equal to the interest rale, as in the infinite horizon case.) 

In the umestricted version of the model, the first difference of 
consunifition resfionds to current ancJ lagged changes in income .is 
well as the innovation in the income process. In equation (40) the /3, 
coefficients are measures of the “excess sensitivity” of consumption to 
current income, that is, sensitivity in excess of the response atlribut- 
able to the new information contained in current income. The em- 
liiiical work discussed in this section provides estimates of the /S, 
parameters and tests the implication of the permanent income- 
rational expectatiejns hypothesis that consumption exhibits no excess 
sensitivity to current income, fi, = 0. 

The permanent income-ratic>iiai expectations hypothesis could be 
tested by estimating the system in its structural representation, 
cc|uations (.39) and (40), by full information maximum likelihocxl 
(FIML) both with and without the restriction /3, = 0 and forming a 
likelihood ratio statistic. 

As discussed below, the unrestricted version of the system is just 
identified. Bccau.se the sy.stem is just identified, the maximum likeli¬ 
hood estimates of the structural parameters can be recovered from 
the estimated coefficients caf the reduced form. Working with the 
reduced form has several advantages; First, estimation of the reduced 
form is computationally simpler than estimating the structural repre- 
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scniaiion by FIML; second, analysis of the redmed form ^ i},,, 

enaa relationship between the likelihocvl i alio test of jSj = () prtn)oscd 
here and theF-iesi proposed by Hall. It will be shown that Hall’s test is 
a test based on the reduced form of the consumption e(,uation (Ih) 
Further, it will be shown that because the structural system (cm, [m 
and [40]) IS just identified, the test statistic f rom Hall’.s test, based on 
the reduced form ol the consumption ecjuation alone, is numerically 
identical to the lest statistic for the hypothesis ft = 0 in the two' 
equation structural system. For the purpose of defer,nimng the star.s- 
iical sigriihtancc of a departure from the permancnl income-rational 
expectations hypothesis, both Hall's test and the test proposed here 
yield identical answers. For the purposes of assessing whether or not 
an observed departure from the hypothesis is of quantitative 
sigmfKance, however, the test proposed here has the advantage of 
peimitting the recovery of the point estimates of ihc siruttura) 
paiameters, the fi\, and their inclividual slandavtl cvtovs, ssheveas 
Hall's proceduie estimates only reduced-fonn coefruients. 


Identification of the Model 

Before discussing the idetnificaiion of the system, 1 will make cxplii ii 
my assumptions concerning the properties of the incoine-lorecast 
error. and the disturbance in the consumption ct|uaiion. ej. 

C'.onceptuallv, one would like to estimate the l-peiiod-aheaU fore¬ 
cast of income, conditional on all avail.ibic irilormaiion, A'(V||/, ,). 
Since the eslimaicd prediciioti etjuation I'df)') iticludes onU a \cr\ 
limited subset of die availaltle information, the pvedietwe value of all 
the lagged values of variables observed by the iiidisidual, but not 
explicitly incorporated in the legrcssion, will be relfecied in die crroi' 
term, Using the notation K(L)x,., to icpieseni die disciepaiicv 
between the l-pericKf-ahead forecast of income conditional on die 
Complete information set !( i and the income force asi conditional on 
the limited information set actually included in the regi ession, ft (/. tv, , 
= . . . ,y, y). the disturbance term e, in equation 

(39) will be the sum of the "tnie” innovation („ and iheft(/.U, i term, 

€ir "" €i( ^ I- 

I'be term R(I.)x,^, tan be thought ol as measurement eiror due in die 
omission of elements of tiie available infoi iiv.iiion set tuiin the toie- 
casting equaiiou. I he R{I.\x, i cei m is assumed to be serialK uncoi i e- 
lated and uncorrelaictd with the cxplaitatoiy tanables includeil m die 
income-prediction equation. By dchniitoii, ihe '‘triie innovation e, is 
uncoirelated with the R(L)x,-t term as well as the included ex¬ 
planatory variables. 
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Since the “true” income innovation, e„ = yt ~ Piyi-i — ■ ■ ■ — Ppyt~p ~ 
R{L)xi^,. is tlie explanatory variable in the consumption equation, the 
consumption disturbance €21 will contain the term —k<i>R(L)xi-u where 
is the coelHcient of the true inimvation. 

The syslenialic part of the consumption equation models the re¬ 
sponse of consumption to the new inl'ormalion contained in the 
cut rent observation on income. Obviously there are many sources of 
new information about future income in addition to the current 
observation on income itself. 'I'liese omitted current variables which 
induce individuals in revi.se their expectations about future income 
will also contribute to the disturbance in the consumption etjuation. 
Let the variable 6, represent the revision, in period I, in permanent 
income induced by other (nonincome) new information about future 
income. The "oinitted news" disturbance P, is assumed to be seiially 
uncorielaled and uiicoi related with lagf^ed vaiiables. 

Bec ause consumption is one component of cut rent income, the 
omitted news tei iu will have it non/cro cot relation with the true 
innovation in income, e„. If, in period /, curieni variables other than 
income induce individuals to revise their lorec.iMs of liitnie income 
iqiw.ird, P, will be positive, fhe positive reali/ation of t9, increases 
cut rent consumption and tberelore increases current income. This 
im|tlies that the omittc'd news term P, is positively correlated with the 
true income innovation, €,,. 

In addition to the ineasureinent error term, -A<t>/f(Z.)x,_,, and the 
omitted news tetm, P,, the consumpiioti e<|uaiion will, in principle, 
also incluclc; a moving average of transiloi v consiimplion, tj, — (1 t- 
k)T}i For the' |5uipose of estimating die incvdel, however, the tiaii- 
sitoiy ccvtisumptioii component ol the ei ror term will be ignored. 
F.ven though an individual's traiisitcu v c onMUiiption 111 .1 given ciuai- 
ter may be nomiegligible in relation to his pcrm.inent consumption, 
the per capita value ol ti ansiioi v tonsumpiion will be small relative to 
peimanent pei cajiita consumption if the inclividu.il reali/atious ol 
tr.iusitorv consumption are latgely indejiendent across the popiila- 
lioM. 1 think that it is reasoiialile to assume that die c|uan(itati\c 
coiuiibiiiion c)f the (laiisitory cciiisiimption term to the overall erioi 
e-j, is small compared to the loiiinbiiiion ol the measuremeni eiioi 
term, -k<PR{L).Xi ,, and the omitted news error term, 0,. 

I his disc ussion leads to the lollowiiig c h.intc lei i/alion ol the sti nc- 
tuiai disturhances; 

c„ = T €„ 


and 


(i:h 


tze = -k(l>R{L).x, , + P,. 
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Based on lliese assumptions, e, and are contemporaneously corre¬ 
lated, 

cov (e„, tj,) = -A(t> var l/{(L)x, ,] + eov (0,. e„), ('ll) 

l)Ut not serially cor related. 

In order lo simplify the tliscussion, identification of the system will 
be attaly^ted for a first-order AR income process. l ire getrerali/ation 
to a longer AR process is straightforward. 

The iiirrestric ted system is: 

>/ = p-i + pyt, + €i/ (43) 

and 

Ac, ■= /J .2 + A(p€„ + /3Ay, + « 2 r. (4h) 

where <t> = 1/{1 — [p/(l + r)]}. I'he value of the pararueter A, which 
represents the trraigmal propensity to consume out of wealth, is 
imposed a pr iori. I he parameter <1>, which represents the change in 
wealth .issociated with an innovation tn income, is a funciirm of the 
ittterest rate and the autoregressive |j.irameiei s of the income process, 
rile value of the intetest rate rs imposed a pi ioi i, so the i(leiitihcalion 
of p identifies <t> as well. 

Sinceei, ~ y, — pyy ,, the consumption c'tjuaiion c.in he i ewi itien .is: 


Ac, - ix-i 4 (k<t> + (i)e„ I- I3(p - l)v, , -t- (47) 

the covariances of the structural rlistui baiu es will be denoted as 
follows: vai (e,,) — <r?; v.ir (e-.,) - trj; and cov (c,,. e^,) = ir,,. 

1 he patameteis of the income process, p,, p, .ind trf. .ue identilieci 
i)V the income d.ri.i alone' 

cov ^ pv,ir(v,). (4*^) 

'-•r fy,) - p- var (y,) -i- <7^. (49) 

and 

p, ^- (1 - p)p„. (39) 

vvlieie p„ is the me.tn ol y 

IJsiiig the f.ict that e,, is hv consliirction oilltogou.il to y, and e.j, is 
assumed iiiicorrelated witli y, ,, 

cov(A(,,y, ,) = - Dvai y,, (31) 


cov (Ac,, y,) = /3(/j — 1) cov (y,.}'r i) + + /ilcrf + tro- (3‘2) 

var (Ac,) = )3^(p - 1)- vai' (y,) + (A<t> 4 /i)'’.ri 

2{k^P + /;4)(7|2 + <rrj. (53) 
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= Mir, + (8(1 - p)([i„,, (54) 

where is the mean of Ac,. 

The model is just identified—equations (48) through (54) constitute 
a system of seven equations in seven unknowrn parameters. Because 
the model is just identified, indirect least squares—that is, estimating 
the model as a reduced form, then transforming the reduced-form 
coellicients to recover the structural parameters—is equivalent to 
FI ML. 

(Consider the following reduced form: 

J'l = Ml + pyt~< + Vu (55) 

Ac, = iJL^+ fi(p ~ l)y,-i + ^ 2 ,, (56) 

wheie I'll = and Pji = + /8)eti + e 2 r- 4 covariance matrix of 

the reduced form, written in terms of the structural covariances, is: 


n = 


ik<P + /3)(r? -I- <r,2 


(k<t> + p)(T‘j + o-,2 

(A<1> -t- /3)V? + 2(A<t> -f- /3)o-,2 -t- crl 


\. (57) 


Note that the maximum likelihood estimates of the sttuctuial 
tuelfitients—M i> Mi. P. /3—are independent of the value of the 
parameter width measures the predicted response of consumption to 
the innovation in cm rent income. That is, the parameters fi,, (^ 2 , p, 
and /3 are identified hy equations (48), (49), (50), and (54), none of 
whit h conlains k<t>. 

F.xamination of equations (53) and (54), or, equivalently, the 
reduced-ldim tovariance matrix, equation (57), inditales that a priori 
restriction on the value of A<t> makes it possible to recover the struc¬ 
tural covariant es fr ,.2 and cr| from the estimated covariances of the 
reduced form; 


cov {i>„, i' 2 () ~ (Atf + /3)tri + cr,j 


(58) 


and 


var (Pjd = (k^ + ^)V| + 2(k<t> + /3)c 7,2 + wj. (59) 

.Since indirect least squares and FI ML are equivalent in the jtiM- 
identified case, the likelihood ratio stati.stic for the test of the resttic- 
tion f3 = 0 computed hy estimating the structural form, equations (4,5) 
and (46), by FI ML will be numerically identical to the likelihood raim 
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test statistic for /8 = 0 computed by estimating the reduced form, 
equations (55) and (56), by multivariate least squares.'* 


Comparison of the Test of /3i = 0 with Hall’s Test 

In the equation system actually estimated, income was modeled as an 
eighth-order pure autoregressive process: 


> = Ml + pcvi-i + p?y<-2 + ■ • • + Psyi-R + cif. 


Ac, = M2 + - Ml - Pi>i-I “ PiV/ 2 - ■■■ - P8>,-8) 

+ /3oAy, + + Cj,. 


(60) 

(61) 


Like the simpler system discussed above, this system is just 
identified. If a less general alternative hypothesis were speciHed—for 
example, if only the contemporaneous and three lagged first dif¬ 
ferences in income were included in the unrestricted model—the 
system would be overidentified. The particular spetificaiion of the 
alternative hypothesis was deliberately chosen in order to achieve just 
identification. Because the model is just identified, the maximum- 
likelihood estimates of the structural paiaiiieters can be obtained by 
estimating the reduced lorin; 


and 


~ Ml Pi.Vi-i P2,Vr-2 + • . . + PsVf-s + Pi, 


(6J) 


Ac, = {^L 2 + /JoMl) + /3o((Pl - 1)>’,-I + p-^, i + . ■ • + Ps.v, h] 

"t" )3iA'Vf-i “t" /^2Ay,-2 a- ... -I- ^^Ay,_7 "t po,. 


(63) 


where p,, = e,, and V 2 t ~ (^'J* + /So)*,, + L'sing the notation tt, to 
denote the reduced-form coefficient of y, 

Ac, = /ij + 7r^v,-i + + • • ■ + 7rsy,-N + P 2 ,- (64) 

K(]uation (63) makes very explicit the rel.itionslii]5 between the 
reduced-form coefficients tt, and the underlying sinictuial parame¬ 
ters: 


* F.q. (51) suggcsis a simple instruineiual vaiial>lcs cstiiiiaior of ihe e\cts> seiisuivuN 
|)aramcifr Noling lli.ii (p ~ 1) var » » i), t'<I (-'>1) it'MiUrii a> 

t'ov (At,, Vr-|) " cov (Av,. ,) ) 

I hus ^ is itleniifird as the ralio itov (Ar,. Vr i )!''Uo\ (Ay(, >( i W liie 11 tiiMiinpiion 

eqviation wuh the iiic(3ine iiiiiovaiuui in Oic eiiot icnn, Ar, = ^ ' ^ 2 ^)- 

one could estimate /3 l^v llsin^ Vf. i a' -ni iiisiiuineni foi Av, imonic. v, |. 

clearly a valid instrurnenl lor Avr, since it is corrolaloi %\)lh Avr bul inu oi ielated ^^uh the 
composite ciror tenn. + e,,)- F’sin^v,-! as an instiinnem loi Av,. die insirumcntal 

variables estimate of'^ is [cov (At,. » ,)]/|<o\ tA^,.yr i)]. " bu h is ev.u tl\ the e\pi eN>u>n 
for obtained from c<j. (51). 
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1 X 2 = 

TT, = fio(Pi - j) + /?I 
TT-i = /3„P2 - + a 

?!■;, = AlP l ^ /^2 + /3.) 

(fia) 

^7 “ ^^OP? “ + ^7 

“ fiaPn ~~ pT- 

Rcjj[ai (tlcss oi llie (iinc-soi it-s pnxess (Icstribinjj; iiH'oiiic, ilie lesiiic- 
lion lliai consumption c\liil)i»s no excess sensitivity to c in rent income, 
/Su ~ fii = ■ . . — (It ~ 0, iin[jlies ili.tl the tetliuetl-fot in coelfi(ienis 77 , 
•ue all et]ual to ^eto. 

I he ledmetl roitn expresses the <ondilional expectation o}‘ the 
etidogenous vaiialile, Ac,, given the precieterininecl variahles, whiih 
in this case are the lagged values of iiuome. I hus, the empirical 
toiitent o( the restrtttioti that fin ~ (i, ~ I3 t = 0 is that liic 

cotulitiotial expectation of Ar, giveti lagged valties ol iticotne is /eio. 
This ini|)lication of die pernianent incoim'-nitional expet tatiotis fn- 
|)othesis is pretisely the hypothesis tested by Hall. Qiiotitig lioin 
Hall's pa{>ei (Hall 1978, [a. 97b). “ f'he tests of the stoi hastit iinplica- 
tiotis of the life t\( le-petin;ttienl intome hypothesis t allied out in 
this papet all have the lonn ol esliinaling a t ontliiional expeiiation. 
A'(<,|c, ,. I). " I's'ft.''■'( 1 IS a vet lor of'tlat.i know n in periotl ( - I, .incl 
then testing the hypothesis th.ii the londilional expetl.ilion is attnalK 
not a f tint lion of x, 1 . " 

I he analvlit.il appioath i.iken by this ptipei was to foimnlaie ,1 
genet.il sit ttt tin,tl model in wfiit li <onsuni|nion lesponded to h<>ih 
niiiov.iiions in peiniaiieni income and ih.niges in tniieni iiucnnc 
I he lestiniions iinposetl by tile peiimment iiu ome • 1 .iiion.il espet la- 
lions hspoiliesis on the siiitciui.d p.ii aineiei s, /i, - I), were shown in 
imply I he 1 esi 1 it non I h.il 1 he techie ed-fc>i m pal a met ei s ttj were eijual 
to /eio. H.ill's .i|)pioaifi ic» foi iniitaling .1 test of the hypoilic''!'' 
.ivoidecl the etoiioiiieli it speiilitaiion ol a siiihiiii.iI c onsimi|)ii"ii 
e(|iiafion F.stliewnig stint iiiial esiiin.itioii allogetliei , Hall .m.ih/cil 
the lesiiitiions imposed by the peimanent irit ome-i .itioii.il expect,i- 
lions hypoifiesis on the tondtitoiial expet lalion of Ar, giteii l.iK.iC'^ 
variables. Ilins Hall ai rived .it tlie restrictions imjiosed on ike 
retfiited-foi rn tonsiiniplitrn etjiiaiiori by an .iiialyiital sfioilciii. i.iil'e' 
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lhan derive the redutcd-loi in lesliiciions Irnm an explitii siiucuual 
cfoiiomevric model, he exploited the inierpi elation ot a reiiuced toim 
as statin}; a eondilitinal expertaiion and eonsidered the implications ol 
vice permanent iiuome-i ational expeciatums hypothesis lor the ton- 
(litional expectation oi (onsnmption. 

AllltonHh l)oth Hall’s analysis and the stiuctural model developed 
in this paper lead lo the same rc^tiiclions on the 1 educed foim oi the 
consumption e<|Ualion, die test proredme used here diriers Iroin 
Hall’s in that the rc'stiiciion is tested hy (oinputinjr the tikelilmod r atio 
statist it: for the 1 vvo-ecinaiioii reduced foim (eejq lti21 and 16:d]). while 
Hall’s test statistic is rompined hv t‘stiinaiin|;; the consumption equa¬ 
tion (etp lh41) alone. As pi oved in A]r])endi\ 11. howesei, the two tests 
vield numerically identical raluc's ol ifie likelihood raiio statistic lor 
the hyjiothcsis that die conditional expectation ot consumption i;i\eri 
lagteed income is /eio."' 

rhe basic advantage ot the sit uciuiallv derived lest over Hall’s test 
is that it |)eiinits the tecovetv ol the point estimates ol ihe siruitnral 
parameters measuiing the excess seitsiuviiv c>l consumption to cur- 
leiii income, die f3,, and tlieii individual siandaicl errois. With Hall's 
test it is tlifhcull to inieipiet the cpianiitaiive impott.mce ol a rle|jai- 
ture Iront the pifdicted behavioi ot consumption since onlv the 
rccluceci-loi III c oettic ieiils .itc* esiim.iiecl 

Data 

1 he lest ol die 1 esti ic tion - (t is .i test ol the joint hv jt.tiliesis 1 h.tt (ai 
toiisiimci s ex[)Ioit the mne-seiies piopeiiic’s ol income in toimmg 
iht'ii estimates ol |}ei maiteiu mcotne .md th) the .iiljusimc^ni ol con- 
siiiiiption tc) .1 levisiou m jtetni.mcnt income is tullv .icineved within 
ilie cju.ulei tliat die iievv intoim.itioM bc'comc's .iv.iilahle. In omIct to 
liinil die c on slim I )l ion c one cq>t to a t oinponeni w hti h c ,m be .tdjtoled 
i.ilticlh and smoothlv nt c h.mges in jtei manent me ome, the c oiismiqj- 
iioii c'once|)t used ,is die dependent v.iti.ible w.is e\|Knclimies on 
iiondinable goods Kven il one vvent to the liouble ot convening the 
Naiion.il I nc cime .11 id I’loduc i .\c c ouiiis ( N 11'.\ I clai.i on ex penchtm cw 
oil dll 1 .title goods into .1 sei tes ol iini>uied sei v ic e Ilou s, 1 he exisienc t“ 
ol substantial trans.n lion msts ot .idjiisting stocks ot dm able goods 
suggests that the eonsnmption ol sc-i\ici’s ol dui.ible goods vvould 
exhibit l.tggeci .icljnsiment to cb.mges m peim.meni nuc'tne. Ihe 

'I .m> iiKtcliCeil lo H.cU (oi suggr-citin tli.il 1 cvCcai.iCo die modi I liv Midiiect le.iM 
s<tii.iics insu-.icl ol HMl 1 he .>l>s<i\.uioi« ili.ii the Wst K.iwa on die mo o(ci,ccioii 
it‘cUi(e<i l(‘?ni ,ind the U'si h.otsi on die lohiod toon ol die ionNCinunion e\tn.clion 
aliine coll oriel niiineiii.illv idenCK.il c.ihie- ol die hkeldiocHi i.mo a.none o .dso due to 
tiall 
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NIPA data on consumption of services are subject to a similar objec¬ 
tion, since services are defined to include the imputed service flow 
from housing. Nondurable goods represent about 45 percent of total 
personal consumption expenditures. 

Both the consumption series and the income series, which was 
disposable personal income, were seasonally adjusted and expressed 
in real per capita terms. After allowing for the construction of the 
lagged variables, the sample period used in the estimation was 
1949:111-1979:1. 

I'he income process was modeled as an eighth-order autoregressive 
process around an exponential trend. In order to simplify the estima¬ 
tion, the income data .scries was first detrended by its estimated 
exponential trend for the sample period 1947:1-1979:1 of .00565259 
per quarter, and the model was estimated with the detrended series. 
As mentioned above, the presence of a trend in per capita income will 
give rise to a trend in per capita consumption. Because the consump¬ 
tion varialtle is expenditures on nondurable goods rather than total 
consumption, the trend in nondurable goods consumption will not be 
equal to the trend in income unless the income elasticity of nondura- 
hles consumption is unity. For this reason, the consumption series was 
detrended by its own estimated trend, which was .0032616 per quar¬ 
ter. Both the income and consumption series were expressed in units 
of (constant) dollars per capita. 


Kmpinetd RestdU 

The reduced Form was written out in terms of the structural param¬ 
eters and estimated by multivariate least stjuarcs. The unconstrained 
system estimated wa.s: 

>'( = Ml + I + + . . . + Psyc-s + I'll (fifi) 

and 

Ac, = M 2 + l3oUpi - DYi-i Pz.V ,-2 + . . . + p»yi-Hl + 
t- fiAyi-z + • • • + + f'li- 

1 he point estimates and their standard errors are reported in table 
2A. 

Imposing the restriction /Jo = ;8i = . ■ • = (8? = 0, the constrained 
system consisting of the income equation (66) and the consumption 
etjuation 

Ac, = ^2 + jij, (68) 

was also estimated by multivariate least squares. The point estimates 
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l ABLE 2 


A. 

Point' Estimaii.s 

AND Sl'ANDARlF F^KRORS FOR 

Equations 

(66) AND 

(67) 


Pi 

P« 

Pa 

P4 Pa 

P* 

Pi 

Ps 

92 54 

.964 

069 

.074 

-.360 112 

.157 

-.008 

- 053 

(64.26) 

(.092) 

(.126) 

(.125) 

(.122) (.123) 

( 124) 

t 122) 

( 085) 

/*! 

f3o 


01 


Pa 

P. 

Pi 

32.71 

.355 

.071 

.049 

- 116 .114 

-.073 

.050 

-.012 

(24.95) 

(.275) 

(.036) 

( 038) 

(050) (.071) 

( 046) 

( 036) 

( 035) 


B. Point Ksiimaifs ani> SrANi>ARi> F.rroks 

FOR FqUaI 

ION (68) 



Pr 

Pi 

Pi 

p4 Ps Pe p7 

P« 

Pj 

55 62 

.898 

067 

231 - 

476 280 -.045 .065 

- 045 

068 

(57.72) 

(.082) 

(.113) (112) (. 

110) (.111) ( 111) (.110) 

( 076) 

( 862) 

C. 

Rniiic H) 

-Form C>)Kfh< unis and Siandard Krrors for tgiAiios 

. (69) 




TTs 

W 4 TTs 


^7 


32 71 

058 

.002 

- 138 

103 - 148 

179 

- 065 

.007 

(25 95) 

( 037) 

( 051) 

(.050) 

(049) (.050) 

( 050) 

( 049) 

( 034) 


and standard errors, based on the constrained estiinaiion. are re¬ 
ported in table 2B. 

Using the notation and to denote the log likelihood ot the 
‘onstrained and unconstrained systems, respectively, the likelihood 
raiK) statistic X = - /,„) for the hypothesis/3o = /3, = ... = /S? = 0 

IS distributed The value of the likelihood ratio test statistic, 

was 27.024. Since the critical value of the x^(^) distribution is 21.96 
for the 0..5 percent significance level, a test statistic of 27.0 represents 
a very decisive rejection of the hypothesis. 

For comparison. Hall's version of the test using the reduced form of 
the consumption et|uation alone, 

Ac, = + 7r,y,-, T TTjVr 2 + . . • + 7T„y,-„ -I- (69) 

was also estimated. The reduced-form coelficienis and their stand.ird 
errors are reported in table 20.® The likelihood ratio statistic for the 
hypothesis that /3o = /3, = . . . = /St = 0, it is necess.u y to compute the 
computer work confirms the assertion that the test of /S,, = /S, — . . . = 

* The esumalect cdiislaiu term lu the ronsir.iinol ceiMon of e(| (li'.l) w,i>i (Hi8. «itli .i 
slandanj eiror of .866. 
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fir " 0 ill ilif (onifxt ot llii* Iwo-eijtialion rL'iliitefl ibnn (eijij. [(5(jJ 
,Hi<l [67]) and flic test of tt, = ttj = . . . — tth = 0 in die eontexi of 
the retliK ed-foi ni <.()nsnni|)(ion equation alone, e(|nalion (6ff), yield 
numerieally identiial values of the likeliliootl ratio statistic. 

1 he stanclat'd ei rots of regression (.SKR) and Durbin-Watson statis¬ 
tics at e rcptiiied in table 3. Tlic SKR of each cijnalion was roniiiiiied 
hv dividitig the sum of s(|uaicd rc^sidnals by the degrees of freetlotii 
rathertlun the niitnber of observatiotis. Die sl.indaid eiror.s have the 
iiitet [iretation of the cfollais per capita piediclion ei ror of each etpia- 
tion. Fot eomparisoti, the standarci deviations of y and Ar weie 
.'S71..’'i7 atitl $9.4"), respetlively. 

1 he discussioti of the identification ol the system showed that the 
maximum-likelihood estimates of the /j, paiameteis were inde- 
pemlent of the value of A<I>. the paiametei which measures ilie pie- 
dicted response ol consumption to an innovation in current income. 
While tlie idcniilu ation of the paiameier k(p is not i ecpnred to test the 
hvijothesis th.it /J,, — = . . . = /Sj = 0, it is necessary to compute the 

value ol k<P in older to lecover the parameters of the covaiiante 
mattix of situcluial disturbances. 

liy stibslituling the esliin.iteil paiameteis of the income process into 
etjuation (.34), one can tompute the vtilue ol *1), which gives the 
c,ipiiali/ed value ol the revision in expecterl future income or the 
change in w ealth associatetl with an innovation in cun cut income. Foi 
the estimated parameters of the income process, and an annual inter¬ 
est rate of .a percent, an innovation in current income implies .i 
(liange in wealth of 1 7.80 times the income innovation.^ Assuming an 

^ ! I) <>» flfj If) MiMplif y I he esiim.iDoii, ihc JiuoiDe pnKcss vv<js inoilrlcrl as an 8-(|iiai ifi 
pure AK pjfx<'ss. Formal lesls o! inoic g^'iicial TJii\e<i ARMA ni<xlc‘ls ot the income 
pifxess Klcmificd ilie 0, 4) speciti<aiioii <thicc AR paiaineleis and lour MA 
p.iiainetci s) and ihe (1, 0, 4) ilwalion as the most p.ii simonioiis iniKod ARM \ 
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average horizon of 20 years, the annuity rate is 0.0198 per quarter." 
1 he change in periiianent income associatetl with an innovation in 
current income is (0.0198)( 17.80), or 0.8.5 times the income innova¬ 
tion. Since the consumption variable used to estimate the system was 
nondurables consumption rather than total consumption, the 
parameter k<t> will be the product of the marginal propensity to 
consume nondurables (jut of peirnanent income and the change in 
permanent income induced by an innovation in current income. With 
nonduiabic consumption accounting for 47.5 percent of total con¬ 
sumption, Act = (().475)(0.0198)(17.80) = ().167.5. 

I'he estiniiited covariance matrix of the distuibances of the unre¬ 
stricted reducerl foi m is: 


Cl = 


467.808 

79.8921 


79.8921 

70.5175 


Having identified ilie value k<t> as 0.1675 and the value of (in as 
.855, the ctrv.ii i<uite matiix of the strut luial dislurb.mces can be 
recovered using et|uaiion (.57). The esiimateti covariance matiix trl 
the structural tlisiui bances is: 


i = 


467.808 
- 164.;>8S 


- 164.588'} 

I 14.745)■ 


Note that the covariante between the structural disturbances of the 
two etptalion.s, cov (e,, is negative and vet v laige rel.itise to the 
variance of the distuibance in the consumption equation, tj. In 
the di.scussion of the |)roperties ol the structural disturbames. it was 
pointed out that the omission of elements ol the available information 
set from the income-forecasting et|uaiion implies ih.ii the income 
residual is suhieci to me.isuiement error and ih.ii the presence of 
nieasuiemeiiL eiror gives lise to a negative covariance Iretween tlie 
siiuctural disluibiuices. 1 he large negative estiin.ite ol cov (Ci. go) 
suggests that the iiieasuiement error iiiiiodiuetl liv using itie 
iiicome-lorecast error ,is a measure of the “true" income imiov.iiioti 
(annot lie dismi.ssed as negligible, file finding ill.it the measurement 
ei ror is rjuaiitiiatively imirori.ml indicates th.ii .iiiempis to esiim.iie A'Cl> 


1 fprcsent.ilKiiis ol lilt* iiuoiiu" poKtws. Mr* v.iIiic* <>1 4> ii!1J)Ik*<1 I>' ilu* (.f O, t) tiKsIt-l 
w.is 16,11, by the (I, 0, -1) iiiik1c*I, l.'i 15 .SiiKf llu* \K\I,\ iiukIcN kIciiiiIriI lliioiii;li 
t<)inial IMiR'-seiies .in.ibsts ot iIr* hr our* senes \ieUI s.ilues ol \ei\ sitnil.ir In llu* 
\akie ot <I) coiiipuled toRii itie S-(|ii.iiIer .\K puness. thete is no iiutK.tlion th.tt tlic 
assumed pine AK piixess is iiiadecjiiaic in iapniimg the tmie-seoes piojxims ol 
income 

" Idle cpiai let Iv .iiiniuO’ i .ile w.is t atctiialc-d wilh die lollowuig loi mu la .muuiu i .ite 
- r/( 1 — I'he cpi.ii lei l> iiileiest i .ilc*. i. w.is assunit'd to In* 0 tU 2;>. tilt* hoi i/oii ot 

the I epi esenr.ilivc uidiMdu.il, tv.is assumed lo In* Mil €|U.ii lei s 
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directly by doing regressions of the change in consumption on the 
income residuals will be subject to severe downward bias. 

I'he discrepancy between the predicted and observed behavior of 
consumption is quantitatively large as well as statistically significant. 
Since the dependent variable is consumption of nondurable goods 
rather than total consumption, the j0, coefficients represent the excess 
sensitivity of nondurables consumption alone. Considering that non¬ 
durable goods consumption represents 45 percent of total consump¬ 
tion expenditures, the point estimates of /Jo, /Ji, and ttf -355, .071, 
and .049, respectively, indicate a strong “excess” response of con¬ 
sumption to current income. Unfortunately, the estimate of the 
parameter /3o is imprecise, as indicated by its fairly large standard 
error of .275. 

Because of the large sampling error of the estimate of the crucial 
parameter /3o, these tests stop short of providing conclusive evidence 
that the permanent income-rational expectations hypothesis fails in a 
quantitatively significant way. .Still, the large point estimate /So, to¬ 
gether with the statistical rejection at the 0.5 percent level of the zero 
restrictions on the /3( coefficients, constitute significant, if not conclu¬ 
sive, evidence against the hypothesis. 

In Hall’s paper, tests of the pretlictive value of lagged income did 
not reveal strong evidence against the permanent-income hypothesis.* 
Hall concluded, “ I'he results of this paper have the strong implication 
that beyotid the next few quarters consumption should be treated as 
an exogenous vat iable. . . . With respect to the analysis of stabilization 
policy, the findings of this paper (support] the view that policy affects 
consumption only as much as it affects permanent income” (p. 986). 

Why is it that the tests reported here reveal substantial evidence 
against the permanent-income hypothesis, whereas Hall’s did not? As 
shown above, the lest based on the two-equation reduced form is iu)t 
intrinsically more powerf ul than Hall’s test. The structurally derived 
reduced-form test provides stronger results for two rea.sons, one 
obvious and the other more subtle. First, by permitting the recoverv 
of the structural parameter estimates measuritig the excess sensitivity 
of consumption to cutremt income, the /3(, the structurally derived 
reduced-form test makes it possible to as.sess the cpiaiilitaii'e 
significance of a departure from the null hypothesis. 

Of the regressions reported by Hall, the regression of current 
consumption (of nondurable goods and services) on lagged con¬ 
sumption and four lagged values of income is closest to the reduced 


“ Hall was able lo sialistiially ie)C<i the perinancnl-imoine hypothesis based on tesn 
ot the predictive value of a .stork pike index. 
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form estimated in this paper. On page 983 Hall reports the lollowing 
coefficient estimates and standard errors. 

C( = -23 + 1.076c,_, + .049y,_, - .051ji,_2 - .023:y,_3 - .024^,- 4 . 

(11) (.047) (.043) (.052) (.051) (.037) 

Testing the hypothesis that the coefficients of all four lagged values of 
income were zero, Hall got an T-statisiic of 2.0, which was high 
enough to reject the hypothesis at the 10 percent level, although not 
the 5 percent level. Noting that the first lagged value of income had a 
small positive coefficient and that the sum of the coefficients was small 
and negative. Hall concluded, “There is a statistically marginal and 
numerically small relation between consumption and very recent 
levels of disposable income" (p. 984). 

The statement that the relationship Ijetween consumption and dis¬ 
posable income is “numerically small” refers to the magnitude of the 
reduced-form coefficients, the tti, not the structural parameters /3(. 
Hall is explicit on the point that his regressions did not provide 
estimates of structural parameters: “No claim is made that the true 
structural relation between ctmsumpiion anti its determinants is re¬ 
vealed by this approach" (p. 977). 

The obvious advantage of the structurally derived reduced-form 
test is that it pet mits the estimation of the stiuctuial parameters. In 
equation systetn (54), the leduced-form coefficients are written out in 
terms of the underlying structural parameters. Equation system ( 68 ) 
indicates that even if consumption responds strongly to current and 
lagged changes in income, that is, the /3( are large and positive, the 
leduced-form parameters will be small. Consider in particular the 
reduced-form coelficient ofvf-i: 

TTi = /3fl(Pl “ 1) + /3|. ("D 

When nondurables consumption alone is used to form the dependent 
variable, the estimate of is .355 and the estimate of /3, is .071. 
However, since the first aiiloregressive parameter in the income pro¬ 
cess, pi, is estimated to be ,964, the value of tti is only .058. When ihe 
consumption concept is defined as nondurable goods and services, /3o 
and ) 8 , are estimated to be .461 and .128, respectively. 1 he resulting 
reduced-form coelficient tt, in this case is . 112 . In addition lo the 
4-quarter distributed lag. Hall reported the results for a test using a 
12-quarter Almon lag. Iti both cases the sum of the reduced-form 
parameters was small and negative. Using ccpiation system ((i5), note 
that 

8 8 

^TT, = - 1). 


( 72 ) 
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f'hc sum of the autoregressive parameters of the income process is 
about .95 for quaiterly data, t herefore, even if fi„ has a healthy 
positive value, the sum of the red need-form parameters will be small 
and negative. 

While the test based on the reducctl form of the consumption 
equation alone gives the same test statistic as the test based on the 
tvvo-equatioii reduced form, it does not by itself provide any informa¬ 
tion about the quantitative importance of a departure from the null 
hypothesis. Observing that the reduced-form coefficients are small 
does not imply that the structural coefficients are small. To the con- 
itary, large positive values of the cocllicienis are as.sociaied with 
reduced-foi m paiameteis which are small tmtl which sum tt) a nega¬ 
tive \'alue. 

1 he secoinl, mote subtle advanttige of the structural appioacfi is as 
follows: Heciuise it requires the explicit specifitation of a structural 
model, the stiiictuial .qjprotich provides some guidance coucei iiing 
the specification (jf the reduced form. 

While H.til's apprtjtich of anttly/.ing the restrictions on the condi- 
tiotiai expectatioti of consumption led to the correct conclusion that 
the teditced-fot tn coef ficienis of lagged vtnittbles shoitld be /ero, it 
dief not suggest atty criieri.t fc)r selecting it paiticular collection of 
vtiriables to be iticltided in the reduced form. By studying the 
St I uctural spec ificatioti, one learns that, under the alteriiitiive hypoth¬ 
esis ( 0^1 > 0), ;uiy lagged value of incotne which has tt tion/eic) 
coefficient in the income atitorc’gression will in general have a non- 
zeio cctefficieiu in the consumption reduced form. 

Ftii thei, if one believes th;it the income process is best modeled ;is 
an AR process .iiound ,m exjjonential trend, then the consumption 
reduced foi in should bc‘ fit to the detrended income series rtither ih.in 
the raw income sei ies. Alternatively, oneccvidd leave the income series 
in its otigm.tl (not detiendeci) form and include an expemential trend 
term, e"", in the consumption reduced foim. In Hall’s original tests, 
the income series was not detrended and the reduced form did not 
include a trend tertn. If the income process does include a trend. 
Hall’s tests are misspecified under the alternative hypothesis. One 
would expec t that the failure to allc>w for the trend in income, which 
causes the reduced form to be misspecified under the alternative 
hypothesis, would reduce the power of ifie test. As an empirical 
matter, the treatment of the income trend does have a noticeable 
effect (jti ilie value- of the likelihood-ratio statistic.'" In all of the tc'si'- 
reported above, the incrmie .series have been detrended. 


In .1 loiiinu’m on Hall's papn, Stari/ (1971*) poiiiud (jui tfiat ilic iiulusioii ol 
exjxfneiitial tiend in ihc 1 iciuiof* lo iiurrast* iIk' value ol the test stati'-tit 
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Having shown thai ihe test based on the bivariate reduced foirn and 
Hall’ s test based on the sitigle-etjnation reduced form yield nuiner- 
icaily identical values of the likelihood ratio statistic for comparable 
specifications, it is necessary to tfetermine why the rational 
expectations-permaneni income hypothesis is rejected at the 0.5 pei - 
cent level in the tests reported here, wliile the hypothesis could not be 
rejected at the 5 percent level in Hall’s original tests. In jiarticular, one 
would like to know whether the difference in the test statistic is 
attributable to diflerences in the data sets, or to differences in the 
specification of the alternative hypothesis, in terms ol the number of 
lagged values of income iruluclefi aiul the treatment of the trend in 
income. 

Hall useti a broadei concept of consumption—expenditures on 
nondurable goo<ls and services—than the consumjition concept, 
nondurable goods, used in this pafier. Data on real per capita con- 
sumptioti of nondurahlcs and services were constructed to provide a 
direct comparison to Hall’s original results. While the data set used to 
rerun Hall's regiessions was constructed according to his basic dehni- 
tions of the variables, it is not an exact reprochic lion of his data set 
The data set used here differs from Hall's because it incorporates 
subseejuent NIFA revisions of the tnosi recent obseiv.itions and be¬ 
cause a different pi ice deflator w,is used to deflate the tneome set ies 
In the regression of cm rent consumption on l.igged consumption and 
lour lagged v.ilues of income. Hall obtained an F-statistic for tlie 
exclusion of the lagged values of income of 2.0, which is somewhat 
lielow the critical value of 2.45 tor the 5 percent level. F(.)r the same 
sample period atid sjiecificalion, the test statistic vvas 2.75 using the 
tec oust rue ted data set. I bus, minor diflerences in the construe tion of 
the data raise the significance level of ihe test siaiisiic from somevdiat 
below to somewhat above the 5 percent level, (amsideraiioii of the 
.vinictural model indicates that the reduced form should include as 
many lagged values of income as the income autoiegiessioii and that 
some allowance for a irc'iid in income should be made, either b\ 
detrending the income senes or by including .i iiend term m the 
consumption redticecl foim. f'siiig the original (not detrended) in¬ 
come set ies but increasing the number of lagged values cil income 
ineliided in the coiisumptioii reduced foini from lour to eight iii- 
crcMsed the value of the likelihood-ratio statistic from 10,95 to 22.bl(i. 
1 his taises the signilicance levcd of the test liom the .5 pcTeent level to 
the 0.5 percetit level. Detretiding the income sei ies also lias a substan¬ 
tial effect on the test statistic. For tlie sjx‘cifuation vMth loin lagged 
values of income’, tlie test statistic was Hi. 192 for the’ detrended 
income sc^iies (which is significant at the 0.5 percent level) as opposed 
to 10.95 for the raw income' seiit’s For the spec itieaiion with C’ighi 
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lagged values of income, detrending the income series raises the test 
statistic from 22.616 to 27.108. Thus, the insight concerning the 
appropriate specihcation of the reduced form which is obtained by 
explicitly formulating the underlying structural model was responsi¬ 
ble for raising the significance level of the test from a marginal 
rejection to a decisive rejection. 


IV. Conclusions 

This paper develops a simple structural econometric model of con¬ 
sumption. In the general spccilication of the model, consumption 
responds to the changes in permanent income signaled by innovations 
in the current income process and to changes in cut rent income itself. 
The response of consumption to current income beyond that at¬ 
tributable to the role of current income in signaling changes in per¬ 
manent income is termed the "excess sensitivity" of consumption to 
current income. The econometric model is then used to estimate the 
excess sensitivity of consumption to current income and to test the 
implicatitm of the permanent-income hypothesis that the excess sen¬ 
sitivity of consumption is zero. 

1 he tests reveal substantial evidence against the permanent-income 
hypothesis. Using either nondurables consumption or consumption 
of nonduiables and services as the dependent variable, the hypothesis 
that consumption exhibits no excess sensitivity to cm rent income can 
be rejected at the 0.5 percent level. Using expenditures on nonduta¬ 
ble goods as the consumption variable, the point estimate of the excess 
sensitivity ot consumption to the contemporaneous change in income 
IS .355. Keeping in mind that nondurable goods represent only a 
fraction of total personal consumption expenditures—about 45 
percent—a point estimate of .355 repre.sents a large departure from 
the permanent-income hypothesis. 

The paper also shows that the test of the permanent incomc- 
lational expectations hypothesis proptised by Hall (1978) can be¬ 
thought of as a test based on the retluced form of the structural model 
developed here. Rather than develop a structural econometric model 
and derive the associated restrictions on the reduced-lorm parame¬ 
ters, Hall’s approach was to analyze the implications of the permanent 
income-rational expectations hypothesis for the conditional expecta¬ 
tion of consumption given lagged variables. Since a reduced form has 
the interpretation oi expressing a conditional expectation. Halts 
analysis led directly to testable restrictions on the parameters of .i 
reduced-form consumption equation. When the structural model is 
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just identified, both tests yield numerically identical values of the test 
statistic. While the structural approach does not provide a statistically 
more powerful test, in general, than the reduced-form approach, it 
does have the advantage of providing estimates of the structural 
parameters of the model, including estimates of the “excess sensitiv¬ 
ity” of consumption to current income. 
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Appendix I 

For the genetal model 

yi = Pi>/"1 + P2.Yf-2 + • ■ . + PpY/ » + <( + + 4>2fi 2 + ■ ■ + (Al) 

(*2> 

(.0 

where the «/(( are the infinite series of moving average parameters in the 
pure moving average representation of the model' v, = e, + <pi^r-i + 

+ . . . . Solving for i|/, in terms of the estimated p, and if),, 
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Appendix II 


Pi ■oof thill llie likelihood ratio statistic for the lest of y = 0 in the equation 

At, = p. + + V, (A 13 ) 

is nuinciicallv identical lo the likelihood ratio .statistic for the lest of /3 = 0 in 
the hivan.ile reduced form; 


(/■ X 2 ) ( 7 ' X 2 ) (2 X 2 ) 

(v, AoJ = [l >’, 


/3(P 


1 ) 


('/' X 2) 
+ It'll Fill- 


(AH) 


Tfie (il.S transformation of equation (A 14) is obtained by postinultiplying by 
the lower li iaiigulai matrix P such that PP' = ll where II is the (tivan.ime 
matrix o( die reduced form; 



It will be t OHM'llie in lo normalize P sue h that the lower-i iglil-haiid element is 
equal lo one. Define 



where = [PiJP-ji) and P,^ = (P,ilPrt)- rransfoiming (AH) by P yields: 

P,,y, t-P,jAi, (P|,pi + PxiHt) + >'i-tlP,iP I- Pi>/f(p - 1)1 ■+■ Fii (A15) 

ami 


A(, - Hi e y, i/?(p - 1) + Vit (Alb) 

\shere r,, = PiiPu ^ 

Let /,' denote the log likelihinxl of the ti anslormed income equation (A 1.')), 
denote the log likelihood of the con.stimpiioii e(|uaiion (Alb), and 
denote the log likelihoiKl of the bisaiiate system. Sinie toy (e,,. Vn) = I), the 
yaliie ol the log likeliho<id Iiiikiioii of llie bivati.ile system is llie sum ol ifie 
values ol the log likelihood of ifie tivo iiidiyidual e((ualious. I.' ^ — J.' + /.' 
Imposing the < onsli ,unl that /J = 0 reduces the value o I U, l.f. < I.'f,. I in posing 
tfie lestiiilioii that /S O cIik-s not restrict llie log likelihood fuiiclioii ol tlie 
Itansformed im oiiie equation, since p, andpare uni esi i ic led ])ai ameters, / ,' 
= f-i. 1 hereloi e, all of the* dec rease in the likelihood I uiic lion of the bivariate 
system IS atlnbut.ible lo the decrease in the likelihood Inncliou ol the coii- 
sumptioii equation alone; - LJ - Lf. 
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Measurement of Monopoly Behavior: 

An Application to the Cigarette Industry 


Daniel A. Sumner 

\urlh Cnrolm/i Stale Unwer\ity 


A simple scheme is proposed lo measure monopoly pricing hehavior. 
1 lie coelf'uient of the tav rale reriii in a price eqiialion idenlihes ihe 
raiio of pine to inarginal tost. No direct measurement of costs is 
I e(|uired. so a major piobleni for other empirical studies of moiio|>oly 
IS avoided. EmpiiKal resulls for cros.s-.seclion time-series data sup- 
poit lejectiori of atomistic competition but also provide evidence 
against the operation of an effective cartel in the cigarette industry. 
[ he model lepreseiit.s an alternative inteipretalion of related result.s 
m a I C( ent paper by Bai /el. .Application of the methodology to other 
mat kets is feasible. 


I. Introduction 

The cigarette indtistiy i.s often tisetl as an example of a le.ss than 
perfectly competitive industry. For exainjrle, in llie famous Tobacco 
Case of f946. the major cigarette matuifaclurcrs were accused of 
operating an illegal cartel; thongfi they were convicted, the consen.sus 
was that company behavior was not altered by the verdict.' This paper 

fills is a leviseit version ol "InleisUle Cigarellc Price Dilferenlials and die Mea- 
sureinent of Finn la-vel Demand Klaslkiiics” (()rlol>er 1979). I wish lo thank Yorain 
Bar/cl. Rolx-rl Claik, jarnes Kasley, David Hath, (ihailes KncK-ber, Michael Kusnii. 
fhornas Mat.'urdy, Michael Me FIroy, Ronald .Schiiinper, .Michael Wohlgenant, and an 
unknown lefeiee (or iheir loniineiils I his papei is no 6626 ol the |ouinal .Senes ol 
the Noilh t'.aiolina Agnculluial Reseanh Service. Raleigh. 

' See Nitholls (1949), Bain (1968). oi feiinaiil (197 I) toi ciiscussion of the I946case 
and of llie "strutluie, concliicl and pc-i lonnaiice" of the cigaielte industry. .Stigln 
(1966, p. 244) jneseius a table of indusiries with high concentration; cigarettes raid 
lounh in ihc share of assets held by the “giant firms.’’ Fncdman (t953, p. 37) also 

\Jtmnuil of PdUtwni httm/nny, I'WI, vtil 8‘/, f»t» 't] 

hv rhe UnivctAity t»f C:hk.n;t> fK>Z2-'l«<W/HI/8905-00(HI$0! '»0 
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attempts to investigate the alleged market power of cigarette man¬ 
ufacturers with a simple scheme for measuring the weighted-average 
price elasticity of demand facing firms in this industry. 

The empirical examination of the pricing behavior of the cigarette 
indu.stry finds an absence of evidence of an effective cartel, but the 
hypothesis of atomistic competition is also rejected. Ihe approach 
u.ses data on excise taxes, but it does not depend on measuring ihe 
cost of production. .Since reliable cost data are not generally available, 
this paper avoids an important weakness of directly measuring 
monopoly power by the ratio of product price to marginal cost. The 
model suggested here is also applicable to other industries with varia¬ 
tion in excise taxes or some other directly measurable components of 
marginal cost.® 


II. A Simple Model 

The equilibrium output condition, that marginal revenue equals 
marginal cost, implies that 

Pj = lvAv>+ mM,. ( 1 ) 

where Py is the product price, ATj is the marginal cost of production, 
and r)j is the price elasticity of the demand curve lacing firm ;. If the 
industry is purely inonopoli.stic, t)j is equal to the market elasticity t)'"; 
while if it is perfectly competitive. 7 )j = -» and is independent of tj'". 
T hus r)j is a measure of the degree to which the firm is not strictly <t 
ptice taker.* However, stnee good measurements of .W, are difficult ttt 
fittd, equation (1) does not offer a practical solution to the problem of 
determining potential monopoly power. 

For a product like cigarettes, which has a per unit excise tax, the 
marginal cost tet m may be expanded ,is 

Mj = Xj/3 + €j + R, ( 2 ) 


(tisfusses ptiteini.il monopoly power of rigaienc linns, prerlunii); lli.4l in .in .in.ilssis ol 
response Ui (ax race thaiiges. "bro.idly roirerc results will lie nbi.iineil b\ ireaniin 
lil^arcllc firms as if ihey weie pnxIiKiii); an uteinicai pitKluri .iiui were in [lei (eel 
competition." 

* I he empiric.il iikkIcI denved in this p.n)er is simil.ii to tli.it used lot a diHeicnt 
puipose by Bar/el (1976) (and also in a coiiimeiil b\ Johnson 11978]) Ihe piesr’iii 
paper ivas well under way when (he Barrel lesulls were pointed out to rue in tlh.nles 
Kitoeber. A paper by Manihcslci (1976) also pieseiits (in ait ap(x'iidi\) a regicssion 
related to lho.se eslimaied Ix-low, ffowevei, M.iiuhestei diws not iiitetptei his i.is t.ne 
coerficietils in that paper. 

^ Kor inultiprodiicl firiri.s, optiin.il priiinj{ decisions ie(|iiiie .itieiilioii to cioss elas- 
tieuics of demand. 1 will siiiiply ignore ihis reftnenienl and implunK assume that n is 
not empiruallv itnpotlanl. In what tollows I use die teim "hiin ' lo mean iho telcvani 
pricing decision maker for the given prixluct 



I<)l6 JOURNAL OF POLITICAL ECONOMY 

is captured by slate dummy variables and does not bias the coefficient 
of 

IV. Empirical Results and Interpretations 

The major empirical results of this study are reported in table 1. In 
this section, these results are presented and interpreted. A potential 
alternative interpretation suggesteil by Barzel (1976) is also consid¬ 
ered. 

The estimates reported are based on annual lax rates and j^rice data 
for the yeais 1954-78 for 47 states and the District ot Columbia. 
Alaska, Hawaii, and New Hampshire are excluded from the sample 
for the analysis repottetl in the table because of lack of data and 
peculiarities in their tax laws. (See n. 11 below for moie discussion of 
the New Hampshire case.) 

Infoimalion on cigarette prices and lax rates is collet led and jjub- 
lished Ity the Tobacco fax Council. The “weighte<l-avei age price per 
package" published foi each slate and year uses national weights for 
type of cigarette (tegular, king, 100 mtn) anti for type of transaction 
(c.ntfin, single pack, machine). VVhole.saleis arttiuid the countly are 
asked to provide estimates of the average retail price in their area for 
each type of cigarette and type of sale. These numbers ate turned into 
state averages by the couittil. (This information is fiom corre- 
sitondence with the very helpful jtersonnel at the Tobacco I'ax Coun- 

cil.) 

T.iltle I sliows estimates of the linear model of etpiation (4) untler 
thiee alternative specifications of the .V,, vector and the tlisturbance 
term. The table reports the estimated 6 and, to aid in interpretation, 
the intplied price elasticity, tj. In row 1, X„ consists of a dummy 
vai iable for each state and for each year. The tax rale elfett in this 
row is 1.074, which savs, lor example, that a 10 tent difference in lax 
rates causes a 10.74 cent iutiease in price. The coefficient is 
significantly abttve 1.0 (standard eiior = 0.013), whith is supjjorl for 
lejecimg the hypothesis ctf atomistic competition in this industry. 
However, the implied price elasticity is -13.5, which indicates a 
lather Hat demantl curve. Row 2 rejilaces year dummies with the 
National (Consumer Trice Index and drojjs tfie slate dummy variables, 
while row 3 shows estimates Irtim an ajjjtlication of a variance- 
comjjonents model with the price inde.x as the only additional cxjjlan- 
atory variable. The estimates ot 0 for these rows are 1,029 and 1.069, 
and both are significantly above 1.0. Other sjjecifications of .Y„ and f/„ 
yielded estimates of d similar to tlutse reported. 

"This basic model was also estimated separately for each year using 
states as the unit of ob.scrvalion. The estimates of 0 were generally in 
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l ABLE I 


Esiima i ks ok tuk Ekkect of the Tax Rate on Cioarmte Prk e in Ai ieknaiive 
SKEr.lHCAITONS OK IHE I.INEAR MoIKK.L (E(], [4]) 


Estiin.Tied 0 

Implied Price 
Elasticity rj 

Other Variables 
Included 

Method 

1.074 

-13 5 

r)y,j)s, 

Ol s 

(.013) 

1 029 

34 5 

NC.PI 

OLS 

(Oil) 

1.069 

-14.5 

NCPI 

vet 

(.013) 





Nou — R*'i> in each of ilirsr w<-rc hif>h. ihosc wiih (iuiiiiii> v.inabk's were a^Miui and ihos** 

widiout thr (luTiinnc-s wt-iv arnim<l ‘J'i I here w*-i*? I,‘200 obsei vaiion% (25 v<'at-<« lor t ach of 48 stales) SKs in 
|>a renihrsrs 

• Dy, IS a fliiinmv \ai lablc f<ir ea« li yrar wilh 107M rx( ludfd./>S, is a duiniiiy variable for eai fi stale vsitfi W'voniinn 
rxiluded, and N(-V*I is iVir iialioiul coiisuirier price index t<ir each vear 

t In (Ins ecpiaiion a v<«ri.in( e-< oiiiponeiil (VC.) crrol scheme with an individual laiidoin rlieci for each stale and 
each year w.is used See fuller and Haltcsc (1974) for a discussion 


the range reported in table I for the pooled model and were not 
sigtiifiranily different from one another. No trend in these estimates 
was noted. 'I'iinc-set ies cliattges in the industry do not seeiti to have 
caused biases in the estimates in table I.** 

Two further comments should be made in relation to these results. 
First, these estimates ate not consistent with an effective cartel in the 
cigarette industiy. I he firm-level elasticity estimates are well below 
the range of the industry dematid elasticity estimates usuallv fotind 
for cigarettes. ( The.se are generally between -0.3 and -0.8.)"’ Second, 
the effect of monopoly petwer as a source of price variation tn the 
industry is small compared with tax late differences and other cost 
differences over time and space. 


“ III tilling ihc nonlincai spfcificalioas presfiilcci in ii. 7 .ilxne. ihc Ii\ pcdhesis dial X 
— I IS not ifjfcictl. I'oi (■!') A — 0.972 wilh SK - 0.027, and <ai (4") A — 0 981 wjdi St' = 
0,020 These tcsls isupport the appropriateness ofihe linear mixiel lesnli-i piestnit^d in 
(able 1. The esliiiiaie nt y hoin ( T) is I 029, and liuiii {4") ii is 1 062. I hrst*. togt ihei 
wilh the A estimaie.s alxne. imply values ol B in the lange ot those in lahlc 1 or NlighiK 
lower. This supports the hypothesis dial while pciletlly competitise pi mug liehaiior is 
rejetted, monopoly powei is nol a niajoi fac lor in dctennintng cigai ette pi k e dil fei cn- 
iials. The hypoihe.sis that intci.siaie smuggling would imply ih.ii piitc is a miu.ne 
liimtion oi ia\ laie, as discussed in n. 6 aliovc, is iu>t supported by these lesnlis. .As a 
liiilher nonlincai specihiation, a .squaied i.ix rate leim was .iddcd to the sep.iraie 
yearly [itke equations In most years the \aiiable ha<l an msigmiuani etieil on 

piite. Ill onlv a lew yeais out of 25 weic the <iK'Hi<ienis loi positive and (/^„)* 
ncgaiivc. 

'® In related work (Sumner 1979), using cioss-section tune-senes data loi the last 10 
years, which alleinpts to indirectly qiiantily the extent ol inleistale iigaielte Ixxideg- 
ging (“butllcggmg ”), I bud the national industiy-Un'el cXAsin.iw ol demand for ngaieiies 
to l)e approximately -0.40. This is in the langc ofothei estimates lioni time-series 
data. 
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Barzel (1976) has suggested an alternative interpretation for the 
coefficient on in the pricing equation. He argues that a higher per 
unit tax on a good will induce a higher quality per unit to be sold. 
Since the relative price of quality is lower in high-tax areas, substitu¬ 
tion of quality for quantity will occur. The after-tax price per unit will 
be higher in a higher tax rate area to reflect the higher quality of the 
product. He also notes that ad valorem taxes fixed as a percentage of 
retail price (as was the practice in New Hampshire prior to 1975) 
would not have this effect. 

As Barzel points out, products actually traded are combinations of 
many characteristics, most of which are extremely difficult for either 
the government or the empirical economist to measure, d’hus, when 
Barzel argues that the quality of cigarettes varies positively with the 
tax, no direct measurement of quality changes could reject that hy¬ 
pothesis. Barzel turns instead to price data on cigarettes across years 
and states as published by the Tobacco Tax Council. ITe nature of 
these data is crucial. As noted above, the price data available for 
cigarettes have the most obvious quality variations (type of sale and 
type of cigarette) held fixed across states. Furthermore, given cen- 
iralized manufacturing and national distribution of brands, there is 
little scope for direct product-quality variation. This leads one to 
question the strength of the expected Barzel effect in these data even 
if it were present iti the real world. Hence it would appear that the 
“elasticity of demand” interpretation of 0 is a serious contender with 
Barzel’s quality shifter for explaining a 6 coefficient greater than 1.0. 
Finally, note that the Barzel effect would cause an upward bias in the 
measurement of monopoly power so that there is an even stronger 
case tor concluding that the cigarette industry does not act as a 
(at tel." 


" As fuilhcr (onsicleiation ot ihe strength of the Bar/el ctfeil in (igareltes, some 
esliin.nion w.is done whuh included data front New Hampshire with the other slates. 
These results were es.sentially identical to those refxrrted in the table. New Hanapshire, 
whit h had an ad v.ilorein cigarette ctx until 1975, was found to have a strongly negattve 
slate dummy variable. Howevet, it may be the ca.se that New Hampshire has a low price 
ot (igaretles net of taxes not Ijccausc its tax was ad valorem as suggested by Barzel but 
foi the same reason that the per < apita sales of cigarettes in New Hampshire are more 
than double the average of the other New F.ngland states. That is, residents of nearby 
states buy a signifitant portion of their cigarettes in New Hampshire. Hiven size 
economics for sales transaetjons, a high pioportion of this interstate trade would tend 
to icduie the average price net of taxes in New Hampshire. Barzcl’s ad valorem tax 
hypolliesis implies that the special po.sitioii of New Hampshire would have evaporated 
dllci 1975 (when a fixed per unit tax replated the ad valorem tax). The transaction cost 
hypothesis (among otheis) predicts that thcie will be no effect of the 1975 law change. I 
found that the New Hampshire residuals (tom the linear model estimated for each year 
arc just as negative after 1975 as before 1975. which indicates that the ad valorem tax 
was not the different New Hampshire characteristic. This is not to suggest that the 
Batzel effect is not theoreucally sound but, rather, that interstate cigarette prices may 
not lie useful to lest the theory. 
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The statistical results presented in table 1 support rejection ot the 
hypothesis of atomistic competition in the cigarette industry. The 
demand curves facing cigarette firms are not Hat but are close to it by 
most standards. Taking 6 = 1.05, a 1 percent rise in the price of a 
firm’s product would cause a 20 percent fall in the quantity sold. Jf the 
cigarette prices are above marginal cost by 5 percent, monopoly 
power adds approximately 3 cents to the average price of a package of 
cigarettes. 

This paper has proposed and demonstrated the applicability of a 
scheme for measuring monopoly behavior that may be relevant to 
several different industries. For example, application to the U.S. 
ga.soline market is currently being pursued. Data requirements are 
information on product price and some factor that has a directly 
measurable per unit cost. Variation over lime or space or some other 
dimension is required for statistical implementation. 
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Swedish Tax Rates, Labor Supply, 
and Tax Revenues 


Charles E. Stuart 

Unij’t'nity of CnUfornia, Santa liai^bara anti UnwerMiy 0/ I.wul 


Kllooive rriaifjriii.il l.ix r.tifs on labor iiu'otnc Irjt ihc “rfprr’SfnIalivt’’’ 
Swede ha\e me le.ised I10111 roiijrlily TiO peHoiu in 1959 lo 80 perccnl 
today. I lie effects of ifiis iiicioasc in the level of tav.ition ate exam- 
iiicd vising a two-scelor model pai .»mctcri/.eii 10 tovvespontl to vlie 
Swedish etonotny. l lic model contains a single household which 
• illoc.ites lahoi toeitliei t.ixc’d (essenlially 111.11 ke() 01 iiiilaxed (largely 
limtsehold) uses The estimated long-i cm elletts .n e siifhcient 10 
expl.nn tip to 75 percent of the recent decline m the measured 
gtowth rate of the Swedish CINP. C.alculations cil total tax revenues 
are <ilso deiived frotii the model These peak, when the t.tx rate is 
.ippioxiinately 70 peicenl, indn.itiiig lh.it .Sweden is piesenlly on the 
downw.iid-sloping poitioii of its “l.alTer Curve.'’ 


'l'hi.s paper consitlets the slateinent that higli taxes can have seriou.s 
consecjuences. Cliven the imponance of the issue and the attention 
paid it in infoitnttl disctissioiis, it is siii prising that little toimal analy¬ 
sis hits heen diiected at it. IFic presetit tretitnient fbciiscs on the 
■Svvedi.sh expeiience. This i.s a ca.se ol (urient events steeling an 
economist’s eye. The growth of the Swedish CiNP lias slowed consid¬ 
erably in recent years, falling from 4.4 percent per year during 
1962-67, to 2.9 percent during 1967-72, to 1.4 percent during 
1972—77. At the same time, tax revenues as a portion of GNP have 

Th.iiiks .ire due to .Syed Ahsjii. Mil hael Boido, Geoff tiarliiier, ]iigciii.ir Ilaii.ssori, 
Chiisiiiia Joniiiig, l 4 |oi ii I.iiulgren, (an.ill Noi 111.11111, Gor.in Skogh, and .111 .inoiiyinous 
icletce Csefiil coinrncnls weie prondecf hy patlictp.iiii.s of seminais w'hich were held 
ai (iailctciii University, Oitawd, Concordia College, Monircal, and in Lund and 
Upp.sala. I'lic research support of the Bank of Sweden rerceiileii.ary Toundation is 
gralefully ,«knowledgccl. 
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increased from 0.388 in 1962 to 0.627 in 1977.* While the exercise of 
presenting these numbers proves nothing, it does indicate that a study 
of the allocational effects of high taxes may provide insight into the 
faltering of the Swedish economy. 

There are many possible explanations of the Swedish economic 
stagnation, and the goal here is not to test competing explanations 
against each other. Instead, the strategy is to concentrate on house¬ 
hold willingness to supply labor to a taxed (market) sector and to ask 
whether changes in the tax rate might be expected to have significant 
long-run effects on the equilibrium quantity of labor supplied. The 
first major conclusion of this paper is that the calculated “supply-side” 
effects of high taxes are of sufficient magnitude to at least partially 
explain the observed stagnation. 

A useful summary statistic for the effect of the level of taxation on 
labor supply is the (marginal) tax rate which maximizes total tax 
revenues (or, iti more popular terms, the rate whicfi yields a 
maximum of the “[.after Curve” for labor). The usefulness of this 
rate—hereafter clenoted t *—is that as taxes increase and t* is ap¬ 
proached, the marginal social cost of additional tax revenues becomes 
infinite. While the socially optimal tax rate cannot be determined 
without regaid to a welfare criterion, rather extreme assumptions 
must be made about the nature of social welfaic in order to imply an 
optimal tax rate greater than /*, since reductions in the tax rate from 
levels exceeding t* would increase both the worth of output and tax 
revenues.** Values of I* are calculateil here; depending on the as¬ 
sumptions used, the best estimates of t* lie in the inletval 69-73 
percent. This leads to the second major conclusion of this paper, that 
actual tax rates in Sweden are in excess of t*. The aggregate marginal 
tax rate (defined below) in Sweden reached the 70 percent level in the 
early 1970.S and is slightly in excess of 80 percent today. 

The vehicle of analysis is a two-sector (Harberger) model where an 
aggregate household allocates labor to either a taxed or an untaxed 
sector.** I'he former corresponds closely to the legal, market portion 
of the economy and is typically represented in national income ac¬ 
counts. The latter includes home production and activities yielding 
undeclared income (illicit tax evasion); leisure may also be included. 
The advantage of such a general equilibrium framewoik is that it 

' These iMilude direct .itul indirect l.ixes, scxial iiisui.incc pieiiiiuiiis, and esi.ite .ind 
gill taxes. 

’ See C'oolcr and Helpman (1974) for a discussion of optimal taxation under dil lei - 
cut welfare liintiions. 

^ A similar sectorial division was used by Boskiii (1975) .As in Harberger’s (1962) 
tre.tlinent, Boskin’s analysis is not based on an explicit ulilny tiincliun; dein.ind elas¬ 
ticities ate instead taken as parameters. The piesem model does explititlv siiecitc a 
utility fiinc lion; in this lesjjet t, it is similar to the model of Shoven and Whallec (1972), 
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yields a clear picture of how taxation affects the relative benefits of 
different uses of labor and how this in turn impinges on labor supply 
and output. An alternative and simpler approach would be to inter¬ 
pret econometric estimates of labor-supply elasticities as reflecting the 
net-wage sensitivity of labor supplied to taxed uses and to assume that 
any tax increase leads to an exactly corresponding reduction in the 
net wage of taxed labor. However, this alternative approach suffers 
from several weaknesses: (I) it is partial; (2) it would in practice be 
based on constant elasticity estimates, whereas (as will be seen below) 
labor-supply elasticities should vary with the marginal tax rate; (3) it 
does not yield estimates of output under different tax rates and thus 
makes simple welfare comparisons difficult; and (4) indeed, the set of 
activities measured in traditional labor-supply studies does not coin¬ 
cide exactly with the set ol taxed uses of labor. Despite this care in the 
choice of a general equilibrium framework, the present study should 
still be regarded as more of a first analysis than a final answer to the 
problem of the supply-side effects of taxation. Perhaps the two most 
severe limitations of the analysis are the highly aggregated treatment 
of untaxed uses ol labor and the relative lack of attention given capital 
accutnulation. 

In the next section the model is formulated and parameterized. 
Results are presented in Section II. Section III is concerned with the 
predictive power of the results. (Concluding remarks are made in 
.Section IV. 

I. The Model 

,4. Structure 

(Consider a highly aggregated economy with two sectors, a taxed 
sector, in which employed labor is subject to a tax, and an untaxed 
.sector, where supplied labor is not taxed. One may think of the 
untaxed sector as involving not only illicit tax evasion via undeclared 
exchange but also fully legal forms of tax evasion, such as painting 
one’s own home instead of hiring a professional painter, cooking 
fancy meals instead of eating out at a fine restaurant, repairing an 
automobile oneself, sewing clothes, watching children, cleaning the 
house instead of hiring domestic help, etc. It is also possible to inter¬ 
pret leisur e as an output of the untaxed sector.* The untaxed sector 
thus consists largely of home production activities. On the other hand, 
the taxed sector is that part of the economy which is traditionally 
treated in national income accounts. 

* A irioie detailed model of home protiuction, leisure, and market work is contained 
in (ironau (1977). 
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Each of these sectors is represented by a Cobb-Douglas production 
function, a choice motivated by the observed statistical stability of 
these functions (see Douglas 1976). Inciting E, and Ej denote the 
outputs of the taxed and untaxed sectors, respectively, and letting/., 
and Lj signify the corresponding uses of labor in these sectors, the 
production functions are; 

E, (la) 

E2 = BL^ (lb) 

where A, a, B, and h are parameters. Formally, capital is subsumed in 
the coefficients A and B, while a and 6 represent labor’s shares in the 
two sectors. This formulation implicitly assumes perfectly inelastic 
supplies of capital to both sectors so that capital remains constant 
during the simulations. Aside from simplifying the analysis, such a 
treatment avoids the difficult empirical problem of estimating the 
capital stock in the untaxed sector. In addition, relaxing the constant 
capital assumption—by letting the supply of capital to either sector 
depend positively on sectorial net return—would cause the calculated 
value of i* to fall and hence strengthen the conclusion that the 
aggregate marginal tax rate in Sweden today exceeds t*. This is 
because an increase in the tax rate on lalx>r income would then give 
rise to the following additional negative feedback on the tax base: As 
the amount of taxed-sector labor fell, capital's marginal product in 
taxed production would also fall, decreasing the demand and hence 
equilibrium quantity ol taxed-sector capital, thereby reducing the 
gross wage obtained in the taxed sector. The result is that the taxed 
sector’s wage bill would be lower (than it would have been with fixed 
capital).* 

Household choice dictates the allocathm of labor to the taxed and 
untaxed sectors; the household supply constraint for laljor is 

L, +/.2=L. (2) 

For simplicity, all taxes are treated as impinging on households. 

The problem of the household—and again the analysis is simplified 
by considering the behavior of a single, aggregate household—is to 


‘ An additional .source of erosion in the tax base for labor income can rxrcur if t .ipiial 
is variable and if the lax under consideration falls on capit.il as well .is labor income. 
Ibis cxtiirs with income taxes Here, an inctcase in the lax rale will cause an iiicioase 
(or no decrease) in the purchase price of capital, and Ic.ss capital will lx- used in ihc 
taxed .sector. This in tuin reduces the taxed sector's wage bill. T he net result is that tax 
revenues must peak at a lower lax rale when capiial is taxed than when it is iiiitaxcxi. 
Again, the assumption of fixed capital leads to the '‘correct'’ bias, relaxing it while 
simultaneously allowing taxation of capital can only strengthen the lonc lns.on that ihe 
aggregate marginal tax rate in Sweden today exceeds I* 
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maximize a utility function dependent on K, and Y^. Two alternative 
forms for utility are proposed: 

t/ = K, + Y^, (3a) 

U = (3b) 

The former encompasses the hypothesis that the household is an in¬ 
come maximizer and that Y, and Y 2 are fierfect substitutes. The latter 
formulation allows imperfect substitutability between y, and Y^. 

From the point of view of the household, tax revenues are redis¬ 
tributed on a lump-sum basis, and capital incomes are obtained on the 
same basis. 'Fhe amount of K, consumed by the household is thus 

y, = (1 -/)a-/., (4) 

where / is the average fax rate, iv is the wage, R is redistributions, I is 
capital income, and where the output of the taxed sector is taken as 
numeraire. It is assumed that the government biulget balances in 
etpiilibriuin, R = twL„ and that capital’s share of y, is 1 — a, so I = 
(1 — rt)y,. These conditions guarantee that the amounts of yj protluced 
and consumed are equal. 1 he amount of hj consumed is taken as the 
amount produced, as no taxes are levied on or L 2 - 

riie first-order condition for a maximization of utility subject to 
(lb), (2), and (4) is 



where the left-hand side is marginal letentions of y, from a unit of 
l.ibor allocated to taxed production, given marginal lax rale and 
whcie the two teiins in parentheses on the right-hand side are the 
marginal r.ite of substitution and the marginal pioduct of labor in 
uniaxed production, respectively. The marginal rate of substitution is 
either one, for (3a), or 1(1 •- «)/«) • (Y,/Y 2 ), for (3b). I'he import of (.5) 
is that taxes ailed the household by alteiing the relative marginal 
benefits of labor allocated to the two sectors. 

rhe model is closed by assuming that the wage in the taxed sector 
is determined under competitive conditions: 

w = aAL1~'. (6) 


B. Parnmeienzation 

To use the framework presented above as a basis for simulations, it is 
necessary to obtain parameter estimates of A, a, B, h, and a, as well as 
to estimate the amount of labor available, L. A summary of the 
method of parameterization is pre.scnted here; a more detailed statis¬ 
tical discussion is contained in the Appendix. 
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The technique for parameterizing the model is to pick an appro¬ 
priate base year, choose units of measurement for labor so that the 
wage equals 1 during that year, and then to find parameters consis¬ 
tent with actual economic behavior in Sweden for that year. The 
choice of the base year for the study was 1969 as it was a year of 
“normal” business activity; that is, it was neither a peak nor a trough 
of a business cycle (1968 was a low, while a peak occurred in 
1970-71). 

First, a is taken as labor’s share of measured GNP (see Appendix, 
item 1). This yields a = 0.752, whence a knowledge of F„ /.,, and a 
allows A = 4.19 to be calculated in accordance with (la). 

Kstimating the shape of the untaxed sector’s production function is 
more difficult. Due to the long-term statistical stability of labor's share 
in Cobb-Douglas functions and the considciation that many of the 
commodities produced by one sector ran also be produced by the 
other, the naive assumption that b = a = 0.752 is used for most 
simulations. However, given the desirability of conducting sensitivity 
analysis, as well as the fact that 0.752 is somewhat greater than the 
values of labor’s share found by many other authors (for discussion, 
see Douglas [ 1976)), the alternative assumption that h = 0.5 is applied 
to two sets of simulations. 

Estimates of the atnoutif of titne spent tn uniaxcd production and 
the vahte of labor in untaxed ptoduction are derived from time- 
budget atid wage data, with the latter calculation based on a variant of 
the tnethod used by Nordhaus and Tobin (1972) in their work on 
“measures of economic wellare” (“MEW”s) (see Appendix, item 2). 
In most of the simulations, /-j and Y.^ arc defined as not including 
leisure; this implies L.^ — 102.68 and a value of uniaxcd labor of 
54.45.” In ihc remaining cases leisure is included, yielding = 
205.26 and a value of untaxed labor tif 105.14 for 1969. Aside from 
the obvious beneht of including leisure as an alternative use of 
household time, the exercise of running simulations on 1,.^ defined to 
include leisure has the advantage of providing a sensitivity test on the 
otheiwise fairly coarse estimates of untaxed production. Such a sen¬ 
sitivity test is particularly valuable, since estimated untaxed produc- 


® 1 lawi yiysViyii (1976) survev'* earlier eMiiricUcs ol ihe value of home pro<luLnon aiui 
discusses ihe appropriaieiiess of ihe iiieihocis usctl. He also lecalculaics e.iiliei esti¬ 
mates to put them on a eoinpai alile and more correct basis. I le (ondudes that the \aliic 
of labor in home productjoii is 35-40 pen eni ol GNP (see also Mui phv [ 1978] and 
Adler and Hawiylyshyn [19781 lor moie rceeiu csiimaies for ihc United States and 
Canada). The ratio of value of labor in home pro<lu( tion to (iNP in the present siiuh is 
40 percent. One might have expet icfl a highei value foi SvNcden in 1969, since gi eater 
taxes should imply relatively more lime devoted lo untaxed (home) pUMluction. Note, 
however, that these cai Uer stiulies use a 20 pcixeni tax rate in deriving after tax uages: 
the rates use<l in the picsent study arc 30-40 pcicent 
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lion is calculated as home production and thus fails to capture prt 
duction which is umaxed due to illegal tax evasion/ 

A knowledge of the amount of time spent in untaxed productic 
and of the value of labor in untaxed production is then used to find 
value of B for each case. The method is to solve 

, _ (value of labor in untaxed production) 

f'2 

for Y 2 and then to substitute into (lb). 

Throughout most of the simulations, the total amount of labc 
available to the household is assumed to be perfectly inelastic. Th 
value ofL is thus the sum ofL, andLj for 1969.1'his procedure migf 
be objected to since greater after-tax wages may create an inducemer 
for households to supply more labor. To evaluate the sensitivity of th 
results to this sort of effect (which might also arise if net immigratio 
depends on after-tax wages) as well as to treat an alternative, “ind 
reel" method of introducing leisure, several simulations are cor 
ducted using the specification 

L=cMl (8 

where c is chosen so that L = 205.26 when the 1969 values of w and / 
are inserted. This formulation assutnes a constant, compensated elas 
ticity ol supply of labor of 0.1.* 

Finally, data on faxes are needed to complete the parameterization 
To get a feel for how marginal taxes should be interpreted in th 
model, it is useful to think for the moment of the taxed and untaxei 
sectors as bcitig market and nonmarket sectors, respectively, and L 
ask. What taxes will an individual face if he operates in the marke 
sector? First, the individual sells his labor (Z.,). The employer pays 
wage ic. before (he employee receives his paycheck, payroll taxes ar 
drawn. Next, income taxes are paid and the individual is left with , 
.sum of money. This money is taken to a commodity market to 1> 
reconverted into goods and services. Indirect taxes are then drawn 
'Fhiis the tax rate relevant to the model includes payroll, income, am 

' Uiiloi lunaicly, no careful economy-wide estimates of the extent of illegal ta 
evasion exist for Sweden For the United Stales, ievent woik by Feige (1979, 19tt( 
suggests that ihe size ot the “monetary unobserved sector” (i.e., the volume ol unre 
[xirted transattions which use money as a medium of c.xchange) ha,s increased greatly ii 
lecent years. Feige calculates that the size of this .sector was roughly 5 percent o 
measured CINP in the early 1960s hut amounts to 25-30 percent of measured (IN 
today, with most of the growth taking place in the 19705. (In correspondence, Feig 
expi esses gieater confidence in his csumaies of growth rales for this sector than in th 
estimated absolute sizes of the sector.) 

* Actually the elasticity is almost compensated since (1) all tax revenues are returnei 
to the household on a lump-sum basis in the model, but (2) marginal increases in taxe 
may decrease national income slightly. 
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TABLE 1 

Swedish Tax Rates 1954-74 


027 


Year 


Average Tax (t) 


Marginal Tax (t') 


1954 

1959 

1964 

1969 

1974 


.271 

.419 

.321 

491 

.393 

583 

.468 

.655 

.557 

,764 


indirect taxes. In addition, certain government transfers are in¬ 
come-indexed, so a correction must be made for the marginal loss 
of subsidies due to increased labor (Li) income. The starting point for 
a calculation of these “total” tax rates is a study by Jaltobsson and 
Normann (1974), where the aggregate average and marginal income 
tax rates for the Swedish economy were estimated.® Table 1 contains 
calculated values of the taxes used in the model (for details, sec 
Appendix, item 3). 

On the basis of these rates, it is assumed that increases in tax rates 
satisfy 

t = 0.829<' - 0.075. (9) 

The slope of this function is calculated as the change in marginal tax 
rates divided by the change in average tax rates from 1954 to 1974. 
The constant (-0.075) adjusts so that (9) holds exactly for 1969. 

Four .scenarios (sets of simulations) are considered under the 
income-maximization hypothesis. Scenario A is the basic simulation; it 
has b = 0.752, leisure not included in L^, and constant /,. In scenario 
B, the value of b is reduced from 0.752 to 0.5 as a sensitivity test. 
Scenarios A and C are identical except that leisure is included in 1,2 in 
the latter. Finally, scenario D differs from scenario A only m that L is 
assumed to vary according to (8). 

It is interesting that if the parameter values for scenarios A-D and 
1969 values of L, and Z -2 are in.serted into the lirst-order condition for 
a maximization of income (eq. [5] with a unitary MRS), it is fouiul that 
the implied values of the marginal tax rate are signincantly less than 
the rate which actually prevailed in 1969. These implied rates are 
between 0.47 and 0.49, which approximate the marginal tax rates in 
1959-60. I’he interpretation is that if the income-maximizadon hy- 


’ This was done by formulating an aggregate lolal tax function based on lax l.ov .ind 
population (mainly income-distribution) data, simulating dilferent total income-total 
tax values, and regressing total taxes on total income. Rates lor 1974 weie suppliesl bs 
t). Nonnann. 
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I ABLE 2 

St tNAKios AND Parameter Values 


Scenario h B a 


L 


F.xplanatioii 


Income Maximi/ation 


A. 

B 

c. 

U 

7.52 

,5 

,752 

.752 

2 22 
10.75 
2 55 
2 22 


205 26 

205 20 
507.89 

220|n.(l - or- 

Basic 

Sensitivity analysis, f) 

Leisure inrludcH in Lj 

L depends on afler-lax 



Cobb*Douglas Utility Maximisation 

A‘. 

752 

2 22 

.744 

205 26 

Basic 

B', 

5 

10.75 

.6.5<} 

205.26 

Sensitivity analysis, b 

C 

.752 

2 55 

592 

307.89 

Leisure included in Lj 

D’ 

752 

2 22 

.744 

228(u-(l - /')]“■ 

L depends on alicr-lax wage 

E'. 

,752 

2,22 

,700 

205 26 

5-ycar adjuMincni lag 

N(Mk — 

li 15 .tsMinicH 

ih.ll a = 

(»75'.i .. 

iiul — 't 19 tliituighcuit 



pothcsis is correct, there was a 9-10-year time lag in the atijustinent of 
household behavior to lax lates. Such a long lime lag may seem 
unreasonahly high hut is not impossible. Attitudes toward labor mar¬ 
ket paiticipaiion and the “correct” amount of wcirk arc likely to 
change slowly, especially given the rigidity of social institutions such as 
the 8-hour <lay, the 40-hour week, etc. 

When the household is assiiiiied to maximize the Cobb-Douglas 
utility luiiciion (.'Ih), a parameter value fora must he specified. This 
value w.is chosen so as to he consistent with (or the root of) the 
first-order condition (5) for 1969 when all olhei parameters, en- 
clogenous variables, and the lax late were inserted. Five scenaiios are 
investigated for this utility function. The fiist four are similar to the 
four scenarios considered for income maximization. In the fifth case, 
it is assumed that a .5-year lag existed in the adjustment of /.| and to 
changes m the lax rale. I his is accomplished by letting the tax rale at 
which the moilel is parameterized be the marginal rate which oc¬ 
curred in 1964 (which, it may be noted, occupied a position in its 
business cycle similar to that of 1969). 

A summary of the .scenarios is given in table 2. 

II. Results 

Within the f ramework presented above, simulations are conducted by 
plugging various marginal tax rates into condition (5) and then .solv¬ 
ing numerically lor the values of /., and and, hence, also of K, and 
F^, which solve (2), (5), (6), and, when applicable, (8). Eejuation (9) is 
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Scenano and Marginal 'lax (%) 


%^Y, 

yjY, 

/* (%) 

A: 





80 

- 22 ..S 

-156 

10 2 


(i5 

-100 

- 7.41 

1 86 

46 

50 

- 4.63 

- 3.44 

.63 


B 





80 

- 19.0 

-14.6 

7 92 


65 

- 7 41 

- 5 64 

1 81 

51 

50 

- 3 12 

- 2.29 

.84 


C 





80 

-19.4 

- 15 3 

,17.7 


65 

10.57 

- 7.91 

3 26 

43 

.50 

- 5.69 

- 4 26 

1.10 


D: 





80 

- 19 8 

-1.5.7 

10 2 


65 

- 9 78 

- 7.45 

1 86 

46 

50 

- 4.73 

- 3.51 

6.3 


A'- 





80 

- 3.24 

- 2 25 

.80 


65 

- 1.46 

- 1.10 

52 

73 

50 

- 83 

65 

.40 


B' 





80 

3 23 

2 43 

1 13 


65 

- 1.36 

- 1 10 

79 

73 

50 

H5 

58 

64 


C 





80 

3 98 

3 00 

1 54 


65 

- 1 95 

1.39 

1 01 

69 

50 

- I 24 

- 87 

78 


D'- 





80 

3.68 

- 2 72 

80 


65 

- 1.75 

- 1 25 

52 

71 

50 

- .96 

76 

.40 


K' 





80 

- 3 51 

2 68 

.92 


65 

- 1.61 

1.19 

.60 

72 

50 

99 

- .69 

46 


N'tiiF —IVnemagc thangrt arc i.iit ulatcd using <iiltrieiKr% in 

naliiial log.itilhius toi lax i 

MU* 1114 1 ease's t)f 1 


(eg , MO pcuetii (o 81 }^rr<cm) 


then used to determine total tax revenues (i.e., to derive the “I.arfer 
Curve”). 

The result.s of the .simiilatiotis are siimmarii'cd in table 3. Since the 
model i.s parameterized using data from 1969, when the marginal tax 
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rate was 65.5 percent, simulations at tax rates significantly greater or 
less than 65.5 percent may be less reliable than simulations at rates 
close to 65.5 percent. For this reason, results are reported for the 65.5 
percent marginal tax rate, as well as for ±15 percent changes in 
marginal rates. Reported in the columns marked and 

“%AK|” are the percentage changes in L, and Y, induced by a 1 
percent increase in marginal taxes (i.e., from 80 percent to 81 per¬ 
cent, 65 percent to 66 percent, and 50 percent to 51 percent). The 
column Y^fY , gives the ratio of outputs of the two sectors. Finally, the 
tax rate which maximizes total tax revenues, t*, is presented. 

Several facets of the results stand out. First, the scenarios based on 
income maximization yield quite striking results. As tax rates increase 
from the 50 percent to 80 percent brackets, the ratio of untaxed to 
taxed output rises dramatically, as do the sensitivities of taxed-sector 
employment and output to small tax increases. The scenarios based 
on Cobb-Douglas utility maximization give somewhat less striking 
results, hut the general pattern is the same. For instance, a shift from 
50 percent to 80 percent taxes generally implies a dtiubling in the size 
of untaxed production relative to taxed production. 

An important aspect of the results is that, as tax rates rise in 1 
percent increments from 50 percent to 80 percent, the effects of 
successive increments on taxed-sector employment and output be¬ 
come increasingly larger. Part of the explanation is that the optimiz¬ 
ing household is interested in the rate of marginal retentions (1 minus 
the marginal tax tate), and marginal retentions fall by increasing 
percentages as marginal taxes rise incrementally. For example, if 
equations (5) and (6) are consulted and the marginal rate of substitu¬ 
tion is assumed constant (the income-maximization hypothesis), then 
a I percent lax increase around the 50 percent bracket requires a 
change in L , and L 2 such that the ratio of labor’s marginal products in 
the taxed and untaxed .sectors rises by 2 percent, but at an 80 percent 
tax rate, the same 1 percent tax rise requires a 5 percent increase in 
the ratio of labor’s tnarginal products. 

I he calculated values of C* are on the order of 50 percent under 
income maximization and 70 percent under Cobb-Douglas utility 
maximization. The sensitivity of I* around the 50 percent and 70 
percent figures is small, indicating a fair degree of robustness to the 
details of the parameterization of the model. The robustness of the 
calculated values of t* is enhanced by the fact that these values lie 
relatively near the tax rate which prevailed in 1969, the year used for 
parameterization. Further, the inclusion of leisure into L 2 leads to 
relatively small declines in I* of 3-4 percent from the basic scenarios 
(A and A'). 

The values of I* calculated here are for taxes on labor income alone. 
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However, there ate two reasons why increases in the aggregate tax 
rate used in the present analysis might also affect the extent of 
taxation of capital; (1) except for the payroll tax, all of the compo¬ 
nents of this tax rate also apply, at least partially, to income from 
capital; and (2) in attempting to finance a greater volume of public 
spending, it is likely that the government would utilize (or has 
utilized) a mix of tax increases for both labor and capital incomes 
instead of relying solely on increased taxation of labor. One might 
therefore ask. How would the calculated values of t* change if t* were 
redefined to imply a maximum of tax revenues from both labor and 
capital incomes? 

To investigate this issue, it is useful to retain for the moment the 
assumption that capital is fixed. In this context, a simultaneous in¬ 
crease in the tax rates on labor and capital will have a positive initial 
impact on revenues from capital taxation but a negative feedback 
effect since national income, and hence the amount of capital income 
subject to taxation, will decline. The following calculations indicate 
that the net effect on the previously calculated values of t* is likely 
to be negligible. In 1969, returns to capital were 33.8 (billions of 
kronor). Total taxes collected in the economy were 62.8, of which 48.0 
were levied on labor income. Of the lemainder, someihing like 14.0 
can be regarded as taxes on capital, whence the average tax rate on 
capital was 0.41 (= 14/33.8). From 1954 to 1974, the average tax rate 
on labor income rose by 0.286 (see table 1). This rise was partially due 
to a hike in the average income tax rate of 0.15 and to a rise of 0.02 in 
the indirect tax component of the rate on laboi income. One possible 
set of assumptions, then, is that the average tax rate on capital rose by 
0.17 and that each 1 percent increase in the average tax on labor thus 
“caused" a 0.6 percent (= 0.17/0.286) increase in the aveiage tax on 
capital, so that the tax on capital satisfied — 0.6t,, 0.13 (with 0.13 

chosen so the 1969 values, = 0.41 and I,, = 0.47, hold simultane¬ 
ously). Then, with capital in perfectly inelastic supply and capital’s 
share of F, equal to 0.248 (= 1 — 0.752): 

Total tax revenues = taxes on labor + (0.6t, -f- . 13)(0.248F,). 

Values for taxes on labor,/;,, and F, can be obtained from the simula¬ 
tions. It is then a simple matter to calculate the modified tax i.ites 
which maximize total tax revenues, including taxes on capital. For 
scenarios A-D and A'-F,', these are, respectively, 0.45, 0.50, 0.41, 
0.44, 0.72, 0.72, 0.68, 0.70, and 0.71. Correcting for taxes on capital 
in this way thus reduces the table 3 estimates of/* by abtiut 0.01. 

Of course, it might be argued that the relationship between tax 
rates a.s.sumed in this formulation (/« = 0.6/,, - 1 - 0.13) understates the 
tendency for taxes on capital to rise where the rate on labor income 
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goes up, and that ihis yields an overly pessimistic view of the ability of 
the government to increase tax revenues. Perhaps a more plausible 
formulation would be that and lie in constant proportion to each 
other, = /3<*. (and, say,/3 = 0.47/0.41). In this case, observe that: 


Total tax revenues = li/iY, + /^(l “ 


tiX 




and a maximum of t/ K, must imply simultaneous maxima for both 
taxes on labor (/,/iP,) and for total lax revenues. 

.'\llowing in this way for capital taxation but keeping capital fixed 
thus has little or no effect on the calculated values oft*. What then is 
the effect of relaxing the a.ssumplion that capital is fixed? There arc- 
two reasons why this must cause the calculated values of/* to fall. 
Fit St, if the total supply tif capital to the economy depends positively 
on capital’s net return—-as the work of Boskin (1978) suggests—then 
iiK leased taxes on capital will reduce the amount of capital in the 
economy, causing a decline in income and resultant declines in tax 
revenues fiom both capital and labor incomes. Second, to the extent 
that the taxed sectors for labor and capital coincide, a ri.se in the rate 
on capital will drive capital out ol the taxed .sectoi', and again taxed 
sector income, and hence tax revenues from both capital and labor-, 
will fall. 


III. Does the Model Predict Well? 

It (inns out that the simulations based on (lobb-l)ouglas utility 
maximiyaiion predict fairly well the actual bebavioi of output anri 
employment during the period of rising taxes in Sweden. The income 
inaximi/aiion hypothesis, on the other hand, leads to predicted de¬ 
clines in output and employinent which greatly exceed actual de¬ 
clines. This allows a tentative rejection of the income maximisation 
hypothesis. In what follows, rough calculations of the effects of recent 
inci eases in marginal taxes in .Sweden are made under the assumption 
of (lobb-l)ouglas utility maximi/aiitm. 1 hese prediclions are then 
compared with the actual development of output and employmem. 

First, it was noted at the outset that yeaily growth rates declined 
from 4.4 peicent in the early and mici- 1960s to 1.4 percent in the mid- 
and late 1970s. This amounts to a decline in the growth rate of I) 
peicent pet year. Over this period, marginal tax t ates rose an average 
of something like 1..5 percent per year, increasing from around 55 
percent to upward of 80 percent. In 1972, the marginal lax rate was 
70-71 percent. A calculation of %Ai'i at a 70 percent rate yields 
values of between 1.4 percent and 1.7 percent, depending on the 
scenario chosen. Applying a figure of 1.5 percent to the entire period 
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1972-77 (a procedure which effectively assumes a slight time lag in 
the adjustment of Z., to changes in tax rates) implies a 2.25 percent 
output decline per year (i.e., the 1.5 percent yearly increase in tax 
rates x the 1.5 percent decline in output per percent increase in 
taxes). This calculated yearly output decline due to rising taxes 
amounts to 75 percent of the 3 percent fall in the growth rate.'® 

The model also predicts a decline in employment. During the 
period 1964-69, marginal taxes rose from 58.3 percent to 65.5 p>er- 
cent, which implies average yearly reductions in L, of between 1.8 
percent and 2.5 percent, depending on (he .scenario, it there was no 
adjustment lag. From 1964 to 1969, the number of hours worked 
divided by the population of working age (16-74) actually fell at a 1.6 
percent yearly rate. This is a fairly good lit. Moreover, a Icgi.slated 
reduction in the workweek was effected in 1970-72, and this may be 
interpreted as partially representing an adjustment to the tax rates in 
effect in the late 1960s. 

During the period 1969-77, the ratio of hours woiked divided by 
the population of working age fell roughly 7 percent. T he increase in 
marginal taxes which occurred during this period should, according 
to the simulations, have led to leductions in employment significantly 
in excess of this figure. While there is no leason to believe that the 
economy was particularly lax in adjusting to its et)uilibrium level ot 
employment in 1969 (indeed, there was net immigration throughout 
the late 1960s), there are two rea.sons for supposing such a lag 
existed iti the 1970s. First, the very size of the decline in Z, predicted 
from scenarios A'-F' (on the order of 30 percent) makes it unlikely 
that full adjustment could occur in a time span of less than 10 years. 
.Second, a host of governmental and government-sponsored job- 
support programs, as well as a strict job-tenure law, were instituted 
and expanded in the 1970s. Assume then that, as of 1977, the econ¬ 
omy was adjusted to a 74 percent tax rate (the rate as of about 1973). 
Depending on the scenaiio, this should have Ic'd to a 1.5—20 peicent 
decline in L^ from 1969 to 1977. Since measured unemployment 
increased only slightly, this leaves roughly a 10 percent discrepancy 
between measured and predicted employment in the taxed sector as 
of 1977. Given the afoiementioned job-support progiains and the 

A prcilicuoii of lutiiic growth i jles iii Sweden tail he tlenterl A.vMime ihat ihe 
inaiginal tax i.iic in cdetl itKi.iy, atxnit 80 peitenl. leinains toiisiaiil and ihai ihe 
economy is prcseiul\' aiijiistcd to a tax r.ttc ol 75 jjeixent. this asstiiiies .i 5-vear 
adjiistmeiil l.ig Coiiipaiiiig output levels at 75 pertenl and 8(1 peiieni in.trgin.il l.ix 
tales, sonicthing like an I 1 pciceiit output decline (c I [leiceni, de|x'nding on the 
scenario) is lo be expexted .Spreading tliis oul over the next 5 tears. Swedes c.in .iw.iii 
an avciage giowlh laie of loughlv 1.5 ireueni over the next 5 le.irs (4.4. less the 
unexplained 0.75, less 11/5), wi(h ycailv giowih talcs lu the tteighiMii hixKt ol 3,5-4.0 
percent iheieafier. 
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job-tenure law, the most reasonable use of this 10 percent figure is to 
interpret it as an estimate of the increase in the level of “hidden 
unemployment” (including on-the-job leisure) in the taxed sector of 
the Swedish economy. Since officially measured unemployment is 
typically only 0.5-2 percent, a 10 percent hidden unemployment 
figure is not unreasonable. 


IV. Concluding Remarks 

The analysis here suggests that high taxes on labor income can have 
quite significant effects on the sale of labor in taxed markets when 
alternative untaxed uses of labor exist. In this way, taxes can and 
probably have alfected the recent growth performance of the Swedish 
economy. Further, the treatment permits the conclusion that total tax 
revenues in Sweden would be maximized at a tax rate of roughly 70 
percent, which is 10 percent less than the rate prevailing today.” 

It is important that the rather severe effects of tax increases above 
some critical level (70 percent?) may be associated with adjustment 
lags, at least if the tax increases occur over a short period of time. 
Social institutions such as the 40-hour workweek and attitudes toward 
work and labor market participation act as a brake on the adjustment 
process. Similarly, altitudes and effort devoted toward tax evasion 
(both legal and illicit) change slowly. Measuring these is perhaps as 
much the task ot the sociologist as the economist. I will refrain from 
making subjective judgments about whether I think attitudes have 
changed in the last 10 years in .Sweden. Ihere are a number of 
indications of pressure on the institutions, however; Legislated 
maternity leaves have been increased; the yearly vacation has been 
increased from 4 to 5 weeks; there arc increasing demands for part- 
time jobs;*'^ and an official commission recently investigated the 
feasibility of a 30-hour workweek, citing that “the long-run goal [of 


li hoiiseliolds .ire assumed to maxiitii/e a cotislaiit ela.stn ity of substitution (CE.S) 
utility function with ati elasticity of substitution of 0..5, and the assumptions of scenario 
A' arc used to ticrivc parameter estimates, the calculated value of <• is 82 percent. This 
IS close to the present tax rate. Decreasing; the elasticity of stilistitution from 1 (as m the 
Cobb-Uouglas case) to 0.5 takes away about half of the explanatory power of rising 
taxes as a determinant of growth rates. It also implies a simulated yearly dec line in 
fioiii l!)64 to 19fi9 of 1.1 percent (the actual itite was 1.6 percent), and implies a 9.6 
percent diop in /,, from 1969 to 1977. 

“ A siii^ey condut ted by the university graduates labor union in 1978 and publi.shed 
in the union paper (SACO-SR tidning, March 5, 1979) asked, "Do you wish to work part 
timet” fhiuy-onc percent answeicd yes. .Since 1975, the i>ercentage of men answering 
yes lose from 14 percent to 27 percent. A thiid of all M.D.’s answered yes. Sixty percent 
of the members surveyed preferred a shorter workweek to increased pay. Forty-four 
percent of the women wanted a 6-hour day, and 43 percent of the women with full-ume 
jobs wanted to work part ume. 
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the labor unions] is a 30-hour workweek” (Statens Offentliga Utred- 
ningar 1976, p. 9).'* 


Appendix 

A more detailed description of the parameterization of the model is pre¬ 
sented here. Unle.ss stated otherwise, figures apply to 1969. 

1. GNP at factor prices was 136.4 billion kronor, whence Y, = 136.4. 
Wages were 92.07 (in billions of kronor). In addition, there was an operating 
surplus of 39.7, some of whkh is returns to laixtt. I'o estimate how much, 
note that wages paid by industries and households were 62.7. Note also that 
hours worked by entrepreneurs and employees were 761 and 4,518 million, 
respectively. The assumption that entrepreneurs have the same wage as 
employees leads to an estimate of the implicit wages ol entrepreneurs of 62.7 
X (761/4,518) = 10.56. Thus/,, = 92.07 + 10.56 = 102.63 and « = 102.63/ 
1.36.4 = 0.752. 

2. Unfortunately, the only household time-budget study available was for 
1976, Activities included in untaxed production are cooking, child care, 
clothing lepair, care of pets, rare of dwelling, cate of cars, boats, and bikes, 
care of garden, and other (cleaning, washing, dishes, purchasing, etc.). Time 
spent in these activities averaged 672 minutes/day. .Since the population 
iticludcd in the study conststed of families with children living in apartment 
buildings, and since the study was conducted duiitig winter months when 
tare of dwelling, garden, etc., are likely to be seasonally low, 20 mimites/day 
was added to time spent in untaxed activities to seasonally adjust the data. 
Work (and study) look 642 ininuies/day. This implies that/,: = (692/642)L, = 
1.08/,, = 110.84. However, taxes were greater in 1976 than in 1969, so the 
estimate of/-j for 1969 was that it just equalled /,,, that is, /.^ = 102.63. 

This estimate of does not include leisure. To include leisure, note that 
647 minutes/day were spent by the average household in the study in recre¬ 
ation and relaxation in 1976. On this basis, was taken as exactly twice /., 
(i.e., /,2 = 205.26) when leisure was included. 

The value of untaxed production was estimated by 

S njlihrjwT + h’fi'f). 

where ri) is the number of households of type j. hi) is the number of hours/ 
year spent by males in household j at activity t, lej" is the aftcr-lax male wage 
for activity i, and /tfj and tvfare similarly defined times and wages for females. 
Six household types were considered: with one adult and 0. 1, or S' 2 
children, and with two (or more) adults and 0, 1, or s 2 children. The basic 
data on the n ^ were taken from the \9H) Population and Housing Census part 9. 


In addition, all of the miijor politic.il p,trties in Sweden except the conseiv.uivcs 
(with roughly 20 jiercent of the vote) support a 6-hour working day, the consetcantes 
state only that the matter should be decided in union-management negotiations. 
One might ask. How does a 30-hour workweek fit with the predictions of the model? .\ 
shift from 40 to 30 hours/week is a 29 percent reduction, file at tii.il reduction m.iy Iw 
somewhat less if such a change is Icgislatetl, however, since lalxir-rnarkei pai ticipalion 
might rise (it has risen historically during .such changes). Thus something like a 20 
percent decline m taxed-sector labor would tKcur, This is almost exactly the decline 
needed to restoie equilibrium at the 80 percent tax rates prevailing KkI.iv gisen the 
logic of scenarios A'-F,' (cf. Sec. HI). 
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An adjustment was made for the fact that roughly 30 percent of the persons 
reported to be single adults living with chiltlren are actually also living with 
another adult (lor tax and subsidy reasons, it is advantageous to report in this 
way—the source for the 30 percent figure is the time-budget study, Kon- 
sumenlveikct 1977, p, 30). 

Wage data were taken frtnn Wages, 1969. The relevant tax rate is the 
maiginal income tax rate plus a correction for income-based subsidies. For 
1909, these were 0.478 and 0.052, respectively (for discussion, see Appendix, 
item 3 in this study). However, application ol a 53 percent tax rate is likely to 
overestimate the taxes which would be paid on marginal income, due to the 
lacf that iiidny ot the itxlividuals devoting large amounts ol time to home 
production have low incomes and hence lower maiginal taxes than the aver- 
.igo income eainer. A 40 percent lax rate was thus applied to all untaxed 
at tivities except care of pets, cate of garden, and other—a 30 percent rate was 
used lor these 

rhe following assumptions were used to estimate hours spent in unlaxcd 
production. (1) 1 he two-adult, childless household was consideied to have the 
same uses of tune in imtaxed production as a two-adult, one-child (ages 4-6) 
householtl except that lime spent in child caie was set at /cro. (2) fimes in 
unlaxed prodiK tion lor a one-adult, one-child householtl were 0.94 times the 
hoin s lor a woman in a two-adult, one-chilii (ages 4-6) bouseholi). I'he figure 
0.94 denves ftoni the (onsideralion that a single patent is likely to spend 
more time in m<u ket work than is an average parent in a two-adult householtl. 
(3) 1 he single-adult, no-t hild household was assumed to have the same uses 
ol lime .IS the aver.ige over males anti lemales ol lime spent in a iwo-adull, 
siiigle-i hild (<tges 4-6) householtl, except that time spent on child care was set 
cciiial to zeio. (4) 1 he one-adult, ^ 2 child householtl was assumed to have 
(he same uses of lime as a leniale in a (wo-.itiull, single-thild (ages 1-6) 
hou.sehold. 

1 n evaluating the value of leisure lime, the after-tax “w.ige'’ rale was taken 
.IS 5 kionor/houi. This figure was generally less than the after-tax wage tales 
for unt.txed pioduclive .tttiviiies. 

3, For each )c-ii, the t.ix-iatc base used for talciilating tax lates was one 
plus the payroll i.ix late (sources' liidiisiriens uticdningsinsiitui 1976, pp. 
77-78; .Sveiiska aibelsgivaieloieniiig 1978). I'he total lax talc consisted ol 
the payroll tax plus the aveiage or marginal rate tan income and the average 
or rn.irgiiial indirect tax tate, plus a coitcction for income-basetl subsidies, all 
divided by the r.iie base. Iiidirett lax i.ites weie found by dividing total 
indirect taxes by disposable income and iniilliplying by 1 minus the avei age or 
maiginal tax i.iles on income to find average or maiginal indirect tax rales, 
respectively. It was then assumed that 75 peicenl of the calculated indirect 
r.ites were borne on the margin by labor income. Approximate collection 
I .lies lor transfe-rs were deiived by dividing lianslers lo houseluslds by dis¬ 
posable income. Payroll taxes vary fiom a few percent in the 195()s lo 22.4 
percent in 1974 (to 35.1 percent today). Indirect tales, after being multiplied 
by 0.75, aie cm the cjrder of 5-10 peiccnt. 1 he transfer conection teim rose 
fiom -3 5 pciteni in 1954 to 6.9 jktcciii in 1974. 

.■\ sample calculation—foi the 1969 aggregate marginal tax laie of 65.5 
percent—is as follows. Jakobsson and Normann (1974) report an aggiegate 
marginal income lax rate of 0.478. The payioll tax rale lor 1969 was 0.125. 
Note that this rate is assessed pci krona of gros.s wages (i.e., wages befote 
income laxe.s) paid by employe!s. Indiietl tax revenues divided by disposable 
income were 0.209 (= 18,409/87,979), implying a marginal, indirect-tax rate 
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on nel-of-income-ux income of 0.157 (= 0.209[3M1). In addition, transfer 
payments divided by di.sposable income were 0.052 <= 4.553/87,979). I'hiis, 
an employer who paid 1.125 kr for tabor generated payroll taxes of 0.125 kr 
and gross income of 1.0 kr. Of the income, 0.478 kr went to income taxes 
while the remainder (1 — 0.478) was available for consumption and saving 
and is assumed to bave been taxed at a 0.157 rate. 1 he 0.052 transfer rate is 
treated as an additional marginal tax payment on gross wages. The aggregate 
lax rale is thus total taxes generated by the 1.125 kr intrement in income, 
divided by 1.125: 

0.125 + 0.478 + - 0.4781(0.157) + 0.052 ^ q 
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Comments 


A Superior Solution to Captain MacWhIrr's 
Problem: An Illustration of Information 
Problems and Entitlement Structures 

Gene E. Mumy 
Ohio Stale UmverMty 


You couldn't tell one man’s dollars from another’s, he 
said, and if you asked each man how much money he 
brouj(ht on board he was afraid they would lie, and he 
would find himself a long way short. [Joseph Conrad, 
Typhoon] 


In his novel Typhoon, Joseph Conrad po.sed a classic problem of 
information revelation. At first glance, the problem appears to V>e 
insoluble because of the apparent incentive for individuals to gener¬ 
ate wrong informadoti. After further consideration, however, it turns 
out that ati appropriate chatige in the entitlement structure of the 
problem allows it to be solvetl by generating an incentive for each 
individual to reveal correct information. In this sense, the problem is 
akin to the class of free-rider problems a.ssociated with public goods 
that can be .solved by using the demand-revealing process ( Tideman 
and fullock 1976). 

Conrad’s problem is stated in Section 1, along with the arbitrary 
solution imposed by the fictional character Captain MaeWhirr. .A 
superior solution, which actually generates correct information, is 
presented in Section II. Section 111 contains a brief di,scussion ol the 
similarities between the superior .solution and the demand-revealing 
proce.ss for public goods, followed by conclusions. 


The author would like to thank Richard Jensen and William M.uison for their helpful 
comments, t hey are not, however, responsible for any errors in this paper. 
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I. The Problem and Captain Mac Whirr’s Solution 

In Typhoon, 200 Chinese workers are beinfr transported to their home 
province on a ship commanded by Captain MacWhirr, after working 
for 7 years in various tropical colonies. Each worker’s accumulated 
savings of silver dollars are stored in his own camphorwood chest. As 
the title of the story sugge.sts, all is not smooth sailing during the trip 
and the ship runs into a typhoon. 

The problem that we are interested in arises when, during the 
course of the storm, all the wooden chests get smashed and the silver 
dollars are all scattered between decks. A riot ensues among the 
Chinese workers as they try to retrieve their money. In order to stop 
the conflict, the captain sends the first mate and some men to pick up 
all of the money, which they do. When the storm is over, the captain, 
being a fair man, wants to return the money to its rightful owners, but 
he has a problem. He does not know how tntich money each man had. 

rhe problem here is to obtain informaiion about how much money 
each man actually had, assuming only that the captain knows the total 
amoutti of money atid that each man knows the amount he had but 
tioi necessaiily rhe amount that anybody else had or even the total 
attiount of tnoney. The captain realizes that he cannot just ask each 
ttiati how itiuc h tnotiey he had and thett give hitn tliat much because 
some of them ittight overstate the amount. In this event, the sum of 
the claims wtiuld exceed the total amoutn of money, thus making a 
distrtbution oti this basis impossible. The captaiti concludes that it is 
impossible to guarantee correct information and therefore imposes 
att arbitrary distribution that he thinks is fair. According to him, the 
fair assumption is to assume that all the men had the same amount of 
money and, therefore, he gives an eqttal sh.ire of the total to each 
man. 

It is obvious that this distribution is not necessarily fair at all unless 
all the men really did have the same am<mnt of money, which is not 
known to be the ca.se. The captain’s solution to the problem is not a 
solutioti to the information problem but lather a surretider. Such a 
surrendet, however, is not necessary, as will now be demotistrated. 


II. A Superior Solution 

If only the individual knows how much money he had, the captain i; 
right to think that this infortnation can only be obtained by getting the 
individual to reveal the correct amount. He is wrong, on the othei 
hand, to think that there is nece.s.sarily an incentive for the individua 
to overstate the amount, fhe mistake here is the implicit assuinptioi 
that the amount stated by an individual is also the amount to which In 
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is entitled. Under such an entitlement structure, each individual has a 
clear monetary incentive to overstate his share. The way to solve the 
problem, then, is to cut this direct link between amount revealed and 
entitlement in such a way as to obtain correct information while at the 
same time entitling individuals to the amounts they actually had. 

In Captain MacWhirr’s problem, there is a fairly straightforward 
entitlement structure which generates an incentive for individuals to 
reveal correct information. Instead of letting the amount claimed he 
the entitlement, let the maximum amount to which an individual is 
entitled be the difference between the total amount of money and the 
amount claimed by everyotie else. If an individual claims less than the 
maximum amovint, then he gels what he claims, but if he claims more 
he is penaliitcd and actually receives less than the maximum amount 
to which he is entitled. As a result, no individual has an incentive to 
overstate his share (or to understate it) and he reveals the amount he 
actually had. 'I'his also means that each individual is thereby entitled 
to the amount he actually had. A more foimal presentation ot the 
problem will make this point clearer. 

For the following analysis, let r, = the amount received by the tih 
individual; C) = the amount claimed by the tth individual; rij = the 
amount the till individual actually had; atul A = the total amount of 
money. Under an entitlement structure like the one sketched out 
ab(jve, the amount received by the tth individual can be written as 


r, = c, 


max (1 + «) 


c, -(d-^rj),0 ; -l 


■ Ut, (1) 


where A — is the difference between the total amount of money 

and the amount claimed by everyone else and is, hence, the tth 
individual’s maximum entitlement. It follows that a > 0 if individuals 
are to be penali/cd for claiming amounts greater than the nraximum 
entitlement.' 

■fhe individual’s proljlem is ttj choose r, so as to maximize r,. I'his, 
however, calls for an assumption about Fhe reasonable as¬ 
sumption here is that as will be seen by the solution to 

the individuars problem. If we substitute for 2j,,rj and note 

that it follows by definition that A — = Of "e ran rewrite 

equation (1) as 

r, = r, — max [(1 + a){c, — «,), OJ. (2) 


' If <■( < A llicn r, = r, Ik'< aii.se (I -t- a)[f( — (.i — i.'Clso note ih.it it 

is assumed that iho ainouiii of money ihai mdiviiluals are pinalocd is krpi bv the 
<aptain. 
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From equation (2), it is easy to see that n = at if Ci = but that n < oj if 
Ct § Of, and as a result, the individual honestly reveals and receives the 
amount of money he actually had. Furthermore, this is true for all i, 
which means that the assumption of is validated and 

we have a Nash equilibrium solution for our problem.* 

This solution is obviously superior to the captain’s solution because 
it returns to all individuals the amounts of money they actually had, 
regardless of what the actual amounts were. This depends on obtain¬ 
ing correct infoi ination, which is accomplished by eliminating the 
direct link between individuals’ claims and their entitlements. It is that 
direct link which gives rise to the incentive for wrong information. 

III. Comparison with the Demand-revealing Process and 
Conclusion 

It was slated in the introduction that the superior solution to Captain 
MacWhirr’s problem has a certain similarity with the demand- 
revealing process for determining preferences for public goods. We 
are now in a position to see what this similarity consists of. In brief, the 
similarity is that both are solutions to information problems that are 
obtained by separating the information revealed by an individual 
from the individual’s entitlement. 

The classic public goods problem is that the free-rider phenome¬ 
non generates an incentive for individuals to understate their valua¬ 
tions of a public gooti if their share of the financing depends on that 
valuation. That is, if individuals feel that their own share of total 
financing is negligible and, hence, that they are quantity takers from 
the total collective decision, then they have an incentive to understate 
their demand for the good if their entitlement (obligation) to share in 
the financing is determined by the demand they reveal. The twist in 
the demand-revealing process that causes individuals to accurately 
reveal their demands is that the entitlement structure is changed from 
(he one that is usually implicitly assumed. Instead of obligating the 
individual to share only in the financing of the slated quantity desired 
at a given tax price, the individual is obligated, rather, to share in tht 
hnancing, at a given tax price, of the quantity chosen by everyone else 
'Fhis simple change in entitlement basically eliminates the possibility 
of a free ride. I hen, by using suitably designed penalties for deviatin} 

* This IS not the case with other assumptions about and, hence, the justificatio 

for the assumption used, fhat is. if it is initially assumed that this mean 

that r, is maximized with r, < a,. But, if each person knows that everyone else has th 
same incentive to act in a similar fashion, this would lead to the conclusion that • 

which contradicLs the initial assumption. In the same way, the initial assumptio 
of also leads to a coiitradiaion. 
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from the basic entitlement given by everyone else, the individual has 
the incentive to reveal his true demand for the good. Becau.se 
everyone thus has incentive to reveal his true demand, the total 
demand for the good is honestly revealed and the appropriate quan¬ 
tity can be chosen. The basic feature of this process is strikingly 
similar to the basic feature of the superior solution to Captain Mac- 
Whirr’s problem. 

The point here is not that there is neces.sarily a large class of 
economics problems that correspond to the fictional problem posed 
by Conrad and susceptible to solution by the method developed above 
in Section II. Rather, the fictional problem nicely illustrates that the 
apparent incompatibility between economic incentives and revealing 
true information does not mean that the information problem is 
in.soluble. On the contrary, it indicates that a suitable entitlement 
structure restores the compatibility between incentives and true in¬ 
formation, thus allowing the information problem to be solved. 


Reference 

Tideman, T. Nicolaus, and Tullock, (Jordon. “A New and Superior Process 
for Making Scx ial Choices.”y P.E. 8-1, no. t> (December I97G). 1 145-59. 



Concentration Changes and Inflation: 
Some Evidence 

E, Woodrow Etkiird, Jr. 

(fi^ieral Motors Coipotnhnn 


In a recent article in this Journal, Gisser and Johnson (1979; hereafter 
cited as G & J) examine the relationship between industrial concentra¬ 
tion changes and inflation. They develop an algebraic model relating 
concentration levels to an economy-wide price index, make assump¬ 
tions regarding parameter values, and then use the model to simulate 
a relationship between concentration changes and inflation. I'he cen¬ 
tral feature of the model is a Cournot-type specification of the re¬ 
lationship between concentration (or number of firms) and industry 
price level.* Naturally enough, this produces a positive relationship 
between concentration and the price index and, by extension, be¬ 
tween concentration changes and index changes.** However, the re¬ 
lationship is weak in the usually observed concentration ranges, and 
so the authors conclude that “the act of reducing concentration, if 
successful, is likely to have little impact on the consumer price index” 
(p. 1377). 

The purpo.se of this note is to present .some empiiical evidence 
regarding this issue. In particular, do the facts support the positive, if 

Helpful coiniiierus fiom 1 (i. Maix and T. F Wallon ;ire ^laiclully .uknowlcd^cd. 
Any reiTi. lining errors aiul all opinions expressed heiciii are ihe sole rcsponsilnliiv of 
(he author. 

' Rather than using, say, a lour-hrm toncemraiion ratio, C» & J (1979, p. 1378) assume 
each industry contains some nunilx’r of identical hnns. Concentration is therefciie 
measured as the inverse of this nunihci. 

^ See Stigler (1988, pp. 36-37) for a description of the Cournot model. However, its 
theoi L’tical validity, as applied to oligopoly pricing, has been seriously undeimined (see, 
e.g., Stiglcr 1968, pp. 36-37; Scherer 1970, p. 135). The continued use of this 
mcxlcl in industrial oigani/.ation research appeals to he due solely to its mathematical 
tractahiliiy. 

[J(*urnal of l‘olitual Economy^ lUHI. vol 89, no SJ 

(g) 1981 by rh« University of Chicago 0022-380H/81/89<)5-0010$01.50 
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weak, relationship predicted by the G & J simulation? First, a measure 
of the amount of inflation which can be “associated” with an indi¬ 
vidual industry is developed. Next, using Census of Manufactures 
data, this measure is calculated for a sample of four-digit industries 
over three time periods and then correlated with the corresponding 
changes in industry concentration.^ While average economy-wide 
concentration has been relatively stable in recent years (as noted by 
G &: J), concentration ratios for individual industries have varied 
sufficiently for identification purposes. 'The results indicate that con¬ 
centration changes are unrelated to inflation. 


An Inflation Measure 

This section develops an empirical inflation measure based on the 
theoretical model used by G & J.'* In their model, G & J assume a two- 
good economy (X and Y) with a single factor of production and no 
intermediate goods. They could therefore write money income (/) as 
follows: 


I — Pt<1t + Pll<lv ~ 

where p is unit price, q is number of units, and VS is value of ship¬ 
ments. However, in a more general model, intermediate shipments 
occur. Therefore, to avoid "double counting," money income must be 
written as a function <jf value added: 

.V ,v 

^ = 2 !''■ 9 * ^ 

»*I i-t 

where Vj is unit value added (i.e., pritne lactor cost per unit) and VA, is 
ttJtal value added for the (th of N goods. The use of unit value added, 
rather than price, is important in the present context because it 
eliminates the effect of (nonprime factor) input prices on the 
industry-specific inflation measure to be developed below. I'hese 
prices are assumed to be taken as parameters by firms in any particu¬ 
lar industry. 

A Lespeyres-type "inflation index" can now be defined as follows: 


^ rhe of the in.trkel powei-inll.uioii iclationship implies ihal the cun eel eni- 
piricdl speiirualion is between inll.nion and coiHentr.mon cb.m^es r.ithei th.in con- 
eeniralion levels. For a dese iissioii of this point, sec Slij^Iei pp 8-9) 

* The inllatioii measure tfcvcloped here is simiiai In that of Biaiii.iid and Lovell 
(1966, pp. 860-61). For a discus.sioii of this measure m ihc conieNl ot .1 simple 
monetary model of mllatioii, see Eckaid (1978, <hap 2) 
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^ -> ( 3 ) 

1-1 

where superscripts 1 and 0 refer to current and base years, respec¬ 
tively, and '/® is a hypothetical nominal income calculated at current- 
year unit value addeds and base-year quantities.^ As defined, is 
unobservable. However, this can be remedied via some algebraic 
manipulation. First, rewrite equation (.S) as follows: 



L 




yjf 

-pr- 


Defining inflation as the change in L during a given period (AZ, = 
/,* — L°), .specifying L® = 1, and recalling equation (2) yields: 



This equation states that a given amount of inflation can be par¬ 
titioned into components associated with each of N elements com¬ 
prising the economy. Each component is the proportionate increase 
in that element’s unit value added, weighted by its base-year, total 
value added as a fraction of total income. 

Abstracting from the effect of relative size (as represented by 
VAyP), the amount of inflation associated with each element is mea- 

* Note that this index overstates the “true" price-level change to the extent that (1) the 
increase in unit value added corresponds to a quality increase and (2) the output vector 
in peritxl 1 differs from the initial period vector due to consumer responses to relative 
price changes. 
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sured by the first term inside the parentheses of equation (4). For 
convenience, this ratio is defined as follovvis; 


Rt = 



(5) 


The above is, of course, analogous to a change in average unit price. 
However, the confounding effects of price changes in inputs pur¬ 
chased from other industries are absent. 


Empirical Results 

In this section, Rt is calculated for a set of “concentrated” four-digit 
SIC industries using Census of Manufactures data. This infiation 
measure is then correlated with concentration changes. But first it is 
necessary to convert equation (5) to a form which can be calculated 
from available data. This is accomplished by recalling from equation 
(2) that Vi = yAi/qi' therefore: 

_ VAUq} 

* VAVq^i 

^ VAHVA^i 

qHq^ 

Value added is available from the Census of Manufactures four-digit 
industry data.* While qi is not available, a proxy for the ratio ql/q? can 
be obtained for certain periods from the Census of Manufactures 
Indexes of Production. These indexes are estimates of the change in 
physical output calculated at the four-digit industry level. 

Based on these data./I^ is calculated for the time periods 1958-63, 
1963-67, and 1967-72.^ To restrict the sample to “oligopolies,” only 
those industries with four-firm concentration ratios (CR4’s) equal to 
50 percent or greater at either the beginning or the end of a period 
are included.* The data for the 3 periods are then pooled. To nor- 


• The census value-added data are at best a rough proxy for “true” value added, i e., 
the amount of national income originating in each industry. I n fact, census value added 
for all manufacturing industries has consistently overestimated national income 
originating in manufacturing by roughly one-third (see U.S Department of Commerce 
1972, pp. xxxii-xxxiv). Because this error has been fairly con.sislent over the period 
covered by the study, we assume that the change in census value added is an unbiased 
estimate of the change in the "true” value added for each industry 

' These periods were stipulated by data availability. 

• The selection of the 50 percent cutoff point is, of course, arbitrary. Its purpose is to 
avoid a possible bias by excluding those indu.siries which few would aigue possess the 
market power required for the adverse pricing effect postulated under the 
concentration-inflation hypothe.sis. The iiidu.stry ctinceniration ratios u.sed aie those 
published by the Census of Manufactures. Their shortcomings as measures of m.irkel 
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TABLE 1 

Avkraok Indum ry Inu.aiion (/?,) by Chan(;e in Cayncentration for Thrff, Pooled 
Time Periods (1958-63, 1963-67, and 1967-72) 


Nurnbri'of Average Standard 


Changes ill CR4 Indusltics 7?( (Y< 100) Error 


<-IO . 10 88.5 3.4 

- 610 -10 . 32 91.4 2,7 

-1 to - 5 . 76 93,1 1 3 

0 . 30 92.3 1.7 

1 to 5 . 85 95.9 9 

6 to 10 . 27 91.2 2.6 

> 10 . 14 _ 89.5 ^ 

Total . 274 93.1 .7 


niaJi/e the observation.s for different “underlying” inflation rates in 
these periods, each R, is divided by the increase in the implicit price 
deflator for the appropriate period.” Finally, the data are grouped by 
the change in ('R4. 

I'he results are presented in table 1 (note that the RiS are multi¬ 
plied by 100). 'file data for falling CR4’s appear to support the 
hypothesis that a positive relatiotiship exists between CR4 changes 
and inflation.Average Ri (x 100) declines from 93.1, for ()R4 
reductions of 1 -5 percent, to 88.5 for CR4 reductions of greater than 
10 percent. However, the data for rising CR4’s do not support the 
hypothesis. For CR4 increases of 1-5 percent, average R, (x 100) is 
95.9, while for CR4 increases of 6-10 percent and greater than 10 
percent, Ri (x 100) declines to 91.2 and 89.5, respectively. Thus, R, 
appears to be more closely related to the absolute value of CR4 
change rather than to the direction of change. 

This is made clearer in table 2, where the data of table 1 are 
rctabulated by absolute value of CR4 change. For industries with CR4 
changes greater than 10 percent in absolute value, average /?((X 100) 


(onccniiatioii have Ijccn well <lo< inncmcd (see, e.g , Bock 1972, pp. 11-30). These data 
are used here becHuse they have come to dominare empnical thinking on concentra¬ 
tion, a development best explained by (I) their accessdiiliiy and (2) their toirc- 
spondciice fo (he mass of other data published by the census at (he iour-digii imiu.stry 
level. 

” I'hc average industry concentiation for the universe four-digit SIC industries 
has remained essentially constant between 1958 and 1972 (see, e.g., Allen 1976, p. 665, 
table 1 ). I'herefore, the inflation raies (x:curring in each of the three periods must have 
been due to other (i.e., monetary) phenoiiieiia. Removing the effects of these diffeient 
“underlying” rates eliminates one major soiiice ol variance in the pooled data. 

*® 'rhe .simple correlation between iff and concentration change for all 274 obser¬ 
vations is positive although small in magnitude (r = .024) and statistically insignificant 
(/ = .4). 
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I'ABLE 2 

Avkra(;e Industry 1nfi.aiion ( R ,) by Absolute Vai ue of C::HANf.E in Concf.n 1 ration 
FOR Thrfe Pooled Time Periods (1958-63, 1963-67, and 1967-72) 


Absolute Value of Nuinijerol Average Standard 

Changes in CR4 Industries Wi(x 100) Error 


>10 . 24 89.1 2 5 

6-10 . 59 91.3 1 9 

1- 5 . 161 94 5 .8 

0 30 _92^_ IJ 

Total . 274 93 1 .7 


equals 89.1. For intlustries with absolute concentration changes in the 
1-5 percent range, average Rf (x 100) rises to 94.5.” While the 
average RiCx 100) for the 30 industries with no change in CR4 is less 
than that for the 1-5 percent change group, it is still greater than the 
average /?( (x 100) for 6 percent and greater CR4 change groups. 
Stated alternatively, for the 83 observations with CR4 changes greater 
in absolute value than 5 percent, average /?, (x 100) equals 90.6. For 
the 191 observations with CR4 changes of 5 percent or less in absolute 
value, average Ri (x 100) is 94.2. 

What might account for this pattern? One explanation is that com¬ 
petition in coticentrated industries (as well as others) often takes the 
form of cost reductions through productivity increases. Since payroll 
costs are a substantial portion of value added, productivity increases 
would be discernible as reductions in unit value added, ceteris 
parilfus. If such cost savings occur among one or more of the top four 
firms, (;R4 rises as the firm (or linns) iniroducitig the cost reduction 
enjoys a relative increase in .sales. If a cost reduction occurs among the 
smaller firms, CR4 in general falls.However, in either case average 
industry costs fall. 1 he data in tables 1 and 2 may therefore merelv 
reflect a tendency for firms introducing greater cost reductions to also 
have greater sales increases, that is, to induce greater CR4 changes. 


" 1 he .simple coi lelaiioii belweeii /J, .iiid ihe .ibs<ilme v.due of coiKemraiion thaii^e 
lor all 274 obsei vatioiLs IS negative (r = - 18) .ind slalisiicallv signiht ani ai the I percent 
level 

I'he exception CKcuts tf (1) the sales increase ol the sinallei firm is sufhcieni to 
place it among die top lour and (2) the C.R4 ol the new top four is not less than that ol 
the old, t’livcii the relatively shoit time |H’ii<m1s aiialv/cd (4-5 seats), siith excepiioiis 
arc pi obably infi eijnent 

Pelt/man (1977) has examined the relationship lielsseen unit tost changes and 
concentration changes using a sample tltawii Iroiti the s.unc universe of census kiiit- 
digit inthislries. He found that both concentlation iiureases and decreases are cortc- 
lalcd with cost decreases. However, for loncenUiUion dccieases, the coiielauon is 
“statistically insignificant and only a fi action of the effect of similar increases" (p 245) 
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One final observation is in order. Since each individual value of/?(is 
corrected for the appropriate “underlying” inflation rate, an Rf (x 
100) of 100 indicates a "contribution” to the inflation rate equal to that 
of the economy as a whole, that is, equal to the underlying rate. But, 
as indicated in table 1, the average 100) for all 274 observations 
is only 93.1. Thus, the “concentrated” industries in the Census four¬ 
digit industry universe, for the given periods, performed significantly 
better than the remainder of the economy.''* 

Conclusions 

In short, the evidence suggests that a policy of increasing concentra¬ 
tion would be as effective in ea.sing inflation as a policy of decreasing 
concentration, which is to say, not very effective at all. That is because 
the concentration-inflation hypothesis which formed the basis for the 
G & ] simulation has the direction of causality between concentration 
changes and price changes reversed. Concentration changes are in¬ 
duced hy corresponding changes in the relative competitive strengths 
of firms in an industry as manifested by (among other things) price 
changes, not vice versa. In fact, the evidence suggests that neither 
increasing nor relatively high concentration is associated with “bad” 
inflationary pet formance. Policymakers will have to look elsewhere 
for a cure for inflation. 
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The Crisis in Keynesian Economics. By Sitt John R. I{u.ks. 

Oxford: Oxford University Press, 1974. Pp. 88. £1.75. 

My j;eneralion accepted tlic inc.ssage conveyed by the General I'heoiy with 
remarkable agility. The analytic inierprelalion of the message offered by Sir 
John Hicks effeclivelv contiibuted to the victorious sweep. But the prospects 
of a "New Economics" faded within the first 2 decades of the postwar pet tod. 
rhe piomise of ac tivist slabilir.alion policies failed and eventually yielded a 
reality dominated by an appaienily iniiactable inllatton, protracted iinem- 
ploymenl, and volatile exchange rales. Analytic lelleclions increasingly ques¬ 
tioned, moreover, the “Keynesian resolution” of the intelletlual dilemma 
posed by the experiences of the mterwar period. These experiences seemed 
incompatible w ith iiihei ited pi ice theory. I'hcy revealed what appeared to be 
an immanent Haw of the price mechanism. Two options were available. One 
would refoi mulalc traclitional price theory in order to subsume these experi¬ 
ences within an extended liamewoik, and the cither would abandon price 
theory <ind replace it with "sociology" or "social psychology.” Keynes chose 
essentially the second option. This was panicularly rellected in the treatment 
of the labor market. It was also rellected in his approach to long-run expecta¬ 
tions and their lel.iiion to investment expenditures (remember the “animal 
spiiiis") Ibis inlluerice pervades the Keynesian esiablishment and .tlfccts 
much of the "posl-Keynesian” literature and the post-Keynesian approach to 
the inflation problem. 

1 he evolution of nionet.iry theoi y over the postwar decades may be inter¬ 
preted to I esult f I out a systematic reconsideration of the Keynesian resolution 
of the inlelleclual dilemma. It was increasingly aigued that an intellectually 
moie productive resolution would continue a viable “classical program.” This 
program is centered on an extended price theory incorporating pervasive 
information problems but would discard the forms ol the “neoclassic syn¬ 
thesis" successfully established during the eai ly postwar phase. This back¬ 
ground of our current struggles attaches particular significance to the 
comments on "the crisis in Keynesian economics” offered by Hicks to the 
profession. 

His material is organi/ed into three chapters: the fiist covering "the mul¬ 
tiplier”; the second attending to money, interest, and liquidity; and the last 
exploring wages and ifie labor market. The chaptei on the multiplier exam¬ 
ines issues badly neglected in the traditional Keynesian liteiarure. Hicks 
approac hes in a somewhat implicit and indirect manner two aspects associated 
with supply behavior in the context ol macroanalysis. The first aspect con- 


[Juumal of Pnluual t'ronomy, 1981, vol 89, no 5) 
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cerns the role of stocks (i.e., inventories) in the multiplier process, I'he 
argument elaborates the importance of initial conditions expressed by the 
inherited pattern of slocks for the magnitude and persistence of the multi¬ 
plier response to variations in autonomous expenditures. The reader of these 
passages tan hardly avoid wondering about the usefulness of activist fiscal or 
monetary policy against the background of this chapter. 

The other aspect of supply behavior involves the determination of prices. 
Hicks elaborates his previous differentiation between “fixprice” and “Hex- 
pricc" market systems. He notes that the traditional multiplier analysis pie- 
supposes a fixprice system. The discus.sion of this problem offers some 
thoughtful and suggestive remarks to the reader. But it also contains linger¬ 
ing residues of the Keynesian resolution. This appears most particularly in 
the characterization of the dcstinct market systems. Flexprices are said 10 
reflect the tiperation of supply and demand, whereas fixprice patterns move 
beyond any exposure to market forces. We also read that flexprices are 
formed by the market and fixprices are set by suppliers (or demanders). I'his 
descriptiejn is quite misleading. All prices are set, and all reflect market forces. 
The issue tenters on the relative frequency of “resetting" and the nature of 
the market forces shaping the price setting. Several explanations emerged in 
recent years to account for the relative inflexibilities of prices as a rational 
matket response to specifiable uncertainties and transaction costs. One of the 
approaches emphasizes that prites arc geared in the context of substantial 
transaction costs to perceived “permanent" market tonditions. Price setting 
disregards tinder these conditions transitory changes in prevailing conditions. 
Changes in permanent conditions are moreover perceived only slowly, and 
prites are thus inflexible relative to the actual but insufficiently perceived 
underlying market conditions. Such extensions of price theory avoid the 
Keynesian resolution dritiing into ad hot argiimeius without a coherent 
analytic framework. 

The sectind chapter bears on aspects of the Keynesian transmission mecha¬ 
nism and the role of liquidity. The remarks on the latter point are closely 
related to an analysis which emphasizes that costs of information and transac¬ 
tion differ widely over the spectrum of assets. This variance assures the 
emergeiKc of "transaction-dominating assets." Future opportunities depend 
under the circumstances, as Hicks correctly elaborates, on current portfolio 
compositions. Moreover, the author argues cogently for a wider spectrum of 
substitution relations surrounding money beyond the narrow choice of 
“money versus bonds." He still .seems to confine, however, the transmission 
channels to returns on finatuial as.sets. The “City Syndrome," so prevalent in 
the Keynesian traditions, finds it apparently difficult to comprehend that in 
most countries monetary impulses are dominantly transmitted, in the absence 
of developed capital markets, via substitutions between money and goods or 
real assets. Money as the transaction-dominating asset substitutes at the mar¬ 
gin ill all directions. 

The last chapter is particularly informative and suggestive. Hicks discusses 
the lack of a wage theory in the Keynesian tradition. The reader also inlet s 
that Keynes’s "wage theorem" opened the gate to a post-Keynesian flood of 
inflation theories based on more or less explicit denial of economic anahsis 
Hicks emphasizes in particular the need for an endogenous incorporation of 
wage push. The formulation couched in terms of “economic" and 
“noneconomic” causes is singularly misleading, however. On the other hand, 
the attention given by Hicks to the comparative durability of market relations 
seems particularly promising for a useful approach to the explanation of 
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wage behavior. Such reasoning would link the labor market phenomena with 
evolutions on the output market indicated by the fixprice patterns. The 
allusions to “fairness” considerations in the wage process seem redundant, 
however. They have, as they stand, no basis in analysis and add nothing to the 
substance of Hicks’s important suggestions. 

The remarks of a thoughtful professional are always worth pondering, and 
Sir John Hicks certainly deserves our attention. But I wonder about his 
conclu.sion that “reformulated Keynes is much more like Keynes than it is like 
the cruder forms of neo-classical doctrine.” I wonder about this statement as 
intriguing suggestions strewn along the text lead us, as some analytic de¬ 
velopments would appear to do, beyond “Keynes” (reformulated or not) and 
also beyond “neo-classical exercises,” crude or sophisticated. 

Karl Brunner 

Umversily of Rochester 


The /.ero-Sum Society: Distribution and the Possibilities for Economic Change. By 

l.K.siER C;. I'hlirow. 

New York: Basic Books, 1980. Pp. 230. $12.95 (cloth). New York: Penguin Books, 

1980. $4.95 (paper). 

This is a book about the economy of the United States, but it could be about 
almost any other Western developed economy. With perhaps one or two 
exceptions the advanced countries of the Western world face the same eco¬ 
nomic problems and have already several years' experience of high unem¬ 
ployment, slow growth, and rapid inflation. They all—with one exception— 
face special probletns arising out of the increased price of oil (the exception, 
Cireat Britain, has managed to have its own special problems as a result of not 
having an oil problem and having, instead, a deindustrialization problem 
caused partly by a high excliatige rate). In many cases the problems that 
riiurow enumerates apply even more acutely to other countries. Some of 
them are even moie dependent on imported oil than is the United States; 
most of them protect their ailing firms or industries more than does the 
United States; and most of them spend higher shares of their national prod¬ 
uct on welfare payments than does the United Slates. But give or take a 
problem or two, the picture he paints of the current economic performance 
of the United .States is very familiar in other Western countries, and his 
diagnosis probably applies in large measure to the other countries as well. 

I'he main theme of the book is that the economic problems facing the 
American economy today are solvable but only at the cost of somebody losing 
in the process, and it is almost impossible these days to make the required 
losses stick on anybody. I'here arc no longer powerless groups in society to 
whom the burdens of any economic change can ultimately be passed. Civil 
rights movements, black power, women’s lib, poverty lobbies, and so on have 
become more militant and effective. In addition, there has been increasing 
Balkanization of the country, and individual slates are increasingly reluctant 
to make sacrifices in the wider national interest—for example, to be the state 
where the nuclear power stations are to be sited or where the open-cast coal is 
to be mined. 

As I'hurow correctly points out, virtually any economic policy decision will 
affect the distribution of income, and those groups who arc likely to be 
affected arc now quick to recognize the threat and to take effective defensive 
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action. Until recently, changes in the distribution of income between groups 
that resulted from changes in the economic structure or from government 
policy measures could be accommodated within a growing economy. Now, 
however, economic growth has dramatically slowed down, so that we arc 
playing a zero-sum game. Those who lose out relatively to the others also lose 
out absolutely, and this is more than can be tolerated within the present 
balance of political power. 

Thurow then goes on to show how this general theme applies to some of the 
major economic problems facing the U.S. economy today. He begins with the 
energy problem and explains how the free market solution—letting the price 
of oil rise to world levels—means a large cut in real incomes of certain sections 
of the population, particularly the poorest. Furthermore, although inflation 
usually only redistributes income among groups within the economy and docs 
not necessarily impoverish the economy as a whole, it is still greatly resented 
so that the overall inflationary impact of rising energy prices has to be taken 
into account as well. 

And other measures to alleviate the energy problem—notably more resort 
to some of the more economically viable energy alternatives such as open-cast 
coal mining or nuclear power—are opposed by special local interests who are 
able effectively to delay projects to the point that they kill them off. And, 
anyway, speculative investment in alternative forms of energy is not attractive 
to private industry since the high price of imported oil is the result of a cartel, 
not of corresponding high costs of production. So if some alternative form of 
energy could be produced that was competitive with oil at, say, $30 per barrel, 
the oil producers could Just cut their price, while still making fat profits, and 
put the new competitor out of business. Hence, large-scale investment in the 
search for alternative energy sources must rely largely on the government one 
way or another. But that means more taxes, and we are all against them. 
Moral; Whatever way one turns, there seems to be no effective solution in 
sight to the energy problem without some group or other getting hurt, and all 
the potential victims are now able to protect themselves. 

As regards the problem of inflation. Thurow's analysis follows that of many 
other commentators and is, in a sense, .somewhat inconsistent with much of 
his basic theme. He spells out the widespread view that inflation is largely the 
result of certain key groups in sin icty being able to protect themselves against 
cuts in their real incomes. Since prices and wages are flexible upward but not 
downward, random changes in various prices and costs will have an upward 
ratchet effect on the overall price level. 

All this is familiar, but, in addition, Thurow introduces into the stor> an 
analysis of the labor market which is more original and is the one he has set 
out in more detail in an earlier hoo)f. {Generating Inequality) and which demon¬ 
strates why a cut in demand and an increase in unemployment is likely to have 
a negligible effect on the rate of inttalion. (This is very much in line with many 
of the econometric macro mixlels on the market these days—not that 
confirmation from current econometric models is aece.ssanly much comfort 
to anybody.) 

But a major conclusion of Thurow's analysis is that the burden of futile 
restrictionist monetary or fiscal policies falls tnainly on those who are already 
the weakest economic groups in the economy—siich as young people, 
women, or ethnic minorities. As he puts it: “If a rcces.sion is to be u.sed to stop 
inflation, we draft inflation fighters in a very uneven pattern. In proportion to 
their size in the labor force, sixteen to I wenty-four-year-olds are three times as 
likely to be drafted as adults. Females are 38 percent more likely to be drafted 
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than males; blacks are twice as likely to be drafted as whites; and Hispanics are 
75 percent more likely to be drafted than whites” (p. 65). 

However, in this part of the book Thurow seems to be arguing that society 
does contain C|uite a sizable number of “fall guys” who can bear the burden of 
whatever particular policy society adopts. Hence, what Thurow needs to do in 
order to ensure greater consistency with his basic theme is to show that the fall 
guys in the inflation case cannot be made to bear the burden of the adjust¬ 
ment that would be needed in order to solve the other problems. This might 
not be difficult to do—for example, a rise in gasoline prices would hit white 
males aged 25-64 as much as, if not more than, automobile owners in other 
age, sex, or ethnic groups. 

Thurow goes on to discuss other current i.ssues, some of which are clear 
applications of the zero-sum game theme, such as the environmentalist pres¬ 
sures which really involve special pleading by the middle classes to protect 
those aspects of their standards ol life that have moved up in the ranking of 
their priorities and that can only fx; fully satisfied at the cost of sacrifices by 
the poorer groups of the population. 

There is also a connection between the zero-sum game theme and 
Thurow’s discussion of the reasons for the slowdown in the rate of growth in 
prcjciuctivity in America during the last decade. For he atiribuies a large pat I 
of the slowdown to govei nment policies that prop up ailing firms or industries 
in one way or another, again becau.sc of the effectiveness of political pressures 
to ensure that nobody suffers from economic change—at least as long as the 
Itotential victims are identified in terms of firms, or industries, or cities, or 
regions, and not in tertns of groups of individuals, such as young people or 
ethnic minorities, for whom the old American virtues of self-reliance and 
individual initiative must be preserved and protected against the insidious 
effects of income support schemes. 

■Tliioughout the book Thurow repc-atedly refers to the overriding impor¬ 
tance of society achieving some sort of consensus as to the c ritcria that should 
be used to dec ide which grotips should lo.se out as a result of some policy, and 
by how much and in what c ircumstances, and so on. He provides some idea of 
what sort of criteria of distributive justice might be reasonable, one of the 
most important of which is that the earnings dispersion of all groups in the 
economy should be no more unequal than it is now for fully employed white 
males, who, he shows, have a much smaller dispersion than other groups. 
Presumably, he would also want to change the relationship between the mean 
eainings of white males and of these other groups, since the latter might nut 
complain about having higher dispeision if their means were twice as high 
and, conversely, they might not be all that pleased to have zero dispersion if 
their mean was 10 percent of that of white males. 

Thete arc, first of all, some technic al objections to Thurow’s rather casual 
u.se c>f earnings dispersions for welfare purposes, such as the fact that earners 
are less than half the population, or that earnings may not be highly corre¬ 
lated w'ith family incomes per capita (a measure that Thurow uses in a 
different context, although adult ecjuivalent units would have been a better 
deflator). But the big question is how to achieve the desired dispersion. 
Thurow advexates some kind of governmental guaranteed job program—not 
at minimum wagets but at the wages appropriate to the skills and so on of the 
people in question. However, the precise functioning of this program is not 
spelled out in much detail. 

One is likely to be equally skeptical as Ut the chances of society reaching 
some sort of consensus on the various distributive objective's that Thurow 
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enumerates. Distributitmal criteria have been the subject of debate since 
Aristotle (if not earlier), who first introduced the concept of “distributive 
justice” and the notion that people should be rewarded in proportion to their 
merits. However, as Aristotle pointed out, there is usually much disagreement 
as to what constitutes a merit. And in the following 2,000 years or more, not 
much progress has been made in sorting it all out to everybody's satisfaction. 
Just appointing a couple of philosophers to settle the matter for us will not 
help; there is a chance that the two appointed wcjuld be Rawls and Nozick. 

Governments have to obtain some idea—however vaguely—as to what the 
public might, on the whole, prefer and, in the light also of other considera¬ 
tions of the national interest, act accordingly. If, of course, one is saddled with 
a government that is incapable of acting since its leaders want to be loved by 
everybody, it is not surprising that no clear distributive policy is ever adopted. 

t)f course, I hurow recognizes at one point that, perhaps, he is being naive 
in supposing that society (an solve its problems, and he recognizes that “in the 
records of history, we certainly would not be the Hist society that failed to 
tome to grips with its fundamental internal problems” (p. 207). Of course, 
problems that seem insoluble at any particular point of time—in the’sense 
that the theoretical sohilions are not feasible—sometimes Izetome soluble with 
the passage of time. The actors in the drama may change or learn new lines, 
rhurow’s message might actually get through to .some of them. He has an 
extraordinaiy gift for clarity and simplicity of exposition, illuminated from 
time to time with striking phrases, without being technically inaccurate. Such 
inaccuracies as I have spotted have not been the result of oversimplificatioti 
but of occasional lapses itito carelessness, such as his incorrect attribution to 
two authors of an article by only one of them (Malcolm Sawyer), or his failure 
toexplaiti why at one poitit (p. 1 1) he says that “welfare" (i.e., inccmie support 
payments) constitutes only 1.2 percent of GNP in the United States, whereas 
at another point (pp, 155 and 156) he says that redistributive transler pay¬ 
ments amount to over 10 percent of GNP. 

On the whole the diagnosis is probably laiily mainstream though Thurow 
does well to bring out the zero-sum game element in the situation and their 
distributional consequences, as well as drawing all the strands together m a 
more coheretit and authoritative exposition than is easily available elsewheie. 
It is his polity prescription that is the least convincing part ol the book. 
Pet haps there is no solution and never will be. .So perhaps we just have to 
suffer pseudo-controllcd experiments, such as the current experiment in 
testing the British economy to destruction. It remains to be seen whether the 
U.S constitutional setup can prevent its being subjected to similar experi¬ 
ments. 

VVlt.KRtD BtCKtKM.CN 
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This paper lorimilates and esiiniates a siruriuial inieitemporal 
model of labor supply. Using iheoreiKal eharacleiizations derived 
Iroin an economic model of lifetime bchavitir, a iwo-siep empirical 
analysis yields estimates of inteitemporal and uncompensated sub¬ 
stitution effects which provide the information needed to predict the 
response of hotns of W'ork to lile-cvcie wage growth anrf shifts in the 
lifetime wage path. 


Introduction 

Over the past several years there has Ireeii considerable activity in 
fonnulatitig life-cycle models of labor supply, Mtrst of this work has 
gone iintioticed in the empirical literature. I'his study develops an 
estimable model of labor supply that fully incorporates life-cycle f.ic- 
tors, and it devises simple econometric prtxedure.s for estimating this 
model. 

Most etnpirical work (in lalK>r supply assumes tlecision tnakitig in a 
one-period context. Typically, annual hours ol work are regressed on 
the current hourly wage rate and some measure of property income. 
A worket, liowever, determines his current labor supply in a life-cvcle 
setting. Unless credit markets are “perfectly imperfect" and there is 
no human capital accuinulation, the supply (if lalior is a function of 
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current and future discounted wage rates as well as wealth and con¬ 
straints in other periods. Accordingly, regressions of hours of work 
on current hourly wage rates yield a wage coefficient that confuses the 
response of labor supply to wage changes of three types: those arising 
from movements along a given lifetime wage profile, those arising 
from shifts in the wage profile, and those arising from changes in the 
profile slope. As a result, the wage coefficient usually reported in 
empirical studies has no behavioral interpretation in the context of a 
life-cycle framework. 

The theory underlying the model of lifetime consumption and 
hours of work in this paper represents a natural extension of Fried¬ 
man’s (1957) permanent income theory to a situation in which the 
relative price of consumption and leisure varies over the life cycle. 
Theoretical characterizations of consumption and labor supply de¬ 
rived from this theory sharply distinguish between factors determin¬ 
ing a consumer’s dynamic behavior and factors determining dif¬ 
ferences in consumption and hours of work across consumers. This 
separation leads to a manageable empirical model that allows one to 
discriminate the responses of labor supply to wage changes attribut¬ 
able to movements along a lifetime wage profile from those responses 
aitrihutable to parametric changes in this profile. In addition, one can 
estimate the effects ot wealth and demographic charactei istics on 
lifetime hours of work. 

The orgatiization of this paper is as follows. Section 1 outlines an 
ecotiomic tnodel of life-cycle behavior. vSection II develops and dis¬ 
cusses an empirical model of labor supply. Section III interprets the 
parameters of this empirical model. Section IV contains the empirical 
analysis. 


I. A Life-Cycle Model of Consumption and Labor Supply 

The consumer is assumed to choose consumption and hours of leisure 
at each age to maximize a lifetime preference function that is strongly 
separable over rime, subject to a wealth constraint. Let utility at age I 
be given by the concave function U\C(t), L(<)|, where C(0 is the 
amount of market goods consumed and L(l) is the number of hours 
spent in nonmarket activities at age t. Fhe consumer starts life with 
assets ^(0) and operates in an environment of perfect certainty. At 
each age t he faces a real wage rate equal to lV(t) assumed to be 
exogenously given. The consumer can freely borrow and lend at a 
real rate of interest equal to r(l) in period t, and his rate of time 
preference is p. A lifetime is assumed to consist of T 1 periods with 
L* being the total number of hours in each period. 
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Formally, the consumer’s problem is to choose C(t) and L(t) at each 
age to maximize the lifetime preference function 



1 

(1 + py 


UlC(t), L(e)] 


( 1 ) 


subject to the wealth constraint 
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where G (•) is a monotonically increasing function, N(l) = L* - L(t) is 
hours of work at age <, and/f(t) = 1/{[1 + r(l)][l + r(2)J... [1 + r(t)] j 
is the discount rate which converts real income in period l into its 
period 0 equivalent with H(0) = 1. 

Conditions for an optimum are satisfaction of the lifetime budget ' 
constraint and 


U,[C{t), L{t)\ = R{t){l + p)'X. t = 0. T, (3) 

+ p)'kW(t), < = 0.T. (4) 

where subscripts denote partial derivatives and X is defined by X = 
X*/G', where X* is the Lagrange multiplier associated with the wealth 
constraint (i.e., X’" is the marginal utility of wealth in period 0) and O' 
is the derivative of G. According to condition (3), consumption is 
chosen so that the marginal utility of consumption equals the margi¬ 
nal utility of wealth after adjusting for a discount factor which de¬ 
pends on the rate of time preference and the rate of interest. Condi¬ 
tion (4) determines the consumer’s choice of leisure. If it is an 
equality, then a positive amount of labor is supplied to the market. If 
it is a strict inequality, then all time is devoted to tionmarket activities. 

Using the definition of labor supply (i.e., A[/] = L* — Z.[/J) and the 
implicit-function theorem, it is possible to solve equations (3) and (4) 
for consumption and labor supply as functions of the form 

C(0 = C[«(0(1 + p)'X. »F(0]. < = 0.7'. (5) 

N(t) = NlR(t)(l + p)'X, W(t)], 1 = 0 . T. (6) 

The functions C(-,-) and N(- ,■) depentl only oti the functional fortn 
of {/(■, ■)• Asa consequence of concavity of U and the assumption that 
consumption and leisure arc normal goods, they satisfy 

C, < 0,/V, S 0,Nt ^ 0.' • (7) 


‘ See Heckman (1974, 1976) for proofs i>f these inequalities Heckin.iii develops and 
uses functions equivalent to those given by eqq. (5) anti (6) in his anal) sis ol the behavior 
of con.sumption and labtir supply over the life evde. 
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I'hese consumption and labor-supply functions allow for corner so¬ 
lutions for hours of work either at age t or at any other age t'. No 
matter what the consumer’s labor-force participation pattern over his 
lifetime, consumption and labor-supply decisions at any age (includ¬ 
ing the decision to set hours of work equal to zero) are completely 
determined by the functions C(-,-) and and the values of the 

variables R(t)(\ + p)'X and W(t).^ 

The relationships given by (5) and (6) hereafter will be referred to 
as the constant” consumption and labor-supply functions. They 
represent the marginal utility of wealth constant demand functions 
for consumiition and leisure for a particular form of the lifetime 
prelerence function given by (1), namely, the one obtained when G is 
the identity transformation. For this particular choice oi'G, X. is the 
marginal utility of wealth in period 0. Given a choice of (/(■,•), it is 
theoretically possible to compute a unique value for X using data on 
an individual’s consumption, labor supply, and wage rate at a point in 
time. 1 hi.s fact receives much attention in the foimiilation of the 
empirical model which is discussed in the next section. 

Substituting the X constant consumption and labor-supply func¬ 
tions into the budget constraint given by (2) yields the equation 

T 

A{0) = y «(0{C’(«(0(1 + P)'X, IT(t)j}. 

(-0 ( 8 ) 

- w{i){Nm)(i + p)'x, iy(<)]}. 


This eciuation implicitly determines the optimal value of X; X, then, 
can be expressed as a function of initial assets, lifetime wages, interest 
rates, rates of time preference, and ’’consumer tastes.” Concavity of 
preferences implies 


d\ ^ 1 dX 

yrdh dW{i) 


0. 


/ = 0 . TA 


(9) 


^ II ir IS optiMid) loi the fon.suiiicr ic> woi k dge /, then (ondiDon (4) »s jn equjljiy 

aiKl the hjiKti<Jiis Ci •, ) hikI A^( *. ) represeni the soitaioiis of ecjq. (3) and (4) for the 
variables ^’(0 dnHiV(0 - L* - L(t), respeclively- If, on the other hand, the nctessaiy 
contlition given by (4) is an inequality, i.e., L /2 f-*] ^ f?(0(l + p)^kW(C), then the 

consumer chooses not to woik. In this case. ^{’.’) — 0 and the function is the 

soil) I ion of the equation U |[C(0, ^*] = W{0(l + p)^k for C(0- In either case, f’( • ,■) ancl 
) only contain the variables + p)*X and W{t) as arguments and their 

functional form depends only on the form of the period t utility function For 

fuither discussion on this issue see Heckman and MaCurdy (1980). Introducing age 
dependence into ihe unlay function cI<k‘s not change any of this analysis. II the utility 
function at age t is given by U[Cil)y L{t), X(/)}, where Xit) i,s a vector of time-varying 
determinants of “consumer tastes,” (hen a third argument X(0 enters the consumption 
and labor-supply functions given by (5) and (6). Fhese functions satisfy the restrictions 
given by (7), and they also allow for corner solutions. 

^ See Heckman (1974, 1976) for prcKjfof these prc^poMiions. 



MODEL OK LABOR SIJPPLV 1 ofi"^ 

Inspection of the X constant functions reveals that consumption and 
labor-supply decisions at a point in time are related to variables 
outside the decision period only through X. Thus, except for the value 
of the current wage rale, X summari/.cs all information about lifetime 
wages and property income that a consumer requires to determine his 
optimal current consumption and labor supply. At any age, any path 
of wages or property income over a consumer’s lifetime that keeps X 
and the current wage constant implies the same optimal current 
consumption and labor-supply behavior. 

The X constant functions represent an extension of Friedman’s 
(1957) permanent income theory to a situation in which the relative 
price of consumption and leisure varies over the life cycle. According 
to these functions, current consumption and labor-supply decisions 
depend on a permanent component and the current wage rate. The 
variable X is like permanent income in the theory of the consurtiption 
function. At each point in time it is a suflicient statistic for all historic 
and future information about lifetime wages and property income 
that is relevant to the current choice of consumption and labor sup¬ 
ply. The usual concept of permanent income or wealth does not 
qualify as a sufficient statistic for this retrospective and prospective 
information.^ Given knowledge of permanent income, a consutner 
also requires information on future wages to determine his optimal 
current consumption and labor supply. Only if wages are constant 
over the life cycle, or labor supply is exogenously determined, can X 
be written as a simple function of permanent income or wealth. 

The X constant consumption and labor-supply functions fullv 
characterize a consumer’s dynamic behavior in a world of perfect 
certainty.’ Accorditig to the.se functions, there are two reasons why it 
consumer tnight change his consumption or hours ol work as he ages; 
(1) the real wage rate chttnges, or (2) the r ate of interest varies anti ts 
tiot eqital to the tate of titne preference.® 


* Pci maiient income here in dehiied as lhai mi earn of income nn hose discouuieci value 
e(|ual$ the tonsuiner's wealth in piesent value leinis Koiinallv. peimanent incnnie in 
period 0. Yg,, is dehned by the equation .-1(0) + R{()N{t)\V{t) = Y,, X/L,i^(0 

*Scc MaC-uidy (1978, 1980, in press) for a discussion exf the unccitairiiv rase, It is 
shown in these papers tliai with itiinoi iiUKlirKaiions the empiiical spt'cdications of 
lalixir supply cicveloped in this scciioii and iheir implememaiion in the Idhowing 
sec lions are consistent with a world in whic h the consumer is unc ertain about Ins future 
Idcuitie path of wa^cs and property income. 

® Changes in a toiisuiner’s tastes can also be a reason for adjustments in cnnsumpiion 
and lalx)r supply over the life cycle. As discussed in n. above, if the pciuxi unhi\ 
function is age dependent, (he A constant consumption and lalx>!-supplv functions will 
also be age dependent. It is still true, however, that the A constant functions (ully 
characien7e a consumer’s dynamic behavior. 
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II. An Empirical Model 

This section formulates an empirical model of labor supply that is 
based on the economic model described above. Specific functional 
foriiLs are proposed for the X constant labor-supply function and for 
the relationship between X and such variables as lifetime wages and 
initial assets. The following discussion assumes the availability of 
panel data. 


An Empirical Specification for the X Constant 
Labor-Supply Function 

Assume that consumer i at age / has utility given by 

ulcfi). Lift)] = Y,^/)[CK<)]“' - y,At)[Nit)r\ (10) 

where 0 < < 0 i < 1 and cuj > I are time-invariant parameters common 
across workers, and Yi((/), Yift) > 0 are age-specific modifiers of 
“tastes.” The variables Yft) and Y^iit) depend on all of consumer t’s 
characteristics which plausibly affect his preferences at age <; these 
characteristics may include such variables as the number of children 
present at age I, the consumer’s education, and even age itself. 

Assuming an interior optimum,’ the implied X constant labor- 
supply function for consumer i at age t in natural logs is 

In Nft) =-5—r {In X( - In Ytift) - In 

a>2 - 1 ‘ 

, ( 11 ) 

lnt«(<)(l + p)'l + In IT,(0}- 

Assuming that "tastes” for work are randomly distributed over the 
population according to the equation In Y^fft) = tr, — uf(t), the 
labor-supply function can be written as 

In Nft) = F, + S ^ [p - r(X)] -t- 6 In W ft) + uft), (12) 

fc =0 

where/", = [l /(<02 - l)](ln X, - o-j - In ( 02 ), S = l/ion-i - 1), uft) - 
Su’f(t), r(0) = p, and we have used the approximations In [I r(<)] ~ 
r(0andln(l + p) == p. I'he unobserved variables o-j and U((<) represent 
the unmeasured characteristics of consumer i; cr,- is a permanent 
component, and M|(t) is a time-varying error term with zero mean. If 
we assume that the real rate of interest, r(t) for < > 1, is constant over 

' Since this study’s empirical objective is to examine the labor-supply behavior of 
prime-cige male&. this assumption is not unreasonable. 
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the life cycle and equal to r, then the \ constant hours of work 
function reduces to 

In A/((0 = Fi + bt + b In W+ u,{l), (13) 

where b = S(p — r).* 

The intercept term in this equation represents a time-invariant 
component that is unique to individual i. This study treats Ft as a fixed 
effect. Since Ff contains In \( as one of its components, one cannot 
assume that Ft is a “random factor” uncorrelated with exogenous 
variables of the model. Inspection of equation (8) reveals that \ 
depends on the values of variables and constraints in all periods. By 
construction, F, is correlated with any exogenous variables used to 
predict a consumer’s wages or wealth. Hence, treating as part of the 
error term would result in biased parameter estimates of the labor- 
supply function. Treating F( as a fixed effect, on the other hand,' 
avoids this bias. 

Estimating the parameters of equation (12) only requires variables 
observed within the sample period. Regressions ol current hours of 
work on individual specific intercepts and current wage rates produce 
a full set of parameter estimates.® Because F, captures the effect of In 
\(, its estimated value summarizes all of the retrospective and pro¬ 
spective information relevant to consumer t’s current choices. As there 
is no need to forecast any life-cycle variables that are outside the 

‘There are other forms of the utility function that have convenient cinpinc.il 
.specificauons for the \ constant consumption and leisure demand functions. Two such 
functions are 

f/ilO = Ki(0(C,(() + Mf(01“‘lL,(() + 

<a*. <a > 0. Ol* + oj < 1 Ol >c,((). oj*. tu < 0, 

CM* (M 

> 0, <M*, CM < I . 

where kT(/), K^t), w*, diui <o are all parameiers. Both of these functions arc 

concave. They include Cobb-DouglavS, adctilog, CES, and Stonc-Gearv as special cases. 
The A constant functions for consumption and leisure aie log linear in A, wages, and the 
coefficients Kf(0 and «|{0. which represent speuhe modifiers of tastes Eliis stud\ u.ses 
the utility function given by (10) to formulate an empiricdi model lx*cause it implies a 
form for the labor-supply function whkh tan be leadilv compaied with lalx>r-siipplv 
equations found in existing empirical work. 

•If utility at age t depends on measured characteristics of the consumer that vai\ 
over the sample period, then cm tent values of these “taste-shifter ’ variables would also 
be included as legressors. A iiaUiral way lonm oducc such lasVc-shiflev variables is to 
mcxiel the taste coefficient Yj(0 *18 a function of the form In Y<(0 = «■( + - i/f(/). 

where is a vector of variables influencing tastes and is a pai ameici hoi this ca.se, 
enters as an additional linear term in the A constant labor-suppiv equation given 
by (12) or (13). 
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sample period, the X constant functions afford a considerable sim¬ 
plification of the empirical analysis. 

Use of the X constant functions allows one to estimate parameters 
needed to characterize dynamic behavior without introducing any 
assumptions regarding a consumer’s behavior outside the sample 
period. To appreciate this point, consider the problem of predicting 
the additional hours of work a consumer will supply in response to 
observing a higher wage rate than he observed at a younger age. I'o 
obtain an estimate of this response using a traditional model of life- 
cycle labor supply, one must formally incorporate the worker’s future 
plans in the model. For example, if the worker anticipates an early 
retirement, future wages corresponding to the retirement years do 
nf)t influence current labor supply. Thus, the researcher must not 
include these wages as explanatory variables. Such considerations lead 
to difficult data retjuirements and complicated estimation procedures. 
Using equation (12), oti the other hand, it is possible to analyze this 
prttblem without knowing anything about a worker’s future plans; an 
individual constant term for each worker accounts foi a worker’s 
future plans in a parametrically simple way. 

/in Empinca! Specification for Individvol Effects 

Estimation of the X constant labor-supply function given by (12) docs 
not tlirectly estimate all of the parameters retjuired to characterize all 
aspects of labor supply. Uifferences among individuals in itiitial 
wealth or lifetime wage paths affect the level of hours of work 
through F,. i’o explain any aspect of labor supply other than dynamic 
behavior (e.g., how the hours of work of two individuals differ at a 
point in time), one must confront the problem of predicting indi¬ 
vidual effects. 

From the theoretical analysis above, we know that the value ofT, or 
more properly X, is unitjuely determined by the implicit equation 
given by (8). 'This equation does not admit an analytical solution for X 
given the specific form of the utility function given by (10), even if it is 
known that this function applies to all ages and the consumer works in 
each period. I'he variable X is a complicated function of initial assets, 
lifetime wages, the interest rate, the rale of time preference, and 
parameters representing unobserved “ta.stc” variables.Using such a 
relationship as an empirical specification is not feasible. 

t he e<|ualion determining X( is 

= . 

- -«(0(l + p)%(V,(/)]'''“' " 
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This study assumes that equation (8) implies a solution lor A in 
which In A tan be approximated as a linear function of measured 
characteristics, the natural log of wages at each age, initial wealth, and 
an unobserved random variable representing unmeasured character¬ 
istics. With this assumption the implied equation for F, is 

/=« 

where Zj is a vector of observed variables (c.g., family background 
variables), tq is an error term, and </», y(l), and 0 are parameters 
assumed to be constant across consumers." I'his structuial relation¬ 
ship for/', implicitly assumes that each consumer has a working life of 
7’* -I- 1 years. According to the theoretical restrictions given by (9), the 
•y(/)’s and 0 should all be negative. 

Unfortunately, to formulate an estimable version of an equation for 
/'(, we require additional assumptions concerning the forms of the 
lifetime wage and income paths. In contrast to the A constant labor- 
supply function, estimating the parameters of equation (14) requires 
data which normally are not available. Most variables appearing in 
this erjuation are tiot directly observed, including the dependent 
variable/’i, wages outside the sample period, and initial wealth. While 
estimates of F, are obtained as a by-pi oduct from estimating equation 
(12), we still require a mechanism for predicting wages outside the 
satnple period and itiitial permanettt income. We do this by ititro- 
ducitig lifetime prohles for wages and income. 

This study assuities that the lifetime wage path is 

In = TToi + Itti, + + yi{t), (15) 


“ The cissiiinpiioM ihjl p.tiaiiicici s are ciHisuiit a< ross tonsuniei s is. of com se, 

on!) an appioMin.ition Foi nialiy. il c.iii l>e shown ihai 


y(0 = S 


() In X 
d In H'(f) 


- ^ 


a In MO 
a In X 


r I -■ 


ft In X 

'MToJ 




where E*{l) - .V(t)W'*(t) and W*(t) = W’(/)«(/) .iie die peiKMl 0 pie-.eiii r.ilue i)l 
eainings and wage.s ni peiiod 1. t his ielaiioiiship tf>i -ytf) is derisefi h\ ditterenti.iung 
Roy's identity tor A'(t) with lespeet to.-ttO) to ohiain the eqiialioii 


dX _ dA'(t) 
dw*(i) ^ a-t(()) 


+ .V(/) 


fix 

a^(()) 



d In .Vtt) 
d In X 



d In X 
d.-KO) 


X rdln.V(() 
IV*0)[ dInX 


+ 1 


d In X 
, d.d(O) 


A«(/) 


.Since the cinpiiieat sperihealion for the X coiisiain hours ot w'oi k lunriuin implies |d In 
iV(/)|/(d In X) = 8. we see that we taniiot foniiallv hare y{f) = S (d In X)/(d In tV’(/)| .mil 0 
= 8(d 111 X)/[d.-t(0)] ron.stant. Notice ih.it the effects of the inieicst i.ite anil time 
prcfetcnce on Ft are absorbeil into the coelhcieiits of speiiliiation (I t) 
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where ttoi, itu, and are linear functions of the form 


_;• = 0,1,2, (16) 

Ml is a vector of exogenous determinants of wages which are constant 
over the consumer’s lifetime (e.g., education and background vari¬ 
ables), gj,j = 0, 1,2, are vectors of parameters, and VM an error 
term. This path assumes that wages follow a quadratic equation in age 
with an intercept and slope coefficients that depend on age-invariant 
characteristics of the consumer.’* 

Predicting a consumer’s initial wealth is complicated by the fact that 
most data .sets do not contain extensive measures of even the con¬ 
sumer’s current wealth. Some measure of the consumer’s property or 
nonwage income during the sample period, however, is usually avail¬ 
able.'* Let y^t) and d^t) denote the property income and assets of 
consumer i at age 1. IfPjit) is the income flow generated by investing 
assets /f i(t) at a rate of interest equal tor, we have the relationship L|(t) 
= Alt)r. Assume that the following quadratic equation in age ap¬ 
proximates the lifetitne path for properly income; 

ViiO = Q!ot + tan + t*a2( + P((<), (17) 

where «(«, an, and an are linear functions of the form 

ocji = ^'iigj, j = 0,1,2, (18) 

Si is a vector of measured age-invariant characteristics of consumer i 
(e.g., education anti background variables), qj, j = 0, 1, 2, are 
parameter vectors, and u^t) is an error term.'^l'he intercept aot can 
be thought of as a measure of consumer t’s permanent income at age 
0; that is, am = /l,(fl)r.'’ 

Combining the lifetime paths for wages and income with equation 
(14) creates an equation for Ff that can be estimated using data 


the following analysis it is assume<l that this wa^e equation generates unbiased 
picdu tions tor lifetime wages. These prcdutions need not be efficient nor do they need 
to be the same piedictions used by consumers. It is pos.sible to inlrociute many alierna- 
live loj ms for the wage equation, .such as higher-ordcr polynomial.s or polynomials with 
other functions of lime (e.g., leciprocals) replacing / and t®, with only minor modifica¬ 
tions of empirical sfjecifications used iii (he following analysis. 

These income measures seldom include imputed income generated by consumer 
durables, which is a major source of propierty income Fc^r most consumers. 

In contrast to Y{t) is determined endogenously in this model. Kq. (17) can be 
viewed as an approximation to the optimal lifetime path For Y (<) expressed a.s a function 
of the exogenous variables of the model. 

*®Foi mally, this relationship between a<M and /4,(()) is correct only in a continuous lime 
framework,. When modeling the pioblem in discrete time, one must distinguish assets 
held at the beginning, at the end of the period, and exactly when asset income is earned 
within the period. 
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observed within the sample period. Substituting the wage process 
given by (15) and the relationship aoi = rA ,(0) into equation (14) yields 

Ft = Zt(f> + TTotVo + rrityi + -iTifyz + aofi + Vt> (19) 

or substituting relations (16) and (18) yields 

Ft = Ki4$ + at. ( 20 ) 

where 

T» 

= > = 0,1.2, d = e/r, ( 2 i) 

(=0 

Ki is a vector including all age-invariant characteristics determining 
either wages, income, or X (i.e., all the elements of Mt, Sf, and Z,), is a 
vector of coefficients, and tjj is a disturbance term which is randomly 
distributed across workers with zero mean. Equation (19) is a struc¬ 
tural relation.ship between F and the characteristics of a consumer’s 
wage and income profiles. The empirical analysis of this study focuses 
on estimating the parameters of this equation, <f>, 70 , "yi, 72 . and 6. As 
we shall see shortly, these parameters have a sound economic inter¬ 
pretation. Equation (20) is essentially a reduced-form equation for 
(19). By estimating the parameters of this equation, it is possible to 
predict how F varies across consumers using only age-invariant char¬ 
acteristics of the consumer as explanatory variables. 

III. Interpretation of Parameters 

In investigating the effect of changes in wages on labor supply, it is 
important to separate parametric change of the sort usually con¬ 
templated in comparative static exercises from evolutionary change 
due to movement along a life-cycle wage path.'* A parametric wage 
change refers to shifts in a life-cycle wage profile (e.g., a shift from 
path II to path I in fig. 1 ), while an evolutionary wage change refers to 
movements along a given profile (i.e., along any path in fig. 1). Thus, 
parametric wage changes refer to differences in wages across con¬ 
sumers, while evolutionary wage changes refer to differences in wages 
across time for the same consumer.'^ 

Consider the behavior of labor supply over the life cycle. As a 
consumer ages, he adjusts his hours of work in response to the 

'•This distinction goe.s back to Ghez. and Becker (197.'j). 

" t his statement is true only in an environment of perfect certainty tl there is 
uncertainty about the future, a consumer can experience parametric wage changes as 
he acquires new information atxtut his lifetime wage path For a discussion of these 
issues see MaCurdy (1980, in pres.s). 
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different wage rates he observes at each point in his lifetime. I'hcsc 
labor-.supply adjustnients represent responses to evolutionary wage 
changes; they reflect the consumer’s desire to supply more hours in 
those periods with highest wages. There is no wealth effect associated 
with this kind of wage variation since the wage profile is known tet tfie 
coristimer at the beginning of his lifetime and changes in wages are 
due only to movement along this given profile. It is apparent from the 
lalior-supply function given by (12) that the value of the parameter 8 
determines the hours of work response to evolutionary wage changes. 
Hereafter, I will refer to 8 as the intertemporal substitution elasticity. 
The theoretical prediction for its sign is positive. For the particular 
form of the utility function given by (10), 5 is also the direct elasticity 
of substitution for hours of work in any two periods. 

Now comitai e the labor-supply profiles of two consumers who face 
wage paths 11 and 111, respectively. As illustrated in figure 1, the wage 
profiles lor consumers 11 and III are the same except at age t' when 
consumer Ill's wage rate is iiigher than consumer ITs, I.et A denote 
the absolute value of this difference in period (' wages. This wage 
difference represents a parametric wage change because it involves a 
shift in the lifetime |)ath of wages. It causes the labor-supply profiles 
for consumers II and 111 to be different at all ages. Compaiing these 
labor-supply profiles is the sort of problem usually considered in 
comparative static exercises. In terms of the empirical model outlined 
above, this higher wage rate has two effects on consumer Ill’s labor 
supply. The first effect is on the value of F. According to equation 
(14), consumer III will .set a value for F which is lower than the value 
of F tor consumer II by an amount equal to y(l') • A. This decline in F 
implies that at all ages other than <' consumer Ill’s labor supply will be 
less than consumer ITs by a constant fraction. At age I' there is a 
second effect of the wage difference. Neglecting the decline in F, 
consumer Ill’s labor supply at age I' will be higher by an amount 
equal to 8 • A. Thus, the total impact on consumer Ill’s hours of work 
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at age l' is [S + 7(<‘)] "A. Since S > 0 and y(t') < 0, there is no sign 
prediction for 8 + y(t'), so consumer Ill’s hours of work at age may 
be greater than or less than consumer II’s. 

The parameters y(t') and 8 -I- y(t'), then, determine the difference 
in consumer Il’s and consumer Ill’s labor-supply profiles which is 
due to the discrepancy in their wage rates at age t'. I he quantities 
•y(t') and S + y(t') correspond to the usual concepts of cross- and 
own-uncompensated substitution elasticities. I hese elasticities de¬ 
scribe the response of labor supply to parametric wage changes. They 
can be u.sed to predict differences in labor supply acro.ss consumers. 
These elasticities do not directly provide information on the response 
of labor supply to evolutionary wage changes, so they cannot be used 
to predict differences in a given consumer’s labor supply over time. 
Since the intertemporal substitution elasticity exceeds the own- 
uncompensated substitution elasticities (i.e., 8 > S -I- ylt’]), one ex¬ 
pects an evolutionary wage change to induce a larger labor-supply 
respotise than a comparable parametric wage change. The wealth 
effect associated with a parametric wage change accounts for the 
smaller labor-supply response. 

Cotnparing the labor-supply profiles lor consumers I and II also 
involves a paramettic wage change. As illu.strated in figure 1, ton- 
suttier ITs wage pi ofile exceeds consumer I's by a constant ft action 
over the entire life cycle. The parametric wage change associated with 
moving frotn consutner 1 to consumer II’s wage profile is analogous to 
increasing the value of the intercept of the lifetime wage path, tto. 
’This has two effects oti labor supply in each period. First, a consumer 
adjusts his value of F in response to the profile shift. According to 
et|Uation (19), F declines by ati amount equal to = 2/^,, y{l) times 
the increase in the value of tth. This decline in F implies a fall in hours 
of work at each age. Second, there is a direct impact on each period’s 
labor supply. Holding the value of F constant, a consumer increases 
his hours of work by an amount ecjual to 8 times the increase in Wo- 
The implied total impact on each period’s labor supply, therefore, is 
-yu + 8 times the chatige iti tto- Becau.se yo is unambiguously negative, 
there is no sign prediction for this total impact. Since yo + 8 is less 
than y(t ') + 8, however, the response of labor supply to a shift in ito 
should be less in algebraic value than the response to a shift in the 
wage profile only at age/'. The wealth effect associated with a shilt in 
TTo is greater. The labor-supply profile for consumer II. then, can he 
above or below consumer I’s lalxtr-supply profile. It will lie alxjve 
consumer I’s if yo -t- 8 is positive. 

The empiiical specification of life-cycle supply given by equations 
(12) and (19) provides a convenient framework for estimating the 
response of labor supply to the different kinds of wage changes 
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described above. Estimation of the X constant labor-supply equation 
produces an estimate of the intertemporal substitution elasticity 8. 
This elasticity can be used to predict the response of labor supply to 
evolutionary wage changes; it provides the information one needs to 
describe a consumer’s dynamic behavior. Estimating the equation for 
F given by (19) produces estimates of the parameters yo, yi, y 2 , and d. 
These estimates provide the additional information one requires to 
predict the response of labor supply to parametric wage and wealth 
changes, and they can be used to explain labor-supply differences 
across consumers. Combining the estimates of y„, y,, and yj with the 
estimate of 8 allows one to predict how labor-supply profiles adjust to 
changes in the wage-path coefficients tto. tti, and 7r2. This includes 
both shifts and slope changes of the wage profiles. The estimate of 6 
provides the information one needs to predict the response of labor- 
supply profiles to changes in a consumer's initial permanent income. 
Estimating the empirical model proposed in this paper, then, fully 
characterizes a consumer’s lifetime labor-supply behavior. 


Three Substitution FAa.sttcUtes 

Nowhere in the above interpretation of parameters was there any 
mention of compensated substitution elasticities. In terms of the 
above notation, it can be shown using Slutsky’s equation that the own- 
and the cross-compensated elasticities are 8 + y{t) — E{t)9 and y(t) — 
E(t)0, respectively, whereE'(/) ^N(t)W{t) is real earnings at age/.'“In 
the analysis of life-cycle behavior, it is important to distinguish sharply 
compensated elasticities from the intertemporal elasticity (i.e., 6) and 
the uncompensated elasticities (i.e., own effects 6 -I- yVt] and cross 
effects yl/1) discussed above. 

Some researchers incorrectly infer that intertemporal and compen¬ 
sated substitution elasticities are the same.'** The fact that an indi- 


'"Aicording to .Slut-sky’s etjuaiion, 

aiv«) | _ dN(i) \ jy . aNp) 
a/v(/)|,/ dWiniAm ' a.4(o>’ 


So 


W(t)dN{t) _ WU)dNU) aillA/p) 

tV(<)dn’(o„ M{t) awo) A,o, a.4(0) 


= 6-1- y( 0 - /'-(d®. 


t here has been some confusion concerning the interpretation of wage coefficients 
estimated in labor supply studies such as those of Ghez and Becker (1975) and Smith 
(1977), who use synthetic cohort data. Because these studies estimate the response of 
hours of work to life-cycle wage growth, the wage coefficient is an intertemporal 
substitution elasticity. While this elasticity constitutes an upper bound for compensated 
and uncompensated elasticities, it is not one of the familiar elasticities associated with 
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vidual is at the same level of lifetime utility at all a^es in a world of 
perfect certainty suggests that responses in hours of work to changes 
in the wage rate over the life cycle represent compensated substitution 
effects, which in turn suggests that the iniertemporal and compen¬ 
sated elasticities are equivalent. These elasticities, however, are used 
to predict labor-supply responses to different kinds of wage changes. 
As discussed above, intertemporal elasticities determine hours of 
work responses to evolutionary wage changes. Compensated elas¬ 
ticities, on the other hand, are like uncompen.safed elasticities in the 
sense that they determine responses to parametric wage changes (i.e., 
wage changes due to shifts in wage profiles rather than movements 
along these profiles). Compensated elasticities, then, can be u.sed to 
predict differences in hours of work across consumers whose wage 
profiles are different and whose lifetime utility is the same. 

Formally, the intertemporal substitution effect can also be inter¬ 
preted as an elasticity that is associated with a particular kind of 
parametric wage change. In particular, it determines the response of 
hours of work at age I to a shift in the age I wage rate holding X or the 
marginal utility of wealth constant. Thus, whereas uncompensated 
elasticities hold financial wealth constant and compensated elasticities 
hold lifetime utility constant, the intertemporal elasticity is equivalent 
to a parametric wage elasticity that holds A constant. In the literature 
on consumer demand, this particular elasticity is known as the specific 
substitution effect.^® This correspondence between intertemporal and 
specific substitution effects is a direct consequence of the assumption 
that utility is additive over periods. 

While the three types of substitution elasticities are related, they are 
distinct and reduce to the same value only if income or wealth effects 
are zero. Assuming leisure is a normal good in all periods, these 
elasticities can be ordered as follows: 8 > 6 + y(0 — > 6 + ylt); 

that is, intertemporal respotuses are greater than compensated re¬ 
sponses which are in turn greater than uncompensated responses.^* It 


parametric wage changes. Thus, without impcwiiig iionirivial 1 csti iciioiis on piefei- 
ences, wage coefficients estimated using synthetic cohort d,it.T cannot l>e used lor police 
analysis of the sort encountered in predicting lalKir-supply lesponses to negative 
income lax experiments or other proposed changes in tax policies EKcwheie, .Smitli 
(1975) uses these estimated coefheients for exactly this purpose, iiameh, to piedici the 
effect of incoinc-maiiuenance programs on hoiiis of work. 

“See Phlips (1974, pp. 47-50) for further discicssion. 

*' We can relate these elasticities using solutions of what is known as the lundameiual 
matrix eejuation in the literature 011 consumer demand (spe, eg., ibid ) toi leisuie 
demand, it can be shown that: (1) the intertemporal effect is 


SL(t) 
dW(() » 


= 


where fi” is the (2, 2) clement of the inverse of the hessian matrix of /.(Ol, (2) 
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is also easy to show that interiemporal and compensated responses 
must be positive, while uncompensated responses may be either posi¬ 
tive or negative. Intertemporal and uncompensated substitution ef¬ 
fects, then, provide an upper and a lower bound for compensated 
substitution elasticities. 


IV. Empirical Analysis 

I'he A. constant hours of work, function given by (12) and the struc¬ 
tural equation for F given by (19) provide a manageable empirical 
model for analyzing labor-supply liehavior in a life-cycle setting. This 
model naturally suggests a two-step estimation procedure. In the fit st 
step, one estimates the parameters of the X constant labor-supply 
equation. This step provides an estimate of the intertemporal sub¬ 
stitution elasticity and all the information a researcher requires to 
predict how the hours of work of a given consumer will differ at two 
points in time. In the second step, one uses estimated “individual 
elTects" as dependent variables to estimate the structural equation for 
F which pioduces estimates of wealth effects and uncompensated 
suhstitutioti elasticities associated with shifts in the intercept and the 
slope of the lifetime wage path. This step provides the additional 
information one requires to predict how labor supply will differ 
across consumers. Using results from both steps, it is possible to 
compute the aveiage own- and cros.s-uncompensated substitution 
elasticities associated with wage changes in a single year. I'hese restiks 
also permit the calculation cjf upper and lower bounds lor compen¬ 
sated elastic ities. 1 his two-step estimation procedure e.xploits the spe¬ 
cial characteristics of panel data to character ize life-cycle behavior 
with a minimal amount of computational burden.^* 


the compnisaicd ctfcil is 

a/-(O j _ dr{i) \ 

dn/(/)|( ajt'ioix lac4(0)J' 

vvlun; n — (())]“* > 0; and (3) ihe uiKompcnsaicd filcct is 

dW) I _ dl.U) I , 

aw{t)\^ dW(t)\„ '’a,4(0)' 

So, it L{t) IS a normal f;oocl and U is concave, we have 

aL(0 I 9/.(f) I ^ dl.(l) I 
dtFthli, dn't/) 1/ 

'I'tic proposed rcslnctioiis on labor-supply elaslicnics follow immediately using this 
ic.sult. 

“There is an ahernaiive strategy for estimating this structural model of lifetime 
labor supply. Instead of the procedure suggested above, one could use a one-step 
pnx-edure. Substituting the right-hand side of eq (19) forf directly into the \ constant 
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The empirical work reported here uses the randomly designed 
sample from the Michigan Panel Study of Income Dynamics. The 
sample consists of ob.servations of 51S prime-age, white, married 
males for the years 1967-76. Only males continuously married to the 
same spouse during the period 1968-77 and who were 25-46 years 
old in 1967 were included in the sample.^ The lalxrr-supply variable 
u.sed in the empirical analysis is annual hours of work. The wage 
variable is average hourly earnings deflated by the Consumer Price 
Index. 

To avoid confusion in this .section we must be careful to distinguish 
between age variables and indexes representing a sample period. As 
in the previous discussion, I denotes the age of a consumer; in par¬ 
ticular, I — 0 when the consumer is 25 years old, t = 1 when he is 26, 
etc. The index J, on the othet hand, denotes the sample period ;7 = 1 
when the observation is from year 1967,; = 2 when it is from 1968, 
etc. The following analysis assumes that there are a total of r sample 
periods (r = 10 for the .sample used in this paper). Finally, the 
notation ^(j) denotes the consumer’s age in sample periotl;. Thus, In 
W,(t) is consumer I's wage at age /, In IV,( j) is his wage in period j, and 
in IV,(0 = In Wll{j)\ = In 1V,(/). 


Estbmles of the Interlemponil SubslUution ElasttcUte^ 

Estimating the parameters of the \ constant labor-supply equation is 
simplified by working with a firsi-diffei enccd version of this equatioti, 


Idl3<)i -supply mveii by cither (12) or ( I ‘^) pi o<Iuccj» j new cqudiion coiuainiu^ 

rill o( flic Mi iK furul p.iidincicis of inicicst It is» possible to estimate this equation ni a 
single step using ctmstraiucd .siinuUaneous-cquaiion esiiination piotedincs. The iwo- 
-step piiKcclure offers advantages over this onc-siep iiiciIiimI when theie cm.sIs inis- 
sjX'cihLaiion of the c(|uatiou loi t' The presence ol any inisspctihtatmn errors ni the 
/^-equation Icarls to inconsislcnt csdinalors for all paiainctcrs of the laboi-suppI\ 
equation in the lase o( the one-step pyfxcduie. Foi the iwo-stcp estnnanon scheme, 
however, only the second-step csliinalois for the paiaineteis of the /'-equation aic 
incorisislent. 

Hoiithakkcr and L'aylor (1970, < hap. 5) and Thlips (1974, pp J 90-93, 25()-f>0) have 
estiniatcfl inargina) unlit) ol wealih constani demand fuiKtions as an intei mediate 
coinputation.il step low.ird estimating a system of oidmaiy dem.md luiKtions In 
contrast to their woi k, I directly estimate the A coiisiam dem.md functions .is a means ol 
characterizing a consumer’s dviiaimc behavior. I treat A as a fixed elleri which smii- 
inarizcs the ellect of historic and lutuie mioi maiioii on curreul decisions I thank 
James Mcckm.in and Orley AslicnfVltcr foi the Hoiuhakkei-1 avior relcience 

^ A worker had to satisfy the following criteria .is w'ell to he included m the sample 
(1) He must be classified as employed or uiieniploved (i.e , peniianentlv dis.ibicd and 
reured wcie deleted). (2) Wage and lalKir-siipply data must be available for all veai s (3) 
A woi ker must leport less than 4.680 houis vvoi ked ^ler year. The absolute value ol the 
dill Cl elite in his leal average houi ly eainings iii adjacent yeais cannot exceed $16 f)r a 
change ol 200 percent. 'I1ie absolute value of the diflevence m the number ot houis he 
works in adj.accni years caiinot exceed 3,000 houis or a change ol I90 pcicent The 
purpose of this last criterion is to minimize difh<uhies arising lioni the presence of 
outliers. 
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First differencing equation (12) yields 

D )n Nli) = 8[p - r(j)] + SZ> In W + e^j), j = 2 .r, (22) 

where D is the difference operator, that is, D In iV((j) = In N^j) - In 
^Aj “ 1). and e((j) = Duiij) is a disturbance. This equation relates a 
consumer’s labor-supply changes to changes in his wage rate. Notice 
that differencing eliminates individual effects, and thus it avoids the 
introduction of incidental parameters. Step one of the estimation 
procedure is to use this equation to estimate the intertemporal sub¬ 
stitution elasticity, 8. 

Combining these equations for a given worker into a single system 
of simultaneous equations creates a model that is particularly well 
suited for an empirical analy.sis. Stacking equation (22) for worker i 
according to thej index yields 


D In A',(t) 
DUxNIt- 1) 


ft. 

ftr-X 


Z> In W^t) 

O In fT,(r - 1) 


Hr) 

Hr - 1) 

D In N0) 

D In .V,(2) 


fSa 

^ _ 

-f 

D In Ni(3) 

D in (Vi(2) 

6 -f 

€((3) 

H2) 


where = S(p — r(J)\,j = 2, . . . , t. Putting this system into vector 
notation, we have 

l)]nNi = 13 + DlnWfi + et, t=l.n, (24) 

where D In N ,, )3, D In Ni, and are (r — 1) x 1 vectors with D In N\ ~ 

[D In Ni(r) __ Z> In N,(2)}, /3' = (ft,, _ ft^), D In JT; = [D In W,(t), 

...,/) In ITi(2)], = fcifr), . . . , €((2)], and n is the total number of 

woikers in the sample. The following analysis assumes that the error 
vectors cj are independently distributed across individuals once com¬ 
mon time effects are removed with the inclusion of year dummies in 
the labor-supply equations. No restrictions arc imposed on the 
covariance matrix of e,, which permits arbitrary forms of serial corre¬ 
lation. 

The parameters of equation (24) are estimated using standard 
two-stage and three-stage lea.st-squares procedures which permit the 
imposition of equality constraints across equations.The wage- 
growth variables, D In = 2, . . . , t, are treated as endogenous 

variables. The set of instruments used to predict!) In fF,(j) includes 

The iwo-stage procedures must account for the tact that the covariance niatnx of 
the ti’s is not proportional to the identity matrix when computing standard errors for 
the estimates. 
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family background variables,*® education, age, interactions between 
education and age, and dummy variables for each year of the sam¬ 
ple. Estimation of (24) using simultaneous equation methods takes 
advantage of the time-series aspect of panel data to estimate the 
intertemporal substitution elasticity S with a minimal amount ot com¬ 
putational burden. These methods avoid biases arising from pure 
reporting error in earnings and hours of work, and they offer a 
flexible framework for testing and estimating alternative functional 
forms. 

Another equation that can be used to estimate S is one that relates 
changes in hours of work to changes in earnings. AdditigS • D In^V^;) 
to both sides of (24) and solving this new equation for D In jV((j) yields 


D \nN^) 


g[p - >'(;)] 

1 + 8 




C(0) 

1 -t- S ’ 

J = 2. . 


, T. 


(25) 


where E^) = is real earnings in period j. Slacking these 

equations creates a model like (24), except that D In IT,, 5, and 6( arc- 
replaced by D In E(, 8/(1 + 8), and (1/(1 -t- 8)Jei, respectively, and the 
elements of /3 become /3j = {8[p - r(;)l}/(l + 8), 7 = 2, . . . , t. Exactly 
the same prexedures described above for estimating equations (24) 
are applied to estimate the parameters of the stacked representation 
of (25). Using the coefficient on earnings, 8/(1 + 8), it is possible to 
construct an estimate of 8. 

Table I presents estimates of the intertemporal substitution elastic¬ 
ity. Two specifications of the labor-supply equation arc considered. 
One a.ssumes that the interest rate, r(l), is constant over lime, and it 
constrains intercepts in the labor-supply equations (i.e., the elements 
of /8) to be equal over the sample. The other allows r( t) to be different 
in each period by including dummy variables for each year without 
any constraints on their coefficients, which permits intercepts to be 
different each period. These alternative empirical specifications yield 
similar results. All of the implied estimates of the intertemporal sub¬ 
stitution elasticity are positive. According to the estimates of the wage 
coefficients, 8 lies in the range .10-.23. The earnings coefficients 
indicate a range of .25-.45 for 8. The earnings coefficients indicate a 


Family background variables include the cducaiion (in years) of boih the father 
and the mother of the consumer .md dummy variables indiiating parents' economic 
status at the time the consumer was growing up. Both education and cducaiion s(|uaied 
and interactions between these two education variables and age are included as instru¬ 
ments. Coefficients in the "pteiliction ec|uatioii" for wage growth are constrained 10 be 
equal across time periods. Formal hypothesis tests accept this lesiricuon. In the appli¬ 
cation of constrained three-stage least squares, the wage-growth cquauoiis are not 
treated as part of the simultaneous-equation system. 
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TABLE 1 

Si Mill. I ANKOLlS-EqUAI ION Es PIMATION OK FlRSl-nifFK RK.NC.KI) LaBOR-SuVPI.Y-EqIIA I ION 
Esi IMAI tS OK IHF. InTKHIEMPORAI Sl'BVI ITU J ION El.ASTlf.lTY 


Esiiiiialion Average of 

Procedure O(I.ogWagc) /)(Log Earning)* Intcrcepl Ycai Dummies 



23 


- 009 



(2.42) 


(4.02) 


SSl.S 

.14 


-.008 



(1 97) 


(4.26) 


2SLS 


35 

-.006 




(2.22) 

(4 26) 


sst.s 


.25 

- 006 




(2.63) 

(5 18) 


2.SI.S 

15 



- 008 


(.98) 




:isi.s 

10 



-.008 


(.80) 




2SI.S 


45 


-.007 



(1 54) 



:t.sLs 


30 


-.007 


(I.<i7) 


NniE —v.ilufs ot /-'il.uisiits .iir ii» 

• I Ilf csimuitcs .iimI I si.mslK n Ml l!i<* ■ l.<»K i .n immjj" «<»U»nii» mjc (or fi, ihrv ji c t ompulrcl U5iii>j ihc 

((M-f (k u'iu on fill nitiKS, <irnti(c<1 i/i. Aiid itsr-si.ittst$LS Wf lijvr ft 0/( 1 - «^) I'oi oiivci i ihcr-Matisius ic]m» tcci loi ij; 
to i!ioi>r for 6 ie<|Hirf> divisKm liv the ({luniiiv (I ^ ft)* cv.ilu.necl .ii ft = ft 


higher estimaie for 8 in alt cases, but these differences .are small 
relaiive to their siaiidarcl errors. The estimates of the intercepts 
indicate that the real rate of interest exceeds the rate of time prefer¬ 
ence on avetage by about 2-4 percentage points, 

A compaiison of these results with others in the literature is 
difficult since most studies use cross-section data for their empirical 
analysis where differences in lifetime wage paths are the primary 
source of wage variation acro.ss observations. As a consequence, they 
do not estimate the intertemporal substitution elasticity. Estimating 
equation (IS) using cross-section data is complicated by the presence 
of individual effects, Fj. As discu.ssed above, economic theory implies 
that Fi is correlaterl with a consumer’s wages and all of his other 
characteristics. Therefore, it is not reasonable to assume that /■'( is a 
“random factor” uncorrelated with explanatory variables. One, then, 
cannot directly use ob.servations on individuals from a cross section to 
estimate the parameters of (13) even if one uses simultaneous equa¬ 
tion estimation procedures. Such procedures implicitly treat indi¬ 
vidual effects as random variables, and this leads to inconsistent 
parameter estimates. Eo estimate the intertemporal elasticity using 
cross-section data, one requires a specification of the labor-supply 
equation where variation in wages reflects evolutionary wage change. 
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One approach constructs synthetic cohorts to estimate equation (13) 
using cross-section data as implemented by Ghez and Becker (1975) 
and Smith (1977). A synthetic cohort is constructed by computing 
geometric means of’wage rates and hours worked for each age group, 
and it is assumed to represent the life cycle of a typical individual. The 
basic assumption underlying this approach is that there are no cohort 
effects, so that group individual effects (i.e., the aveiage of the F/s 
for each age group) are the same for all age groups after adjusting 
for “smooth" vintage effects. In this case, least-squares estittiation 
of equation (13) using synthetic cohort data protluccs consistent 
parameter estimates.^* This approach allows for measured "taste- 
shifter’’ variables such as fatnily size and age. In principle, this ap¬ 
proach accounts for the endogeneity of wages and measured charac¬ 
teristics by using group averages as instruments. The problem of 
treating group effects as random vaiiables does not arise since it is 
assumed that they are the same for all age groups. 

I’he estimates of the intertemporal substitution elasticity obtained 
by Ghez and Becker and Smith are comparable to those estimates 
presetued in table 1 of this paper. Becker forms synthetic cohorts 
using the 1960 U.S. Census, and he obtains estimates tor 8 rangmg 
from -.068 to .44.^^ Smith, on the other hand, treats the family as the 
relevant decision unit atid uses the 1967 Survey of Economic Oppor¬ 
tunity to fortn his synthetic cohorts. He estimates S to be about .32.*" 
Comparing these estimates of 8 and tho.se leported in table 1 suggests 
that cohort effects do not .seriously bias estimates based oti synthetic 
cohort data. 


Estimale'i of Responses to Parametric Wof^e Changes 

Estitnating the struc'tuial equation for F' given by (19) prosides the 
additional infot tnatioti we require to predict a consumer's labor- 
supply response to parametric wage changes. Estimating this equation 
is not as difficult as it tnay first appear. It is true that all of the 
variables appearing in this equation (F„ TTot, ttu, tth. and Ooi) are not 
directly observable. But it is possible to construct observable quantities 
that have expected values equal to these variables. If one replaces the 
unobserved variables by their observed counterparts, one can etnplov 
standard two-stage least-squares procedures toestitnate the structural 
parameters of interest. 

“ (ihez ,tnd Becker and Siiiilh do not intcrpiei their pai.iniever csuinates .is those ot 
a \ con-stant labor-supply funt lion. C.iveii their log-liiiear spec ifications, however. In A i.s 
absoibed into the intercept of their regression sperifications. 

Estimates ohl.iined from tihez anil Becker (1975, pp. 112, 1141. 

” Estimates obi.iined from Smith (1977, p. 244). 
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First consider the coefficients of the lifetime path for wages. Define 
the difference operator as In ~ ^i(j) ~ Iti IV,(J — k). 

Applying this operator to the wage equation given by (15) and divid¬ 
ing the result by k yields 

- A] + 

Subtracting ZJi In fF((2) from this equation and dividing the result by 
\/{2j — k — 3) creates a new equation; 


1 

2/ - A - 3 


Pfcln ^ 

k 


77-21 + 


(2; - k - 3)A 


D,V^2) 

2j-k-2,' 


Notice that the dependent variable of this last equation has mean 
ecjual to TT-n- Replacing tth in the previous equation by this dependent 
variable allows one to create another observable variable whose mean 
is n,i- Further substituting these two observable variables for ttj, and 
77,/ in the original wage equation allows one to create a third measur¬ 
able variable whose expected value is ttos- Following this strategy and 
taking aveiages to use all the available data, considei the following 
definitions; 


772, = 


1 y ± r Dj^,ln ^V,(J + 2) 

T - 2 5^ / [ ;• + 1 


- D, In 1V,(2) 


( 26 ) 


7r,/ = 


1 y iDj In 1F((> + 1) 

^ - I I ; 


- 772,[2f(/ + 1) -,/] 


(27) 


•"■Of = - ffzitK;)?}. (28) 

^ j=i 

where t is the total number of sample periods. It can be shown that 
E{77hd ~ 77j,i, /j = 0, 1, 2. This is an important result because the ■um's 
are observable variables, and it is possible to use the tt as dependent 
variables in a simultaneous-equation analysis to estimate the TTh/s 
consistently. 

Similarly, given observations on consumer i’s income over the sam¬ 
ple period, one can construct the variable do< using definition (28) with 
Kj(j) replacing In WIj)-, do(, then, can be used in a simultaneous- 
equation analysis to predict the intercept of the lifetime income path, 
aoi, which is a measure of consumer i’s initial permanent income. 

An analogous strategy can be used to construct a measurable vari¬ 
able to serve as a proxy for F,. From equation (13) we see that an 
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average of the quantities In N^) — bt(j) — 5 In W^) has an expected 
value equal toi^j, but unfortunately, this average cannot be directly 
observed since it depends on unknown parameters b and 6. We have 
estimates of these parameters, however, from the first step of the 
empirical analysis. A logical alternative to the above average, then, is 
to form the variable 

= T 2 f" ^<0) - l>tU) - S In 

^ j=i 

Asymptotically, Fi has an expectation equal to F(. 

Collecting the above results, we have a complete simultaneous- 
equations model given by 

■iThi = + Vh. h = 0, \ ,2, ( 29 ) 

«(H - + ^ 3 , (30) 

Fi = 4> ^ n-oi-yo + 7f„y, -t- -t- doiff + r/t, (31) 

where the vectors of exogenous variables Af, and ,S'( and the coefficient 
vectors g„, g,, gj, and ^0 defined by (16) and (18), and the tja’s are 
disturbances. We have one set of equations for each consumer i. The 
endogenous variables in this model are Fj, do,, and the tt^’s; the 
exogenous variables are the elements of M, and 5,; and the structural 
parameters of interest are -yo. yi. yt, and 6. The vectors M, and .5, 
contain variables determining the coefficients of the lifetime wage 
and income paths. In the following empirical analysis, they include 
the consumer’s education, his education squared, and family back¬ 
ground variables. 

I'o estimate the parameters of the structural equation for F given 
by (31) consistently, one can employ a standard two-stage least- 
squares procedure. The standard errors repoi led by this procedure 
are valid if the number of time-series observations for each consumer 
is sufficiently large.-''® 

Table 2 presents estimates for the structural parameters of the 
individual effects equation given by (31), where F, is computed using 
three different sets of estimates for h and 8. The estimates of yo. yi. 

““Family b.K-kgrouiid variables include the cdm.ition of boih die tadier and the 
mother of the coii-suiner and dummy variables indicating paicius' economic status at 
die time the consumer was growing up. 

““ If one docs not have a sufficiently large number of these time-series observations, 
however, the usual standard errors are invalid. The pioblein lies in ihc fact that we 
use estimated values for b and 8 to torm the dependent variable F,. In cases where the 
number of time-series observations is .small, one must adjust the usual standard errors 
to account for errors in estimating b and 6. The precise form of this adjustment can be 
obtained from the author upon request. While this ad|ustinent is not complicated, it 
does require the use of matrix operations. This ad|ustmcnt was verv minor in every 
instance for which it was u.sed in this study, which suggests that even 10 observations 
per person is large enough to neglect making anv adjustments in standard errors 
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TABLE 2 

SlMUl TANKOLIS-EQllATiriN EmIMATION OF FIXED-EKFEC 1 S EyUATIONS 


i, S 

Yo 

Yi 

Yi 

e 

Iiiteruepi 

-.009, .1 

-.0.5 

-.83 

-10.47 

-.00026 

7.81 


(1.5) 

(1.4) 

(1.2) 

(.48) 

(157) 

- 009, . 1 

-.07 

-1.08 

-1.3.03 

-.0001 

7.77 


(1.78) 

(1.65) 

(1.31) 

(.16) 

(137) 

-.009, 2‘i 

-.10 

-t 46 

-16.86 

,0001 

7.68 


(2 13) 

(1.9) 

(1.4) 

(-2) 

(119) 


Note - Alnoluic v.iliies of /-stJiiMu s .iie tn paictMhrvs I he van.iblc F, (i e , ihe pi oxy vaiKihle tor Fj) 

IM oiiipiiU'ii using the ffstiiD.ites A .ind d icporln! m ihr stub (oliiinti. Iixoiiie li nie.tsiiird in ihou«.iUiis ot ilolJdis 


and 72 are all negative as theory predicts. These estimates decrease 
monototiically as one uses a higher estimated value of 8 to compute 
the proxy variable for individual effects, Fj. Since uncompensated 
substitution elasticities associated with permanent wage changes are 
calculated by adding the estimate of 6 to the estimate of -y^, an inverse 
relationship between the estimates of 8 and 7 „ is required if uncom¬ 
pensated permanent wage elasticities are to remain constant for dif¬ 
ferent chokes of 8 in computing F,. 

All of the estimates of the initial peiinanent income coefficient, 9, 
are statistically insignificant and very small, I’he measure of properly 
income (i.e., T,! tj) used in this empirical analysis is total family income 
minus husband’s earnings in thousands of 1967 dollars. While the 
esiimaies of 6 are negative as theory predicts when low estimated 
values of 8 are used to conslnutwe see that the estimated effects of 
a change in income on hours of work arc minute; a $1,000 increase in 
initial permanent income leads to at most a .026 percent decrease in 
hours of work. Several other measures of properly income were tried 
in the empirical analysis;®' in every case the estimates obtained for 6 
were of the order of magnitude reported in table 2. 

(Combining estimates of yo uiid 8 ;illows one to form estimates of 
trass- and own-uncompensated substitution elasticities and bounds 
for compensated elasticities associated with wage changes in a single 
year. Dividing the estimate of -y„ by the length of the working life 
jjroduccs an estimate of the average cross-uncompensated elasticity. 
Using results from the second row of table 2 and assuming a working 
life of 40 yeai s implies a cross elasticity equal to —.0018. Adding the 
estimate for 8 (which is .1,5 for the second row of table 2) to this cross 
elasticity implies a value of .15 for the average own-uncompensated 

■’‘One measure of income excluded wife’s earnings. Others included imputed in¬ 
come for house ownership. Due to data limitations, it was not possible to include im¬ 
puted income from other forms of consumer durables, which certainly constitute a 
major component of a consumer’s nonwage income. 
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elasticity. Since the average own-compensated elasticity lies between 
the intertemporal and the average own-uncompensated elasticity, we 
conclude that this compensated elasticity is also approximately equal 
to .15. Increasing a consumer’s wage rate in period / by 10 percent, 
then, leads to about a 1.5 percent increase in his hours of work in 
period t and approximately no change in his hours of work at other 
ages. 

Combining estimates of y^, 7 ,, and with estimates of 8 provides 
the information needed to predict a consumer’s labor-supply re¬ 
sponse to shifts in his wage profile. In response to a uniform 10 
percent increase in wages at all ages (i.e., a parallel shift in the log 
wage profile), the estimates of the .second row of table 2 predict that a 
consumer will adjust his hours of work by an amount equal to (-yj 4- 
8)10% = ( — .07 -f- .15)10% = .8 percent at all ages. There is, then, a 
small positive response in a consumer’s labor supply to parallel shifts 
in his log wage profile. If the slope of a consumer's wage profile is 
altered by changing the coefficient on the linear term (i.e., tt,,) by A 
percent, the estimates for y, and 8 indicate that his hours of work at age 
t change by (y, + 8 t)A% = (—1.08 + .15/)^%. Recall that t here 
measures a consumer’s age and takes a value of 0 when the consumer 
is 25 years old. Hence, hours of wot k decline at early ages (i.e., prior 
to age 32) and they increase at later ages iti response to this sort of 
increase it) the slope of the wage profile. The same is true when the 
slope of the wage profile is altered by changing the coefficient on the 
quadratic term (i.e., v-,,) by A percent. Hours of work at age t ad just by 
an amount equal to (- 13.03 -t- .15t*)A%, which also iniplies a decline 
in hours of work at younger ages (i.e., prior to age 34) and an increase 
at older ages. 

The estitnates above of uncompensated elasticities are generally 
cotisistent with results found in cross-section studies and the popular 
iKStion that the lifetime labor-supply curve of pi ime-agc inales is not 
very responsive to permanent wage changes. If, in a cross-section 
analysis, one purges wages and income of their transitory components 
using a simultaneous-equatioti method and interprets the estimated 
coefficients as sotne sort of “lifetitne average" relationship, then one 
finds small positive estimates for wage coefficients and negauve or 
zeio estitnates for income coefficients for prime-age males,which 
agiees with the results abtjve. In contrast to previous work, however, 
the estimates above also indicate that to predict the lesponse ot labor 


” Aitoiinting Ibi ihc ciulogfiicilv of In tTit/) .ind )',(/) h.is Ix^eii shown 10 have <1 
signiluant etfecl on rio.ss-.scctioii esiim.ilcs. DaVanzo, Dcli.iv, .nut GrccnbeiH (1976) 
find Lhat treating both wages and inroiiie as endogenous sariables leads 10 |xisiiive 
estimates for wage coetlicienls .iiid negative or zero esiiniaies lt>r inroine crtefht tents 
for prime-age men, which ate toiiMsienl with the empirical resull.s reported alxive 
Neglecting this endogeneity piotliices estimated coefficients with opposite signs 
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concentrate on the intermediate case where the intertemporal pattern 
of expenditures is unchanged. In this case, at a given value of r, the 
dechne in current C'^ net of the increa.se in Y’ exactly matches the rise 
in G.® 


The Role of Public Sendees 

The government is viewed as utili/ing its commodity purchases at 
each date, G,, to provide a contemporaneous flow of public services to 
the private sector. These services are treated as provided free of 
charge to household/producers. Two types of services are considered. 
One form is modeled as a direct conveyer oi utility to households. 
Examples which do not encompass the traditional roles of govern¬ 
ment include parks, libraries, school lunch programs, (subsidized) 
hospitals, and, possibly, highway and transportation programs. (The 
latter category could be viewed alternatively as an input to private 
production futictions.) An important feature of these lot ms of public 
services is the possibility of close substitution with private consumer 
spenditig. 

The second type of .service is an input to private production pro¬ 
cesses, which can apply either to busines.ses or households. Examples 
include the provision of a legal system, aspects of national defense, 
hre and police services, education, and various regulatory activities. 
(The last item is likely to exhibit negative marginal product.) In some 
cases these services would be close substitutes for private inputs of 
labor and capital. However, in areas like the provision of a legal 
system and national defetise, the public services are likely to enhance 
the marginal products of private factors. 

In many situations a particular government activity would exhibit 
features of both general types of public .services that are being consid¬ 
ered. The extent to which each feature was repre.sented would vary 
across a wide range of programs. Despite this real world diversity, the 
formal analysis proceeds as if there were a single type of govern¬ 
mental activity, which has service attributes that are partly of the 
direct-utility type and partly of the productive-input type. 

Government-provided services are olten modeled as “public,” as 
oppo.sed to private, goods in the sen.se of being “nonrival”—one 
person’s enjoyment of the good does not diminish the enjoyment by 
another person.'' It is doubtful that this characteristic applies to the 

’ Thi.s result doe.s not depend on an infinite honzon tor the representative household. 
Finite lives can alter the effects of sonic government actions, such as changes in public 
debt or social security, that involve a publicly mandated shift in incomes across genera¬ 
tions. Cioverntnent purcliases that are hnanced conteinporaneously by taxes (which are 
intlependent of age) do not involve these considerations. 

^This characteristic is embedded in the theoretical analysis of Samucison (1954). 
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majority of government purchases at the current time. Falling 
outside of this category would be the bulk of expenditures on educa¬ 
tion, hospitals, school lunch programs, and any service that is subject 
to congestion, such as parks, courts, libraries, transportation and 
highway projects, and police/fire services. Even in the case of national 
defense, the benefits to individuals are likely to be relative to the total 
amount of property that is being tfefended, because the level of 
external threat would respond to the potential prize from conquest 
(see Thompson 1974). 

In a nonrivalry situation, individual utility or production would 
depend on the total of government services rather than on the quan¬ 
tity provided to the particular economic unit. Because nonrivalry 
seems atypical, the modeling assumes that individual utility and pro¬ 
duction depend on real government purchases per capita. However, 
the general form of the analysis would not be altered appreciably if 
some elements of nonrivalry were introduced. 

Consider, liist, the direct interplay in utility functions between 
government services and household choices of consumption and lei¬ 
sure. Suppose, as stressed by Bailey (1971, chap. 9), that the contem¬ 
poraneous levels of (per capita) C, and (individual) C, are close sub¬ 
stitutes in utility terms. For example, assume that each unit of G, (per 
capita) is viewed as providing utility services that are equivalent to a 
fraction 6 of a unit of contemporaneous individual consumption 
expenditure.’ That is, household utility depends on the effective 
consumption How at each date, Cf = C, + ffG„ where 0^0^ 1. The 
formulation neglects this type of utility substitution among noncon- 
teinporaneous values for C and G or between G and leisure.® I he 
provision of these types of public services means that households 
cjbtain units of effective consumption, G*, that exceed the quantity tif 
private real expenditures, C. I'he permanent flow of government pur¬ 
chases, G, can lx‘ used to finance the uniform effective consumption 
flow, BG J T his aspect of public services offsets the negative perma¬ 
nent income effect from G that was described earlier (see also n. 6 
above). Fhe permanent income effect that is pertinent to prix^ite 
choices on consumption expienditure and leisure is now (B — 1)G— 
the condition 0 ^ ^ 1 implies that the permanent income effect of G 

is still less than or equal to zero, hut no larger than one in magnitude. 

' t he paraiiietci 0 tan be viewed in the tollnwin); an.ilvsiv as appb ing 10 ilic iii.iiginal 
unit ot G,. 

“ t he time path of G, could also alleo overall household utilitv in a toriu ihat was 
addiiively separable from the time paths ol etiectivc consumption and leisure In this 
sense the 6 parameter need not Itinit the utility salue that households attach to public 
services, t his possibility does not invalidate the subsetjueni discussion ol peimaiieni 
income changes that are induced by shitts in G. 

’ t he ciiscussion a.sstmies that the inetjiialitv conslraint, C, > (), is never binding. 
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With G held hxed, an increase in G, now imphes some direct 
crowding out of contemporaneous private commodity demand. In 
order to maintain ^e level of effective consumption, Cf —which is 
appropriate when G, the other determinants of (jermanent income, 
and r are held fixed— Cf must decline with Gt in accordance with the 
parameter d. The greater the utility substitution (at the margin) be¬ 
tween Cl and G, —as measured here by d —the larger the negative 
response of Cf to an increase in G,. As long as 6 ^ 1 applies, aggregate 
commodity demand for date t — Yf =Cf + Gi —rises as the nonnega¬ 
tive fraction, (1 — 6), of increases in G, (when G is held fixed). 

Consider next the role of public services as an input to private 
production processes. It is assumed that public services of this type 
have a positive marginal product, which is denoted by MPG. The 
condition MPG 1, which is assumed to hold, implies that the 
marginal response of (aggregate) private output to an increase in 
(aggregate) 0 does not exceed the social cost of providing the extra 
public services input. Through its role as a productive input, an 
incretnent in G raises commodity supply, V", for given levels of private 
factor inputs. Note that part of total output will be utilized to provide 
intermediate goods, which take the form of publicly supplied pro¬ 
duction inputs. Although there are good reasons in principle for 
delettng these intermediate goods from measures of final product, 
this approach is tiot followed in the national accounts.* In the 
theoretical analysis the output measure, Y, is al.so gross of this type of 
intermediate production. 

Because the public-.service inputs are provided freely, a change in G 
alters private sector real incomes in accordance with the marginal 
product, MPG. The representative household receives a per capita 
share of this extra real income. This effect further offsets the inverse 
influence of G on permanent income. The net effect now depends on 
the term (d + MPG — 1), which is nonpositive but no greater than one 
in magnitude if 0 0 -t- MPG 1 applies.* 

If variations in G alter the marginal products of private productive 
inputs, then additional effects would arise through changes in factor 
demand functions. Cases where public services substitute for private 
inputs—such as the provi.sion of public rather than private 


“ t hese matters are discussed in Kur.neis (1948, pp, 156-57) and Musgrave (1959, 
pp. 186-88). The double-counting property for publicly provided production inputs 
implies that empirical counterparts of total output, like real GNF, overstate the re¬ 
sponse of final output to government purchases. 

“ Recall that the analysis deals with a tomposite government service that has attributes 
of the direct-utility (fl) and production-input (MPG) type. A particular category of 
purchases is unlikely to exhibit a high value lor both parameters, 8 and MPG. There¬ 
fore, tf 9 e 1 and MPG « I apply for each category of purchases, the condidon 6 + 
MPCi s 1 is likely to hold for the composite over all categories. 
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guards—would generate reductions in the marginal product of labor. 
On this count, the private demand for labor would tend to fall when G 
rises. In some other cases the private demand for capital services 
would decline. However, to the extent that backgiound services like 
national defense and a legal system are expanded, factor marginal 
products are likely to rise, which would generate the opp>osite re¬ 
sponses in private factor demands. The main analysis neglects the 
array of possible effects of government purchases on private factor 
marginal products. 


Commodity-Market Clearing 

The equilibrium condition for the commodity market is given by 

Y<^ 3 C\ . . ,r, G, G) -t- G = Y‘(. . . , r, G, G). (3) 

(-)(-)(-) (+)(+)(+) 

Recall that the analysis has neglected the investment component of 
demand and has, thus far, assumed lump-sum taxation, l ime sub¬ 
scripts have been omitted for convenience. Signs beneath the inde¬ 
pendent variables refer to partial derivatives. The omitted arguments 
in the C'^ and Y’ functions involve various fixed aspects of households’ 
permanent incomes, the production technology, and so on. As men¬ 
tioned before, the real rate of return, r, exerts intertemporal substitu¬ 
tion effects that are negative on C'' and positive on Y\ A rise in C has a 
negative crowding-out effect on the contemporaneous choice of C'^. 
The impact of G on K* is nonnegative if MPG ^ 0. A rise in G reduces 
the pertinent measure of households’ permanent incomes, as dis¬ 
cussed before, which leads to a decrease in C** and a rise in V'*. 


Ejfecls oj a Temporary Rise in Government Purchases 

Consider a temporary_ expansion of real government purchases, 
where G rises while G is held fixed. The budget condition from 
equation (1) implies that these purchases are financed by a contem¬ 
poraneous increase in real taxes-net-of-transfers, T. In fact, for the 
context of a temporary rise in government spending—which is most 
pronounced at the federal level during wartime—it is more natural 
that the bulk of contetnporaneous finance would take the form of 
interest-bearing debt issue rather than tax increases. This behavior 
allows the government to spread the higher raxes necessitated by 
temporary spending over a large time interval instead of implement¬ 
ing exceptionally high tax collectioi« for a few periods (see Barro 
[1979, 1980i] for discussions). WithG held fixed, it would be possible 
to utilize interest-bearing debt so as to maintain the entire initial time 



loq4 JOURNAL OF POLrTIC:AL ECONOMV 

path of net real tax collections, T,. in any event, for the setting of 
lump-sum taxes, the present value of tax obligations is not altered by 
shifts between public debt and taxes (see n. 2 above). Since house¬ 
holds’ calculations of permanent incomes depend only on this 
pi esent-value magnitude, the effects of increases in government pur¬ 
chases that are financed by debt issue would coincide with the effects 
of those that are financed by higher taxes. 

The rise in G reduces C on tlie left side of equation (3) in accor¬ 
dance with the utility substitution parameter, f>. Therefore, aggregate 
demand, K'', rises on net by (1 - 0) times the increase in G. If the 
marginal product of public services, MPG, is positive, V rises with G 
on the right side of equation (3). Overall, the shift in “excess de¬ 
mand," Y'* — K*, is determined by the term (i — 0 — MPG). It was 
already assumed that this term is nonnegative—that is, ilie direct, 
one-to-one effect of G on aggregate tiemand is offset only partially by 
the utility-substitution and productive-input aspects of government 
purchases. 

Since an increase in G raises excess commodity demand, a iise in the 
real rate of return is required in order to restore commotlity-market 
clearing. This rcspon.se in r reduces C'' and raises I'*. The rise in Y" 
reflects the substitution of current work effort for planned future 
effort. Since I'* was also increased directly by the rise m G, it is 
apparent that equilibrium output rises. This output effect is greater 
the smaller is the value of 0, the larger is the value of MPG, and the 
greater is the real interest rate elasticity of K® relative to that of C'. In 
the polar case where 0=1 and MPG = 0, the response of K and r to G 
would be nil. In this circumstance, government purchases would 
amount to lump-sum transfers to households, because G, and C, were 
perfect substitutes in the utility function. 

Private consumer spending, G*', is crowded out from the rise in r 
and from the initial negative efiect of G. Ihercfore, the positive 
response of P to G must be less than one-to-one—that is, the model 
exhibits an output dampener rather than a multiplier. 

The positive response of Output to temporary movements in gov¬ 
ernment purchases would apply especially to wartime periods." The 
higher real rate of return can be viewed as a price signal that induces 
the intertemporal substitution of resources toward periods such as 
wars in which aggregate output is valued unusually highly. Phis type 

Because the expansion u( C, is leiniKirary. ihc rise in the equilibrium real rate of 
iritercsl woulci also be teriiporaiy. An extension to allow cliverjfeticcs lielweeii current 
and expected future real rates of return does not alter the basic analysis. 

" Wartime may also lie asstxiated with uncertainties on maintaining property rights, 
which would lend i<i reduce private invcsinierit demand. The possibly changing proba¬ 
bility of winning or losing a confliti would enter in this context. The analysis .abstracts 
from these eftects and from controls on prices or interest rates. Also excluded arc 
ellects of patnotisni or coercive behavior, such as conscription 
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of substitution has been stressed by Hall (1980, Sec. 2), who points out 
also that this behavior differs in some important respects from (he 
respon.ses of supply to monetary misperceptions that occur in .some 
business cycle theories that stress intertemporal substitution on the 
supply side (e.g., Lucas and Rapping 1969; Lucas 1975; Barro 1980a). 
The effect of temporary government purchases on the time arrange¬ 
ment of work and production does not rely on elements of misper¬ 
ception with respect to the general price level or other variables. 


Effects of a Pemuinenl Rise in Government Purchase\ 

Suppose now that G and G rise by equal amounts. The increase in G 
was shown already to raise excess commodity demand in accordance 
with the term (1 — — MPG). A higher value of G was shown earlier 

to reduce effective permanent income by the same factor. It was also 
n(3ted above that the typical or intermediate response of C'‘ net of L' 
would be one-to-one with this type of change in ellective permanent 
income. (This respon.se arises when households reduce expenditures 
on consumption and leisure by the same amount in each period.) In 
this case ilu^responsc of (C'' — Y‘) equals —(1 —ft — MPG) times the 
change iti G. Since this response exactly offsets the excess demand 
elTecl of G, the overall response in (f"* - Y') would be zero. It follows 
itt this case that the real rate of inteiest is invariant with permanent 
changes in goveininent purchases (under lum|j-sum taxation).'^ Al¬ 
though the invariance of r would not hold in general, this conclusion 
for an intermediate ca.se contrasts with the presumption of a positive 
effect for the selling ol a temporary increase m govermnent pur¬ 
chases. 

When r is unchanged, it follows fiom ccjuation (5) that consumption 
falls, and total output rises. Lhe expansion of production reflects 
partly the direct effect of G on Y" (which was assumed to be positive) 
and the negative income effect of C on leisure. (The net response of 
leisure becomes ambiguous when income laxtilion is introduced—see 
the subsequent analysis.) I'he decline in consumption means that 
output ri.sc’s by less than one-to-one with the pet maiieni expansion ol 
government purcha.ses; that is, an output dampener is again pre¬ 
dicted. 


Non-Lump-Sum Taxation 

Some of the results are affected by the unrealistic assumption that 
government expenditures arc financed by lump-sum taxation, fliis 

II follows also—assiiiiiiii)^ no elircsl circtls on ihc ni.irginal pimluci ot (.ipii.il 
sthediile—that capital accumulation woulci lie uinilierecf 
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section explores the consequences of income taxes in a simple envi¬ 
ronment. Because the present analysis is concerned only with tax 
effects that are systematically related to changes in government 
purchases—rather than with public finance questions, per .se—this 
simplified analysis may be adequate. 

Suppose that government expenditures are financed by a general 
income tax. Let t, represent the effective tax rate on incomes that 
accrue during period In a nonproportional tax setup, r, would 
represent the average marginal income tax rate. Assume now that the 
government can borrow and lend at the real rate of interest, r—the 
.same rate that applies to the private sector. Given the permanent flow 
of real government spending, which includes G and the comparable 
measure for real iransfers,*''and the possibility for variations in public 
debt, there exists some income lax rate f that is constant over time and 
also just satisfies the government’s intertemporal budget constraint,'^ 
Although many patterns of time-varying tax rates would also satisfy 
the government’s budgetary requirements, it will be desirable in most 
circumstances"’—in terms of minimizing the distortions that are im¬ 
posed on the private economy—to stabilize income lax rates over 
lime. That is, the government would adjust its public debt issues and 
redemptions in order to prevent divergences between current and 
expected future income tax rates. (For discussions of this type of 
result, see Barro [1979, 1980c); Kydiand and Prescott [1980, pp. 
185-86].) In this setting changes in government purchases would not 
generate movements in current tax rates relative to expected future 
rates, This conclusion means that, first, variations in G with G held 
fixed have no effect on tax rates; second, shifts in G imply equal 
changes in current and expected future tax rates, fand third, for 
the purpose of studying government purchases, it is unnecessary to 
deal with the intertemporal substitution effects that would arise from 
expected time variations in income tax rates.'* 

Given this framework for tax rale determination, it is unnecessary 
to modify the main conclusions that were derived earlier for the case 


o The analy.sis neglect.s llie double taxation ol income.s that flow through the corpo¬ 
rate sector. I'axauon in the lorm of inflationary finance could be included separately 
without allecling the main results. 

Real irilere.sl payments on an initial public debt .stock would also enter. 

Because changes in tax rates affect the tax base, f, the solution for f is generally 
noiiunique. However, the minimal possible value lor f is the pertinent choice. 

It may be optimal to allow tax rales to vary over the bu.sincss cycle. A counteicycli¬ 
cal pattern shows up empirically tor the U.S. federal government. However, the pattern 
IS at least less pronounced In terms of the total government sector. 

" It is assumed that increa.ses in tax rates induce increa,ses in real tax revenues within 
the relevant range. 

E.g., these types of effects are central to a study of the investment tax credit (see 
Kydiand and Prescott 1977, pp. 482-86). 
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of a temporary increase in government purchases. Notably, output 
and the real rate of interest continue to rise when G increases, with G 
held fixed. The principal modification to the previous findings is that 
r should be interpreted as the after-tax real rate of return, as calcu¬ 
lated with the appropriate average marginal tax rate on interest in¬ 
come.'* 

For the case of a permanent increase in government purchases, the 
^ew element is the rise in the income tax rate, f, along with the rise in 
G (see n. 17 above). The higher income tax rate motivates a shift away 
from market work and toward leisure and other nonmarket activities. 
In equation (3) this change is reflected as downward shifts in the Y’ 
andj!^** functions.™ For given values of r (now interpreted net of tax), 
G, G, and so on, it is plausible that the declines in C'' and K* would 
roughly balance. In other words, as in some cases that were discussed 
earlier, there is no reason to expect a particular direction of change 
for the relative amounts of consumption and leisure expenditures 
that are conducted at different dates. 

Given this pattern of response to a higher income tax rate, it still 
follows that a permanent shift in government purchases has no effect 
on the (after-tax) real rate of return. However, the negative effect of 
higher income taxation on the incentive to work offsets the tendency 
for output to rise.*' The net tnovement in output now involves three 
forces: first, the substitution away from work because of higher in¬ 
come tax rates; second, the negative income effect (associated with the 
higher level of G) on leisure, which motivates more work; and third, 
the direct productive-input effect of G on K". The first two influences 
involve the standard ambiguous net response of leisure to the sub¬ 
stitution and income effects that are generated by either a change in 
the real wage rate or a shift in the income tax rate that is applicable to 
labor income. However, the iticome effect here involves the term —(1 
- 6 — MPG)G, while the tax shift applies one-to-one to G. This dif¬ 
ference increases the likelihood that the substitution effect will out 
weigh the income effect. Suppose that the .substitution effect were, in 
fact, comparable to or dominant over the income elfect. In this case 
the overall change in output that is induced by a permanent rise in 


'*Thi.s result assumes that interest iiKoiiie is taxable and intcicsl pavnients are 
deductible from taxable income. The conclusion neglects sysleinatic ditleienccs be¬ 
tween the marginal tax rates applicable to reteivers of inleresi income \s. those perti¬ 
nent for payers. The lesull is not ailecled diiectly by the taxation ot iioiuinal, raihei 
than real, interest payments. However, other effects of inifaiion on effective lax rales 
would matter. 

The incentive to accumulate capilal would also be diminished I hroiigh this chan¬ 
nel, a permanent increase in government purchases tends lo reduce the capital stock, 
even when the after-tax rate of return is unchanged. 

A reduction in the capital slock, as mentioneti in n. 20, leinlorces this effect. 
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government purchases would be bounded from above by the direct 
positive effect of’ G on K*. This channel corresponds to only one 
portion of the positive output response that arose for the case of a 
temporary increase in government purchases—the other part in¬ 
volves the intertemporal substitution effect on work effort, which is 
associated with the increase in r. It follows that temporary rises in 
government purchases would induce larger output responses than 
equal size, but permanent, rises in purchases. The sign of the output 
respon.se is now ambiguous for the case of a permanent expansion of 
government purchases—the reaction is more likely to be positive 
when the marginal product of public services is high. 

Overall, (etnporary expansions of govertititcnl |jurchases are dis- 
titiguished from permanent increa.ses in that, first, the positive effect 
on output of the temporary expansion is likely to be larger and, 
second, a positive ef fect on the (after-tax) real rate of interest is pre¬ 
dicted only for the temporary case. The piesetit empirical investiga¬ 
tion deals only with output effects of temporary veistts permanent 
movements in govet timeitt purchases. Some prelimitiary analysis of 
real-rate-of-return effects is carried out in Barro (1981a). 


II. Empirical Implementation 

The theoretical propositions will he tested by examining the effects of 
govertimetit purchases in a retluced-form relationship for output, its 
measured by real GNP. The analysis is an extetision of previous 
etnpirical research (Barro 1981(>), which stressed the business cycle 
influences of monetary disturbances. This earlier work included a 
government purcha.ses variable, but did not distinguish temporary 
fiom perm.metit government spending. 

It is convenient to cat ry out the analysis in tei ms of the ratio of real 
govet ntneiu pint bases to real CNF, G/Y. In particular, temporary or 
permanent vaiiatiims in G are a.ssiiiiieil to enter relative to Y in a 
litiear relatioti lot the log of output—that is, 

log(F,) = . . . + /3,|(G - G)/Y], + /3,(G/F)„ (4) 

where omitted variables indicated by . . . include current atid lagged 
monetary shocks and mher deterministic and stcjchastic influences on 
output. The variable G(, which would generally be unobservable, is 
the pcriiianent flow of goveiiiinent purchases as perceived at dale I. 
riie empirical procedure fm' handling this variable is discussed below. 
Lagged values of G/Y and G/Y might also appear in equation (4), but 
these effects were not found to be important empirically. 

The functional form in equation (4) implies that increments in 
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government purchases induce increments in output in accordance 
with 

dYll + /3,(G - C)/Y + /3.fi/Y] = l3,d(G - G) + p^G. (5) 

Therefore, in regions where f/3,(G - G)IY + ft-filY] is small relative to 
one, the coefficients /3i and would indicate the approximate re¬ 
sponse of Y to unit changes in (G — G) and G, respectively. The 
empirical results suggest that this approximation is satisfactory over 
the 1930-78 period in the United States, except for the World War II 
years. 

'I'he theoretical analysis suggests testing the hypothesis, 0 ^ /3i 

^ 1. This restriction implies that temporary changes in government 
purchases have a larger output effect than permanent changes, but 
that permanent changes also raise (measured) real GNF. These con¬ 
ditions are likely, but not inevitable, implications of the theoretical 
model. 

A central aspect of the empirical analysis is the representation for 
permanent real purchases as perceived at date t, G,. Suppose for the 
moment that the time-series behavior of C, implies a relationship for 
(GIY)t in terms of a set of paiameters a and a vector of currently 
observed variables, Z,: 

{GIY), = F(Z,; a). (6) 

In this case the unobservable construct, G„ coultl be substituted out 
from equation (4) to yield a relation for output in terms of observable 
variables and the vector of unknown coefficients (a, )3,, (i-i), 

log(K,) = . . . + ^,[(G/f), - F(Z,; a)l + (Z,; a). (7) 

Some hypotheses arise that concern the role of the Z, variables in 
equations (4), (6), and (7). If these variables can be guaranteed, ex 
ante, not to appear separately in the list of omitted elements that are 
denoted by . . . in eejuation (4), then the Z, variables would appear m 
equation (7) only to the extent that they serve as determinants for 
(GIY), in the f-function of equation (6). Some cross-equation restric¬ 
tions therefore emerge for the parameters of equations (6) and (7). 

The next sections deal with the problem of modeling a form of 
equation (6) for real government purchases m the United Stales. 

“Levi!, Korhiii has suggested the aiu.inive alternative of using tlie tin tent ovciall 
tax rate as a prow tor the antitipated, long-i tin avciage r.itio ol governineni purthases 
toGNP. The rationale for identilviiig the curieni t<ix latewiih the anticipated govern¬ 
ment expendilure ratio was di.vtiissed in the theoretical section. .Some problems with 
implemenung Kochin's suggestion arc; First, the distinction between purchases and 
expenditures implies that a separate model would be required to pi edict luture trans¬ 
fers (including interest payincnls), whith is 1101 obviously easier than modeling pur- 



loo 


JOURNAL OF POLITICAL ECONOMY 


Government Purchases Equation 

The stress on transitory movements in government purchases 
suggests special attention to war-related expenditures, which are 
likely to be viewed as largely tempiorary. I have proceeded empirically 
by separating total government (federal plus state and local) real 
purchases of goods and services into a “defense” component, G"’, and 
other purchases, G**. The present analysis does not attempt to classify 
components of government purchases in accordance either with their 
relative substitutabilities with private spending, as reflected above in 
the 6 parameter, or with their role as inputs to private production, as 
measured above by the MPG parameter. Differences between defense 
and nondefense items with respect to these parameters affect the 
interpretation of some of the empirical findings. Presumably, defense 
purchases are characterized by a relatively low value of 6 and possibly 
by a relatively high value of MPG. The former implies a relatively 
large output effect of temporary defense purchases, while the latter 
would enhance the output effects of both temporary and permanent 
defense purchases. The empirical analysis would be sharpened by 
obtaining a division of nondefense purchases into relatively homo¬ 
geneous categories with respect to the 9 and MPG parameters, but 
the feasibility of this classification is unclear. Transfer payments 
have not been included in the analysis. 


Defense Purchases 

A primary determinant of G"” would be the level of current and 
anticipated future wartime activity, assuming that at least the timing 
of wars can be treated as exogenous with respect to expenditure 
decisions. 1 have quantified this influence by using a casualty rate 
measure .0,, which represents battle deaths per 1,000 total population 
(see table 1) for the wartime years since the Civil War; 1898, 1917-18, 
1941-45, 1950-53, 1964-72. In effect, this variable can be viewed as 
an alternative to a set of wartime dummy variables. The casualty rate 
measure represents an attempt homogeneously to quantify the inten¬ 
sities of different wars and different years within each war, without 
using military expenditures or personnel measures, which are the 
types of variables that are to be explained. In particular, the use of 


rha.ses directly; and, second, the use ot the tax rate to proxy the permanent expendi¬ 
ture ratio may work better lor the federal government than for total government. See 
Benjamin and Kochin (1978), who argue that mobility possibilities would prevent state 
and local governments from choosing an excess-burden-minimizing debt policy. How¬ 
ever, this issue involves also the federal government’s interaction with state and local 
governments—that is, the federal government may compensate for public debt/tax 
variations that cannot be carried out at the slate and local levels. 
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TABLE I 

Casualtv Rate Variable 


Date 

B 

Date 

B 

Date 

B 

1898 .... 

... .0052 

1945 ... 

... .603 

1966 , 

.025 

1917 ... 

... .23* 

1950 ... 

... .071 

1967 . . . 

.047 

1918 ... 

... .28* 

1951 ... 

... .097 

1968 ... 

.073 

1941 ... 

... .0044 

1952 ... 

... .030 

1969 ... 

.046 

1942 . .. 

... .162 

1953 ... 

... .021 

1970 ... 

.021 

1943 ... 

... .205 

1964 ... 

... .0014 

1971 ... 

... .0067 

1944 . .. 

... 1.090 

1965 ... 

... .0070 

1972 ... 

.0014 


Soi'BC rs—Vietnam (1964-72). SUUutuai Abstract of the UmUd iuufs, 1977. p, 369, table 590. World War 1 
(1917-16) knd Spanish American War (1898): HuLmcal Stahstscs of Uu VmUd Staict, 1975. p 1140, line 660 Korean 
War (1950-53) relative yearly data from Department of the Army, BaitU Castuiitin of UteAmt^ (1954), were applied to 
war lotil from SUUuUcai Abstract L/futed SiaUs, 1977. p 369, table 569 World War II (1941-45) rclauvc yearly 
data from Olhee of the Comptroller of the Army, Army Bottle C'asvaittes and Xonbatde Deaths in Worid IVar It Final 
report. December 19AI-December 1946, were applied to war total (romStalunral Aftitrort (f the l/nt(rd .States. 1977, p. 369. 
table 589 Korean War and World War I1 data were obtained from William .Strobridf^e, Chief. Hutoncal Service* 
Division, Department of the Army 

Nora.— B is battle deaths per L,000 total populatiurt Valuesof ceroapply todates not listed. Orders-of-tnagnuude 
values of B (per year) for earlier wars are Revolution v1775-83), 0 2, War of IHi2 (1612-15), 0 08. MexKan War 
(1846—48), 0 04, Cavil War (1861 -65, union only), I 0 Casualty hgure* are from Civil VS>ar HistorM-al Statuses cf the 
UnUed Statet, 1975, p- 1140, line 880; other wars. Department of the Army, Hutorj of MtUiary MohtUtatton tn the CniW 
States Am^, 1955. appendix A 

* Yearly data were unavailable Figures are based on war total assuming equal rate ol casualue* per month 


separate dummy variables for each war would remove any power 
from the statistical tests that are carried out below. 

Because of improvements in the technology of caring for wounded 
and offsetting changes associated with the “efficiency" of weapons, it 
is possible that the casualty rate variable does not consistently measure 
the intensity of war at different dates. 1 considered using a broader 
casualty measure that included wounded, but the ratio of this concept 
to battle deaths showed no trend at least since the Spanish American 
War.Since I was unable to obtain reliable annual data on wounded 
for World War II, I have restricted iny analysis to the narrower battle 
deaths concept of casualty rates. 

Prospective wars would be likely also to influence current spending, 
with good information on forthcoming military actions existing prior 
to at least the U.S. entrances into World Wars 1 and 11. Since I have 
been unable to construct any instruments for these war exp>ectations, I 
have introduced some actual future values of B into an equation for 
current defense spending. This procedure introduces errors-in- 
variables problems into coefficient estimation, although the present 
analysis is concerned primarily with obtaining conditional forecasts 
rather than with coefficient estimation, per se. A later part of the 

” The ratio of total casualties (including wounded, but excluding deaths that were 
unrelated to combat) to battle deaths is 5.3 for the Spanish American War, 4,8 tor 
World War 1, 3.3 for World War II, 4.0 for the Korean War, and 4.3 for the Viet, 
nainese War. See the notes to table I for sources of casually data. 
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empirica] analysis considers a specihcation where future values of the 
B variables are excluded from the estimation. Lagged effects of on 
spending are introduced also into the equation for defense purchases. 
Empirically, two annual leads, the contemporaneous value, and up to 
a third lag of the B variable were found to be important. 

Since defense expenditures involve a substantial investment com¬ 
ponent, the amount of current spending would tend to be influenced 
negatively by the si^e of existing capital slocks. Accordingly, 1 have 
included in a defense spending equation the variable Af_,, which 
ineasiires the beginning-of-period real stock of military equipment, 
structures, and inventories (table 2, col. K"'). The relation of capital 
stock to current spending is assumed to be given by 

Ki^-= bGi‘-+ (1 - 8)Kr-,, (8) 

where 8 is a depreciation rale anti b measures ihe fraction of total 
defense spending that constitutes investment (net of within-year de¬ 
preciation on this investment). The K"' series was constructed with 
values of ft and 8 that varied over time (see the nn. to table 2), but 1 have 
limited the theoretical discussion below to situations where these 
paraineteis are approximated satisfactoi ily as constants. 

The estimating equation for G"' lakes the form 

gl*' = (G'''/y)( = ctoBi -h ... -h -h , -t- a^Bf+i — 'yftj'-i + Wf, 

(9) 

where fJj'' is real defense purchases, E, is real CNF, ftj'l, = A'|'l]/E(_,, 
and u, is a stochastic term. Note that the dependent variable in 
equation (9) isgj''. the ratio of real defense purchases to real CiNP. The 
main part of the subsequent analysis is carried out in terms of ratio 
variables of this type, f'he form of equation (9) implies that a doubling 
of Y„ K'/L,, and E/-,. for given values of the B variables, leads to a 
doubling of (7“’. The model for determining g”' over time will be used 
to deteimine the currently perceived permanent flow of real pur¬ 
chases when expressed relative to real GNP, g?’ = ((T^/E),. 

The error term in equation (9) was modeled satisfactorily as a 
random walk, so that estimation can be cariied out readily in first- 
difference form: 

Dg"’ = oloDB, -t- . . . -t- ol^DBi-^ -I- a,DB,.f., -I- a-J)Bi+2 ~ 

( 10 ) 

where D is the first-difference operator and =u, — U(_, is a white- 
noise error term. A constant is in.significant when added to equation 
(10) in the empirical analysis—that is, there is no trend in the defense 
purchases ratio. Moving-average error terms or more complicated 
autoregre.s.sive error structures also did not add to the explanatory 
value of the equation. 
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The form of equation (10) implies that a current shock c,—which is 
not associated with wartime in the sense that the values of the DB 
variables are small—would have a permanent effect on the future 
mean level of g*". Because of the inclusion of the A{^, term with a 
negative sign in equation (10), the effect on gf of e ,—that is, of the 
current actual value gf with values of the B variables and held 
fixed—turns out to be positive, but somewhat less than one-to-one. 
On the other hand, if the distribution for future values of B is 
stationary in level form, positive values for DB tend to be followed at 
later dates by negative values, which lead to decreases in future values 
of g"”. In other words wars and the accompanying levels of expendi¬ 
tures are modeled as temporary. I'his mechanism implies that an 
increase in gf that is accompanied by positive values of DB will have 
much less effect on ^ than will the equivalent peacetime change in de¬ 
fense spending. Wartime spending has an appreciable effect on gf 
only to the extent that military expenditures depart from the amount 
associated typically with the current level of war intensity. The possi¬ 
bility that wars have a systematically important permanent effect on 
the purchases ratio is ruled out by the form of equation (10). Some 
alternative specifications of the error process that would have admit¬ 
ted this type of persisting effect were not supported empirically for 
the United States. 

As detailed in the Appendix, equations (10) and (8) can be used to 
express expected f uture values of the defense purchases ratio, as 
a function of the latest observed ratiothe value and actual and 
expected future values of the war intensity variable, B. Summation 
over these expressions with proper allowance for discounting \ields a 
relation for the permanent purchases ratioin terms of g"', kf^i, the 
array of B variables, and a real discount rate p (which equals the dif¬ 
ference between the real interest rate and the growth rate of real GNP). 
Finally, a simple specification for the stochastic structure of the B 
variable—based on the frequency, intensity, and duration of wars 
over the full history of the United States—is used to solve for the 
expected future values of the B variables. With this substitution, gf is 
determined as a function of observable magnitudes, up to the setting 
of a discount rate. In the case where future values of the B variables 
are admitted into the government purchases equation (10), g,"" ends 
up as a function of the values B,+ 2 , . . . ,B,_ 3 —that is, the values B,+ , 
andB ,+2 are treated as observable at datet. In a situation where future 
values ofB are excluded from equation (10),^ is expressed in terms 
of the current and lagged values, B,,. . . , B ,_3 (as well as the values erf 
gf and kf_y). 

The main product of this exercise from the full empirical analvsis is 
the series for temporary real defense purchases expressed relative to 



TABLE 2 

Government Purchases Variables 


Date 


rt 

1889 

.0060 


1890 

.0057 


1891 

.0057 


1892 

0056 


1893 

.0062 


1894 

.0064 


1895 

.0053 


1896 

.0059 


1897 

.0080 


1898 

.0192 


1899 

.0165 


1900 

.0131 


1901 

.0113 


1902 

.0110 


1903 

.0108 


1904 

.0123 


1905 

.0112 


1906 

.0093 


1907 

0090 


1908 

0117 


1909 

.0105 


1910 

.0100 


1911 

0101 


1912 

.0094 


1913 

0096 


1914 

0139 


1915 

0135 


1916 

.0164 


1917 

.076 


1918 

258 


1919 

.156 


1920 

038 


1921 

.033 


1922 

.017 


192.3 

014 


1924 

.014 


1925 

.012 


1926 

.011 


1927 

.012 


1928 

013 


1929 

.013 


1930 

.015 

.030 

1931 

.017 

.031 

1932 

.019 

.033 

1933 

.016 

.032 

1934 

.016 

.032 

1935 

.017 

.031 

1936 

.018 

.031 

1937 

.016 

.029 

1938 

019 

.031 

1939 

.017 

.029 

1940 

.028 

.025 


.097 

.094 

.094 

.089 

.097 

.102 

.092 

.097 

.091 

.094 

.087 

.088 

.081 

.084 

.088 

.088 

.089 

.082 

.091 

.110 

.089 

091 

.105 

.100 

.096 

.106 

.112 

.093 

.085 

080 

.049 

.085 

.125 

.116 

.105 

.116 

.115 

.108 

.118 

.121 



.055 

.117 

.015 

056 

.141 

014 

.057 

.158 

.014 

.063 

.175 

.016 

.063 

.176 

.016 

.056 

.187 

.014 

.050 

.174 

.013 

.045 

.179 

.013 

.043 

.165 

012 

.046 

.187 

.012 

.045 

.183 

.003 

.052 

.163 
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TABLE 2 ( Continued ) 


Date 


rt 

Jf " - g " 


g-it 

1941 

.120 

.065 

055 

074 

.122 

1942 

.317 

.104 

.213 

.164 

.092 

194 .S 

.439 

.093 

.346 

.309 

.069 

1944 

463 

.074 

389 

.431 

.064 

1945 

.410 

.069 

.341 

.470 

.063 

1946 

.103 

063 

.040 

.483 

.092 

1947 

.055 

.069 

-.014 

.414 

.106 

1948 

.056 

.072 

-.017 

337 

.117 

1949 

.064 

.091 

-.026 

.290 

.132 

1950 

.066 

.074 

-.008 

.239 

.117 

1951 

.123 

.098 

.026 

209 

.107 

1952 

156 

.127 

.030 

225 

.110 

1953 

.156 

.140 

.016 

260 

.118 

1954 

.133 

132 

.001 

.291 

.119 

1955 

.115 

132 

-.017 

.284 

.116 

1956 

.112 

•.129 

- 017 

283 

.115 

1957 

.116 

.133 

-.017 

.286 

119 

1958 

.115 

.133 

-.018 

.296 

134 

1959 

108 

.129 

-.022 

.286 

129 

1960 

.102 

.124 

-.022 

282 

.133 

1961 

.104 

.124 

- 021 

.278 

. 1.38 

1962 

103 

.123 

-.020 

.268 

.138 

1963 

.096 

.116 

- 021 

263 

142 

1964 

.087 

.107 

-.021 

251 

.145 

1965 

.080 

.096 

-.016 

.235 

.147 

1966 

088 

.090 

-.002 

.219 

146 

1967 

098 

.086 

.011 

216 

149 

1968 

096 

081 

.014 

.211 

.151 

1969 

.088 

.080 

.009 

.206 

.150 

1970 

.079 

.083 

-.004 

201 

154 

1971 

.068 

.080 

- 012 

173 

. 1.57 

1972 

.063 

076 

- 014 

169 

153 

1973 

056 

.074 

- 017 

.153 

.148 

1974 

.055 

071 

-.017 

147 

157 

1975 

.055 

070 

-.016 

.141 

.164 

1976 

.050 

.067 

- 016 

128 

.156 

1977 

.049 

.064 

- 015 

.118 

152 

1978 

.046 

.060 

- 014 

111 

.150 


* g* ^ C^IY, where Y u real ON P (1972 base) i» real «lclcrue purr liases (1972 base) Daia «iKe 1929 arr from 
National Income ami Product Arcounii of tht VntUd StaUi at«l receiil ivtues of ihe CmUii Skiln Sun'ry of Currmt fiiiMnrw 
I he fraction of noinmal delense purchases m t*)tal iiomin.il ferlrral porchavs was inulii]»lirf1 b> iiKures on leal 
fedeia! puithases (1972 base) Data fioin 1889-1928 are fi<»m Kendruk (1961. lalilc X-l, col 5) hiKures were 
tnuluphed by 4 8. Iwscd on ihr overlap for 1929. 

IS the estimated iionnal defense putchases ratio, as calculated fioni e<) (14) in the lext 
t A * » A^/y. where A ^ is the end-of-year value of net leal stocks of niilii.irv miuc luirs, equipmeni. and inventories 
(1972 base) Data from 1929-69 are Ironi KendrKk (I97b. table B-24) convened from a 19*i8 to a 1972 index b> a 
(onstant multiple (1 72) Figures were extended to 1978 using data on vancHis exiienditurc coni|>onrnis’ nniiiarv 
sii ucturcs, AEC structuies, miktary equipment. AECequipment, inventoncs for GSA stockpiles, and msentones for 
AEG stockpiles Oepreciaiion csiimaies were based on rates used bv Krndnck within each category His calculations 
assume a higher rate of depreciation during World War II 
8 ^where G' is real nondefensc purchases ol the lederal plus state and Icxral governmeiii sectors <1972 
base), was caKtilated as total real goveniment purchasesC lesaG*^ Sounes forG correspond lo those .ibevve for 
G* except that Kendrick (1961. table A-lla) was used for data from IRB9-192H 
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real GNP, (g-“’ — g”’),, which is indicated over the 1930-78 period in 
table 2. ('This series corresponds to the choice of discount rate—the 
difference between the real rate of interest and the growth rate of real 
GNP—of .02 per year.) As is evident from the table, this variable 
identifies the years associated with wartime, 1940-46, 1951-53, 
1966-69, as times when the defense purchases ratio is above its 
perceived long-run average value. Although the underlying model 
allows a quantitative assessment of the gap between g"’ and ^ in the 
context of wars with different intensities and in an environment of 
nonconstant values for gf, it is also clear that the general pattern for 
the (g" - g"), variable would be robust to some changes in the 
underlying model. Notably, the important aspect of the stochastic 
sperihcation for the war intensity variable B is the temporary nature 
of wars, rather than the details of war probabilities. Substantial varia¬ 
tions in the discount rate p also have minor effects on the results. 

It should not be surpri.sing that the (g''" — g""), variable exhibits a 
substantial amount of positive serial correlation. In this respect a gap 
between current and normal values—which the (g“' — ^), variable is 
intended to capture—should be distinguished from thg spread be¬ 
tween actual and anticipated or perceived amounts, which has been 
stressed in earlier analyses of monetary di.sturbances (Barro 19816). 
The latter type of vaiiable exhibits serial independence as a conse¬ 
quence of rational exjieciations and the assumption that information 
is received with, at most, a one-pet iod lag. I'his type of argument does 
not apply to a variable that measures temporary ef fects. 1 n the case of 
the temporary defen.se purchases variable, the large number of (se¬ 
rially correlated) peacetime years with small negative values of (g“' — 
g"), is offset by a smaller number of (serially correlated) wartime 
years with excesses of gf over gf. (However, the years that are 
significantly affected by war—for example, the set 1941-46, 
1950-53, 1965-72—should not be deemed special, since they con¬ 
stitute 47 percent of the years since 1941 and 39 percent of those since 
1946.) 


Government Purchases of Nondefense Items 

I'he nondeferise portion of government purchases— g'‘ ^C'/Y, where 
G" is nondefense real purchases—was examined statistically over 
samples beginning in 1929. This study revealed little predictive value 
for hrst dif ferences, Dg", except for a negative association with the 
contemporaneous change in the defense component, Dg"’.^'* In par- 


Fast lustoi 7 cit the residuals, lagged values i>i Dg“ or Dg’'\ a capital stock measuii 
Dk‘\ and a constant were all insignificant. 



GOVERNMENl’ PURt;HASES 


1 I 07 

ticular, there is no drift in the nondefense purchases ratio, fhe 
negative association of Dg” with Z)g“’ reflects the crowding out of 
nondefense government spending during wartime. The dependence 
of fig" only on_Dg"’ means that departures of g" from the perceived 
normal valueg" are determined completely by the difference between 
g"’ and g”". With g"" — g"’ held fixed, changes in g’’ amount entirely to 
shifts in the permanent component of nondefense purchases. Ac¬ 
cordingly, with the g“’ variables entered separately, the coefficient of 
the g‘‘ variable in an output equation would reveal the effects of 
permanent changes in nondetense purchases. It is not possible here to 
estimate the response of output to temporary changes in nondefense 
purchases, since no temporary changes were isolated over the sample. 


Empirical Results 

The principal empirical analysis involves joint estimation of the gov¬ 
ernment purchases equation (10) and a relation for output that is 
based on the form of equation (4). With the defense and nondefense 
components of government purchases entered separately, the output 
equation becomes 

log(K,) = . . . + /3,(g“ - + p,gf. (11) 

Note that the real government purchases variables all appear as ratios 
to real GNP. The variable^ s (ZP^IY), is determined as a function of 
observables from equation (10) when used in conjunction with some 
relations that arc derived in the Appendix (eqq. [A5) and 1A6]). 

The first set of hypothesis tests involves the output effects of the 
is"' ~ S"^i S" variables in equation (11)—specifically, that the 
coefficients of these variables satisfy the restrictions, 0 ^ /Sj sS )3i ^ 1. 
As indicated earlier, the coefficient on the g}' variable, R-,, reveals the 
output effect of a permanent change in nonflefcnse purchases. If 
noiidefen.se purchases were charactcri/.e<l by closer substitutability 
with private consumption expenditure (the 9 parameter) and by lesser 
impact on private production (the MPG parameter) than defense pur¬ 
chases, then ^2 > Pa would follow. However, this condition cannot be 
viewed as a firm implication of the theory, fhe model also suggests 
the restriction, 0 ^ ^ 1. 

The second set of hypothesis tests checks whether the explanatorv 
variables for g '/'—in this case B,+ 2 , - . . , and A"',—enter an 

unrestricted reduced form for output as determined solely bv their 
role in determining g“’ in accordance with the coefficients of equation 
( 10 ). 

The analysis is contingent on a value of the discount rate p—the 
difference between the real interest rate and the growth rate of real 
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GNP—in the calculation of the g’f variable (see eqq. [A5] and [A6] in 
the Appendix). However, the results turn out to be relatively insensi¬ 
tive to variations in the p parameter at least over the range from .01 to 
.05 per year. The reported results refer to a fixed value of p = .02 per 
year, which is a plausible magnitude, ex ante, and which approxi¬ 
mates the maximum likelihood estimate for this parameter. 

Jointly estimated equations for real defense purchases and real 
GNP were calculated by means of a nonlinear, maximum likelihood 
routine from the TSP regression package, which includes estimation 
of contemporaneous covariances for the error terms. The estimation 
is joint in the sense of incorporating the role of the coefficients from 
equation (10) in determining the seriesgj® and thereby influencing the 
fit for output in the form of equation (11). In particular, the 
coefficients in the equation for Dgf are not determined solely to 
obtain a best fit of equation (10). 1 have not carried out Joint estima¬ 
tion in the broader context of choosing the number of leads and lags 
of the B variable to include equation (10), in deciding to omit 
moving-average error terms in this equation, in analyzing the process 
for nondefense purchases, and so on. 

Since the dependent variable, real GNP, appears also in the de¬ 
nominators of the ratio variables g® and g", there is a possible simul¬ 
taneity problem in the estimation. Accordingly, I have used as in¬ 
struments for gj"’ and gf the It^ged values, gjii and gf_i, and also the 
contemporaneous values, Gf/y,and Gf/P^, where P, is the trend value 
of real GNP, as determined from a regression over the 1946-78 
period ot log(GNP) on a constant and time. The estimates are not 
altered substantially if only the pair of lagged values or only the pair 
of contemporaneous values relative to trend are used as instruments. 
Empirically, the movements ing" are dominated sufficiently by con¬ 
ditions of war or peace that the use of instruments yields estimates for 
the coefficients of the (g“' — g®), and gf variables in equation (11) that 
differ only in minor ways from ordinary-least-squares (OLS) values. 
However, the use of instruments is important in the case of the gf 
variable—OLS estimates for the /Ss coefficient in equation (11) appear 
to be biased downward subsUintially because of the inclusion of P< in 
the denominator of the gf variable. 


Results for Post-World War II Output Sample 

For an output sample that begins in 1946, the results of the Joint 
estimation of equations (10) and (11) are, for the 1932-78 sample; 

DgT = + .198£>£,+, + .275DB, + .240i>B(_, 

(.013) (.012) (.014) (.017) 


1 
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- .022DB,_j + .088DB, , - .26DfeJ'l,, 

(.015) (.016) (.08) 

<T = .0143, D-W = 1.7; 
and for the 1946-78 sample; 

log(K,) = 2.97 + .0343 • t + .83/)M/?, + 1.12DMK,_, 

(.04) (.0008) (.22) (.22) 

+ .99(r - + -55^ + •62^. (13) 

(.21) (.12) (.45) 

or = .0143, D-W = 1.5. 

Asymptotic standard errors are shown in parentheses below the 
coefficient estimates. The & values are asymptotic estimates of the 
standard errors of the disturbance terms; D-W is the Durbin-Watson 
statistic. Note that the .sample for the government purchases equation 
(12) begins in 1932 and thereby includes the World War 11 experi¬ 
ence. 

Variables included in equations (12) and (13) are: Y, real GNP 
(1972 base); t, time trend; DAf/? ^DM — DM is “unanticipated money 
growth," as measured in earlier research (Barro 19811>), where DM is 
an estimated value of money growth from an equation that is based on 
the M 1 definition of the money stock;*® g" ^ G“’/y, where O'" is real 
defense purchases (1972 base); g” = C’lY, where G" is real, non¬ 
defense, federal plus state and local purchases (1972 base);B, casually 
rate variable as defined in table 1; and A" = K'^fY, where is real 
government defense capital stocks (1972 base). 

For pre.sent purposes 1 focus on the role of the government pur¬ 
chases variables in equation (13). The money shock variables have 
effects that are similar to those discussed in previous research, as 
reported in Barro (19816). 

Theg^ variable in the output equation is based on the specification 
for Dgf that appears in equation (12). The main result from this 
equation is the strong positive spending effect of wars, as measured by 
the casualty rate variableB. The equation shows a 2-year lead ellect of 
the B variable and a lagged effect out to 3 years. (The negative effect 
on Dgf of the DB ,_2 variable is difficult to interpret.) The conse¬ 
quences of eliminating the future values of DB from this equation are 

is determined from an equation that is estimated over the 194 1-78 sample 

DM, = .095 + A9DM,-, + .l6DM,.t + .069FEDV, + 030 ■ lt)g[t//(l - t’)),.,. 

(.024) (.14) (.12) (.015) (.008) 

where observations from 1941-45 are weighted by .86. FEDV, is real tederal spending 
relative to a distributed lag of itself, and 17 is the unemployment rate in the total labor 
force. See Barro (19816) for a discussion of this type of equation. 
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discussed later. For present purposes the most important aspect of 
war spending is its temporary nature, although precise calculations 
forgl" involve the distributed lag pattern ofDB effects on Dgf and the 
implications of these responses for the behavior of the capital stock 
ratio /f"’. Equation (12) shows also the expected tiegative effect of 
on DgT. 

Using equations (A5), (A6), and (A11) from the Appendix, and die 
value p = .02 for the discount rate, the point estimates of coefficients 
that appear in equation (12) can be shown to imply the formula torg"' 
as follows; 


gr =.011+ .67gr + , - . 06 JJ ,^3 + . 02 B, , 

- .185, - .I2B,„ - .075, 


. 165 ,_, 

(14) 


This relation is a particular form of equation (6). F.qualion (14) shows 
a positive but less than one-to-one effect on g'/' ofg"', a positive effect 
of(for a given value ofg"). aid basically negative effect of the 
casualty rate variables (again given the value of g“'). Values of g"' that 
are calculated (fotn equation (14) are shown along with values of g"' in 
table 2. 

The temporary defense purchases variable, (g"' - ^),, has a 
sigmficatttly expansionary effect on output. The estimated 
coeflicient^'* in e({uation (13) is /S, = .99, S.E. = .21. The “t-value” 
corresponding to /S, = 0 is 4.7. The normal defense purchases vari¬ 
able, g", is also significatitly expansionary in this equation—= .55, 
S.E. = . 12, which implies a t-value of 4.6. The estimated effect for the 
permanent purchases variable is somewhat greater than half that of 
the estimated temporary effect.^^ The results permit rejection ol two 
extreme hypotheses: first, that only the temporary part of purcha.ses 
alfecls output (which would require the estimated coefficient of 
the gf variable in eq. [13], to differ insignificantly from zero), 

“ 13c'caiis<- (it the tiegative torrelaiion <(t (g‘ - ^), willi gf, the g, coetficicnt picks up 
an adtlitional etfect The extra lerni involves the dillei ence helweeti the output el tects 
of permanent and tcmptiiary nondelensc purchases. The output coeificient assooated 
witli temporary noiidclense purchases could not be estimated with the ,iv:nlablc data. 
However, since the regression coeflicient ofgf cm (g” — g"), is on the order ol - 0.1, it is 
unlikely that the overall incKhrication is important. 

It has been suggested that the temporary government puichases variable may be 
proxying lor the ellects of accompanying federal dehcits. The analysis in Barro (1979) 
dcK'iiments the strong positive ellect of temporary federal spending, as in wartime, on 
public-debt issue. Some pieliminaiy results in Barro (1980fc) indicate that lagged “debl 
shocks” have expansionary clfects on output that are statistically significant but sub¬ 
stantially weaker than tbo.se i>f monetary sh<x;ks. However, this constructed debt-shock 
variable filters out the normal positive as.s<Kiation between temporary government 
spending and the deficit. With these debt shocks held fixed, the actual lagged values of 
public-debt growth have no explanatory value for output. This last hndmg suggests 
that the strong expansionary influence of temporary delensc purchases does not 
involve a proxying for the effect of correlated movements in the federal deficit. 
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and second, that temporary and permanent purchases are of equal 
importance for output. The latter case would correspond to equal 
coefficients (/3, = /J-d for the (g"' — g"'), andg-"' variables—that is, to the 
proposition that the coefficient of the gf" variable would be /.ero in an 
equation that held fixed the value of the actual purchases ratio, g'f. 
For convenience, the results from equation (13) can be rewritten in 
this form as 


lo}r(F,) = . . . + .99^? - A4gr 
(•21) (.24) 

The hypothesis that the coefficient of the g"' variable e<iuals zero 
corresponds to a /-value of l.H, which is significant at the 5 percent 
level for the case of this one-sided test. That is, the null hypothesis of 
equal output effects for tempoiary and permanent defense [>urthases 
(/3i = ^2 iti eq. [11]) is rejected in favor of the hyjiothesis that 
temporary purchases are more expansionary, /S, > j32.~'' 

The estimated coefficient on the (g"' — g"), variable in etjuation (13) 
implies that a temporary change in the level of real defense purchases 
has almost a one-to-one ef fect on the contemporaneous level of out¬ 
put. While this finding is consistent with the restriction. /3, ^ 1, the 
evidence would also bo consistent with a moderate multiplier relation¬ 
ship between temporary government purchases and output. The rel¬ 
atively high estimated output effect is associated in the theoretical 
model with a small value of the ^-coef ficient, a high value of the Ml’d 
parameter, and a high real-rate-of-return elasticity of aggregate sup¬ 
ply relative to that of demand. 

The estimated coefficient on theg"' variable in equation (13) iitiplies 
that a permanent increase by one unit in real defense |)urtbases leads 
approximately to a one-half unit rise in real GNP. rhis result accords 
with the restriction /32 ^ 1—moreover, the estimate is significantly 
below unity in this case. 

The estimated coefficient of gf, the nondefense purchases ratio, is 

“ 1 considci fd dis< rimtiuting between tempoiary and permanent deicnse pin th.a.'.es 
by iitili/ing a measure ol the leturii on tlie emmics ol detense loniiaiiors relative lo 
that on a market porttolio of New Yoik .Stock Exchange (NV'SK) sirnks. (The lelative- 
leltiins vaiiable wa.v coiisiruclcd using data on total reluins to N\’SK stinks from the 
Tenici lot Keseaiih in Seeuniy t’rires ol llie Uiiiversily ot Cliiiago. A list ol defense 
loiuraetor.s ,ind the si/e of these ronlratls lor l',)69 Iroin the Deparinieni ot llelensc 
was kindly supplied lo me by Claiie Fiiedland.) This relalivc-reUivns variable has no 
explanatory power when added lo a (irsl-dilfeienee rorin ol the output equation with 
^ 1 ^ 1 : and the deicrmiiianls ot Dgy Iroin the torm Of eq. (12) indnded as 

independent variables. Comcptiially. it is unclear whether the relative-returns v.triable 
signifies an increase in war probability and, iheretore. that ctineni defense expendi- 
luies are more likely to be temporary, or an mcicase in the Umg-run expected (pianinv 
of defense purehases, which would imply that tiirrciit defeii.se cxpenctiinres arc more 
likely to be permanent. Therefore, the sign ol the variable is ainbiguoiis on Iheoretiral 
giounds. 
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/Sa = .62, S.E. = .45, which is positive but imprecisely determined. The 
relatively small amount of independent sampling variation in this 
variable since 1946, in conjunction with the necessity of using instru¬ 
mental variables, results in the high standard error. (Using OLS 
techniques, the estimated coefficient of the gf variable is negative.) In 
any event the hypothesis that the coefficients of thegf and variables 
are equal, ^2 = fia, would be accepted from the present evidence. As 
noted earlier, the relative output effects for these two types of perma¬ 
nent movements in government purchases depend, in the theoretical 
model, on the relative values of the 6 and MPG parameters. Possibly, a 
more precise determination of thegf coefficient would be obtained by 
extending the analysis to the 1930s, during which major changes 
occurred in the nondefense purchases ratio. The main obstacle for 
this extension is the i.solation of monetary shocks, which seems to 
require a specification for the pre-World War II monetary regime 
that differs from that used for the post-1941 period (see n. 25 
above). 

A combination of equation (11) with an expre.ssion for g"' in the 
form of equation (14) implies a reduced-form relation for output in 
terms of a constant, a time trend, DMR variables, g'f, gf, the B vari¬ 
ables, and A"l,. Unrestricted estimation of this reduced form affords a 
lest of the hypothesis that the determinants of^—specifically, theB 
variables and A?!,—affect output only in the manner implied by the 
forms of equations (11) and (14). The test is based on the likelihood 
ratio corresponding to unrestricted and restricted forms of joint esti¬ 
mation. The value of -2 • log(likelihood ratio) turns out to be 3.2, 
which is below the 5 percent critical value for the distribution with 6 
degrees of freedom (the number of coefficient restrictions in this case) 
of 12.6.'"’ Therefore, the hypothesis that the determinants ofg"' enter 
only in this indirect manner in influencing output is accepted. 


It IS woith noting that the point estimates for the government purchases 
coefficients in eq. (13) change little if the output equation is respccified in first- 
clillcrcnce torm (see Plosser and Schwerl [1978] for a discussion). The estimated 
equation in this rase is, for the 1946-78 .sample; 

D log(T,) = .036 + .mOiDMR,) + + .9lD(g“- - p), 

(.006) (.29) (.34) (.22) 

4- .550^ + 0.63Dgr. a = .0176, D-W = 2.5. 

(..39) (1.67) 

The )ointly estimated equation for is similar to that shown in eq. (12). Note that the 
constant in the equation for D log(T,) corresponds to the time trend for the level 
equation. The main change from the previous specification is the higher standard 
errors for the gf and gf coefficients. The Durbin-Watson statistic of 2.5 suggests 
overdil ferencing. 

™ If the discount rate p were regarded as a freely estimated parameter, there wouli 
be only 5 degrees of freedom, which would imply a critical value of 11.1. 
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There would, of course, be many possible output effects of war that 
do not operate through the channels that were specified in the pres¬ 
ent model. For example, there would be responses to conscription 
and patriotism, and the possibility that war would threaten future 
property rights. Some of these effects would, however, influence 
output in a manner similar to that of the pecuniary intertemporal 
substitution variable that was stressed in the theoretical analysis (see 
Barro [ 1981a] for a discussion of wartime influences on financial rates 
of return). In particular, the level ot temporarily high demands on 
resources by the government, (g" — may proxy satisfactorily for 
the full range of wartime output ef fects. 

More generally, because the war variables are the prime basis in this 
work for distinguishing temporary from permanent movements in 
government purchases, it would be infeasible to allow unrestricted 
direct wartime output effects and still carry out interesting tests of the 
underlying hypotheses. In any event the restriction that the war 
variables influence output only indirectly through influences on tem¬ 
porary government purchases is satisfied in the present case. 

Elimination of future values of the casualty rale variable from the 
government purchases equation (12) has a substantial effect on the 
estimation of this equation for the 1932-78 sample. Aside from a 
major deterioration in fit, the residuals then show pronounced posi¬ 
tive serial correlation. These effects are dominated by the World War 
II years—in particular, from the rise in military spending in 1940-41 
prior to the onset of casualties and from the major advance in spend¬ 
ing in 1942-43 before the peak in casualties for 1944-45. In ihe.se 
cases it seems reasonable to treat the future casualty values as rough 
proxies for contemporaneously available information about the inten¬ 
sity of the war. (Another possibility would be to use a foreign casualty 
rate variable, but data limitations and conceptual problems concern¬ 
ing the perceived threat attached to foreign conflicts have prevented 
the implementation of this idea.) 

For the post-World War II period, this type ot advance informa¬ 
tion on war intensity seems less important and. in fact, the future 
casualty variables lack significant explanatoi) ptjwer for defense ex¬ 
penditures over this period. With these future values deleted and 
1946-78 samples used throughout, the results of the joint estimation 
are, for the 1946-78 sample: 

Dg‘,‘ = .264DB, -I- .226DB,_, - .027DB/l^ + .079DB, j 
(.061) (.043) (.021) (.022) 


- .30Z)*|'l,, 
(. 11 ) 


<f = .0128. D-W = 1.9; 


( 15 ) 
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log(y,) = 2.99 + .0340 • I + .ii»DMR, + \.\2DMR,_, 

(.06) (.0010) (.25) (.26) 

+ .96(g“' - ^), + .52|f + .74g‘,\ ( 1 ^) 

(.26) (.17) (.52) 

<> = .0177, D-W = 1.5. 

Although the fit of the estimated otitput equation (16) i.s poorer than 
that shown in equation (13), the pattern of estimated coefficients and 
standard errors is similar. Therefore, the conclusions on output ef¬ 
fects of the government purrha.ses variables are not sensitive to, first, 
elimination of the World War II years from the sample for the gov¬ 
ernment purchases equation and, second, removal of the future val¬ 
ues of the casualty rate variable from this equation. The results also 
remain similar if tfie starting date for the output equation is shifted 
from 1946 to 1950, which removes .some contribution of the World 
Wat II years that works through the effects of lagged B values on the 
constructed gj'' variable. 


Adcbtion oj World War II Output Experience 

Rather than insulating the results from World War 11, it is in many 
respects more informative to evaluate the performance of the model 
dttring this extreme experience. Clearly, the sample variation in the 
wartime-related variable, (g"’ — g"'),, is raised enormously by this 
extension of coverage. On the other side, the inclusion of the World 
War II years raises problems that concern the measurement of real 
output during a period of extensive price controls and the accuracy of 
linear sirecihcations tor extreme ohservatitjns. 

Jointly estimated eciuations that include the 1942-45 observatirms 
on output are, for the 1932-78 sattrple: 

Dg'/' = .163/)/f,+2 -I- .196071, H + .2740/1, -H .242DB, , 

(.013) (.013) (.015) (.017) 

- .022/7/1, 2 + .087/)/l,_,., - .250/;;'.,, 

(.015) (.016) (.08) 

d = .0143, 1)-W = 1.7; 
and for the 1942-78 sample: 

log(r,) = 2.97 -I- .03.50 • t + .76DMR, + .90DMR, , 

(.05) (.0009) (.15) (.15) 


( 18 ) 
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+ -p), + A2'^+ .Mg',', 

(.06) (.12) (.51) 

a- = .0154, n-W = 1.4. 

A lest that the 1942-45 observations (or output conl'orm to the 
.same structure as that for the other years corresponds to a value lor 
— 2 • log(likelihood ratio) of 37.9, which exceeds the 5 percent critical 
value of the distribution with 4 degrees of’ freedom of 9.5.” It 
seems likely that this appearance of structural break during World 
War II would not appear if the functional form were altered to allow 
for a nonlinear dependence of logfK,) on (g"' — ^),. .Specifically, the 
most important change from the 1946-78 estimates in equation (13) 
to the 1942-78 values in equation (18) seems to be the drop in the 
estimated coefficient of the (g“’ - g"), variable, from .99, S.E. = 
.21, to .71, S.F.. = .06. A functional representation that allowed for 
persitive, but diminishing, output effects of temporary government 
purchases would probably account for the overall results in a 
homogeneous form, but 1 have not experimented along the.se lines. 
As the results in eqttation (18) stand, they reveal the anticipated 
reduction in the standard error of the (g" — ^), coefficient—in fact, 
if this relation were viewed as well specified, the estimated output 
effect of temporary government purchases, /S,, would now be mea¬ 
sured as significantly less than one.^^ 

■" 1 have also earned om the estimation with an allowance for hctci ostcd.istuil) in the 
toini ol a dilfcrent error sariaiice tor the output equation duiiii)' ihc World War II 
years. 1942-4.5. Maximum likelihood estimates indicate that those yeais hate an error 
variance that i.s 2.6 tunes that lor the 1946-78 period, winch cori es|x>nds to multiplying 
the 1942-45 ohscrvations by 62 in theestnnaiion. I he results with the lieteioscedastu- 
ity correction applied arc, lor the 1942-78 sample. 

log(K,) = 2.98 + .0347 ■ t f .H5i»M«, + 1.01, 

(.04) (.0009) (.17) (.17) 

+ -75(g" - g^), + .48^ + .34fif. 6 - .0148, O-W ^ 1.4 
( 07) (.12) (53) 

Note that the a value applies here to the error teim lor the 1946-78 peiiixl. The 
paltci n ol restilis does not diller greatly Iroin that shown in erp (18) flic )oinlly 
estimated government purehases equation is very close to that shown in cq. (17). .\ test 
lor the addition of llie 1942-45 years tor output to the rest of the sample corresponds 
here to a value for —2 ■ log(likelihooil ratio) of 30 9, as compared to a 5 percciii x' 
value with 3 degrees of freedom ol 7 8. ( The dcgiees of fieedom aie icducecl hs I here 
III comparison with the test lor the context o( hcnnosced.istii iiv because ol ihe estima¬ 
tion of tfie helerosccdasticity parameter.) 

“ The estimated coelhcient for nondefense pun bases, gf,-is also smaller than hefore, 
but still insignificantly dilferent irom that on the variable. VVilhgJ' held lived, the 
implied estimated coclficient on ^ is now - 29, S.K = , 12. fhat is, tile hypothesis that 
temporary and permanent defense purchases have equal output ellects—13, = g ,— 
cot responds here to a (-value of 2.4. This hypothesis would, therefore, again lie re)erted 
in favor cif g, > /3j. A test that the determmancs of^ from cq. (17) enter only in this 
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Expectations nj Future Wars 

Calculation oi expected 1 uture casualty rates is based on the following station¬ 
ary probability model lor wars.'“ First, a 2 x 2 matrix is specified lor the 
probability of war or peace next year (or rather for year / + 3 when conditions 
at / -I- 2 are assumed known at date /), conditional on war or peace prevailing 
currently. It is assumed that information about the future course ol B is 
contained hilly in the most recent observation, earlier values of 5 and values 
of other variables not having to be considered. The probability of war during 
at least part ol next year, given peace for the latest observation, is based on 
data over the 1774-1978 period—namely, 

p, = Prob(d,+, > 0 ] B, = 0) = 9/162 = .06, (A7) 

where 162 is the total number of peacetime years in the sample (where B, = 
0), and 9 is the number ol tht-.se years that were followed by the outbreak of 
war.*' Correspondingly, the probability ol peace continuing is given by 

(1 ~ p,} = Prob(B,+, = 0)B, = 0) = .94. 

The value ol p, is slightly higher ilThc sample is limited to the more recent 
period 1889-1978 (the .sample for which relatively accurate observations on 
G"' and B are available), for which the result is p, = 5/68 = .07. 

fhe probalnlity of the continuation ol war is given for the 1774-1978 
sample liy 

p.i = Prob(B,^., > 0 |B, > 0) = 33/42 = .79, (A 8 ) 

where 42 is the number of war veais (where B, 7 ^ 0), and 33 is the number of 
these that weie followed by another year of war.*** In other words nine wars 
begtin and ended over the sample 1774-1978. For the 1889-1978 period, the 
result would be p 2 = 16/2 1 = .76. Finally, the probability of no war next year, 
given Its existence this year,'* is given for the 1774-1978 sample by 

(1 - pi) = Prob(B,^., = 0)B, > 0) = .21. 

fhe expected value ofB tor the first year of a war is calculated as the mean 
value for the five wats .since 1889 (for which accurate data on B are available): 


" War probabiliues and the distribution ol si/cs ol wars need not be constant over 
time, although there is no indication of substantial sluictuial change in ihe small 
sample ol evidence allorded by the 200 yeais of U.S. history. (The largest value lor the 
B variable would actually apply to the (avil War—see the notes to table 1.) Fioin the 
slandpoini ol constructing the gP variable, shifts in the stochastic structure for wars 
would essentially be an alternative to the present specification that allows for shifts in 
spending lor a given war stiucture. as represented by the stixhastic variable e, in eq. 
(.A 1). In the context of output aiialy.sis, it is unclear that there would be much empirical 
difference between these alternatives. 

" The year 1978 is not included in this calculation, although it could have been if 
peace during 1979 were also included. War years are taken to be 1775-83, 1812-1.5, 
1846-48, 1861-65, 1898, 1917-18, 1941-45. 1950-53. and 1964-72. There may be 
some objection to starling the sarnplejust before a war (which is not independent of the 
start of U.S. data), but the results are not highly sensitive to this choice. 

” The probability p 2 refers to the existence of war during at least part of a year 
following a period of war during at lea.st part of the previous year. 

'“This calculation pertains to the existence of peace over the entire year / + 1. 
conditional on war during at least part of year /. 
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p 

Mo 

Ml 

.01 

1 95 

2.33 

.02 

.94 

2 26 

.05 

..35 

2.06 

.10 

.16 

1.80 

25 

051 

1.30 


B , = 0 ) 

= 7(.005 + ,23 + .004 + .071 + .001) = .062. 
5 


(A9) 


Since war could break out at any lime dtiring the year, the annualiced value ot 
denoted by S'*, would be roughly twice the above hgure—that is, S'* = 

.124. 

Finally, when are both positive, the conditional expectation for 

B, IS given by 

E(B,^^\B„, > 0,B,>0) = e,Bf, 

where Bf i.s the current casualty rate expressed at an annual rate if hostilities 
applied only to a fraction of year t. The parameter is based on the 
assumption (not refuted by the small sample of U.S. data) that wars tend 
neither to grow nor to contract over time, except that war may end at some 
time during year / + I as governed by the parameter// 2 . Accordingly, f^, = 1 
- 'M 1 - pj) = -SO- > he parameter da is set so that the value of the 'F, variable, 
which appears in equation (A 6 ), converges to the value associated with = 
0 as 21 ,^ 2 -» 0 (which essentially recognizes that a new war may break out next 
year, even il one is already going on). The value of 0^ turns out to be S,, 

= .007. Accordingly, I use the relation 

E(B,^,\B,^t > 0,fl( > 0) = .007 + ,905/. (AIO) 

Equations (A7)-(A10) allow calculation of the relevant expectation of fu¬ 
ture casualty rates, which appears in equation (A6), sF, = 2 ,*.ii 5 (+i+ 2 Ai ■*" P) > 
conditional on observation ot B through 8(+2 and for a given value of the 
discount rate p. The result takes the form 

'Ff = Mo Mi^t*+ 2 , (All) 

where m„ and p, can be determined as functions of the p parameter." 
Specifically, the.se coefficients for selected values of p can be seen in table A1 
Since p corresponds to the difference between the real rate of return and the 
real giowth rate, the values ot the p coefficients corresponding to the lower 

The general formulae are; 

Mo = [Bpi(l - pjKl + P + fiA) 

+ p20o(l + P)(P + Pl)lf[pfl + P + pi “ Pjlff + P ~ Ps®l)]. 

Ml = pjPi/fl + P - 
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values of p would seem to be most pertinent. The empirical results that are 
reported in the text use the value p = .02 per year. 

The combination of equation (All) with equations (A5) and (A6)_a]lows 
calculation of the fjerceived permanent government purcha.ses ratio, gi”, as a 
function of the variables (gf, .... £,+ 2 ) and the parameters (p, y, ao, 

«!, Oj, tta, fli, 02 ). where p is the net real discount rate, y measures the reaction 
of current defense purchases to existing capital stock, and the a’s and a’s 
describe the effect of the array of B variables on defense purchases. The 
results are therefore expressed in terms of the general form of equation (6) in 
the text. Other coefficients that appear in the analysis (S', b, p,, p^, B —see, 
e.g., the expressions contained in n. 37 above) are treated as fixed at the 
values: S' = .16 per year, b = .34,/), = .06, = .79, and fi = .062. 
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Information Costs, Duration of Search, and 
Turnover: Theory and Applications 


Louis L. Wilde 

Calijoniia hnitinie of I'frhnology 


riiis paper uses a (ormal model of search over multialiribule alter¬ 
natives, analyzed in a pioduct market setting, ttj investigate the 
theoretical foundations of the empirical literature on tiuratton of 
seart:h and turnover in product markets, labor markets, and mar¬ 
riage markets. A number of specilic empiiical predictions arc also 
det ived. In particular, whethet “quality” is a "search'' attribute or an 
"experience” attribute is related to the cost of search, the cost of 
itispettion, the price of the good, and certain properties of the 
marker disfriljutioit at ptice and quality 


I. Introduction 

Several years ago Phillip Nelson observed that certain problems arise 
in extending search theory to deal with nonhomogencous goods. In 
particular, lie noted that “inibrination about quality differs from 
information about price becau.se the former is usually more expen.sive 
to buy than the latter” (1970, p. 311). To analyze the implications ol 
this observation, Nelson divided goods into two classes, .search gottds 
and experience goods. Search goods are those for which utility is 
assessed before purchase by actual inspection. Experience goods are 
those for which utility is assessed after purchase by actual consump¬ 
tion. 

This paper is substatilially revised version «t “Furlher Results on Inspection and 
Evaluation in Product Maikels" (unpublished manuscript, California Institute ot 
Technology, February 1978). I would like to thank Steve Lippinan and Alan Schwariz 
lor very helplul coiniiients on an earlici draft. Comparative statics results are proved 
in an Appendix which i.s available from the author (California Institute of Technology. 
Social Science Working Paper no. 306, January 1980). 

[Journal of PnlUical hfoiumii, 19S1, vot 89, iia fi| 
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These definitions turn out to be very strong. So strong, in fact, that 
they lead to some difhculties. Consider, for example, experience 
goods. For these goods utility is assessed after purchase by actual 
consumption. But at least the price is observed before purchase. Since 
for experience goods this cannot, by definition, affect the choice of 
which brand to buy. Nelson was forced to a.ssume that “consumers 
either sample at random from among all brands or from among those 
brands in the price range the consumer deems appropriate for him- 
selF’ (1970, p. 313). These assumptions require that consumers either 
ignore prices completely or have perfect information regarding 
prices, neither of which seems likely. 

The problem with Nelson’s definitions is that goods generally pos¬ 
sess a number of characteristics which can dif fer in their degree of 
observability. 'I'hus a good might possess some “search characteristics” 
and some “experience characteristics.” Furthermore, whether a par¬ 
ticular characteristic is a search characteristic or an experience char¬ 
acteristic ought to be endogenously determined by the consumer. 

In previous work, I have analyzed a model in which goods are 
described by precisely two characteristics, price and quality. The mar¬ 
ket is described by an exogenous distribution of these characteristics. 
Sampling from this distribution is assumed to be costly. Once an 
observation is drawn, price is observed costlessly. Quality, however, 
can never be observed before purchase. In this case quality is an 
extreme example of an experience characteristic, one for which the 
cost of observation piior to purchase is infinite. The present paper 
extends this model to allow quality to be observed at some finite cost. 
The purpose is to explicate the relationships between information 
Costs, duration of search, and turnover.' 

This model, introduced in Wilde (1980), applies to a number of 
cases discussed by Nelson. For example, it formalizes his proioiypic 
experience good, canned tuna fish. Nelson suggestetl that “to evaluate 
brands of canned tuna fish, for example, the consumer would almost 
certainly purchase brands of tuna fish for consumption. He could, 
then, determine from several purchases which brand he preferred. 
For tuna fish there is no effective search alternative open. .At the low 
price of experience, there is insufficient demand for specialized es¬ 
tablishments selling tastes of various brands of tuna fish” (1970, p. 
312). In this case, Nel.son seems to be suggesting that the “price of 
experience” is low because the price of the gtK>d us, low. How'escr, the 


* I’he lormal analysis in ihis paper will Ik* sei in tht* pioduci inarkci In this case 
turnover is a iionrcpeai sale. Sec. i\' will ciisciisN applw alums u> the l.Uxiv luai kei aiul 
the niainage market. In the lalxir rnarkci, uirnovei is €i<|uii arul in the marriage market 
turnover is a divorce 
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price of experience can be low for other reasons as well. For example, 
if there is little variance in quality, then the price of experience is low 
because there is little chance of purchasing a low-quality good. In fact, 
it will turn out that by allowing quality to be observed before purchase 
an explicit expression for the price of experience can be derived. 
Utilizing this expression, this paper will also explicate the relationship 
between information costs and the price of experience. 

'The ftKus of the model developed in Section II is on an imperfectly 
informed consumer who is interested in maintaining a flow of con¬ 
sumption of a good which is described by price and quality. The 
market offers various combinations of price and quality, but the 
consumer cannot costlessly observe them; by paying a search cost he 
or she can .sample a good from the market, but only price is observed. 
Quality can be observed either before purchase by actual inspection 
(at some additional cost) or after purchase by actual consumption. 
Whether quality is observed before purchase (herein called inspec¬ 
tion) or after purchase (herein called evaluation), the consumer can 
return to the market and resample if the observed quality is too low. 
The initial problem is to characterize the optimal strategy for a con¬ 
sumer in such an environment. For a fixed utility function, joint 
distribution of price and quality, and cost of search, three possibilities 
arise depending on the cost of inspection: (1) If the cost of inspection 
is low enough, inspection will be the optimal strategy for low prices 
and drawing a new observation will be the optimal strategy for high 
prices. (2) If the cost of inspection is of an intermediate amount, 
evaluation will be the optimal strategy for low prices, inspection will 
be the optimal strategy for intermediate prices, and drawing a new 
observation will be the optimal strategy for high prices. (3) If the cost 
of inspection is high enough, evaluation will be the optimal strategy 
for low prices and drawing a new observation will be the optimal 
strategy for high prices. 

The formal results of the paper relate to the characterization of the 
consumer’s optimal strategy which is summarized by statements 1-3 
above. However, these results have broad implications. Three im¬ 
portant general observations emerge from the analysis. 

First, as mentioned above, the price of experience can be defined 
analytically. This is important because it allows one to differentiate 
between the direct, short-run benefits of purchasing a good which are 
derived from its consumption and the indirect, long-run benefits of 
purchasing a good which are derived from evaluation of its quality 
attributes. 

Second, it will be demonstrated that in some cases quality will be a 
pure search characteristic (statement 1 above), in some cases it will be 
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neither a pure search characteristic nor a pure experience character¬ 
istic (statement 2 above), and in some cases it will be a pure experience 
characteristic (statement 3 above). Hence, not only is determining 
whether a particular good is an experience good or a search good a 
complex matter, but so is determining whether a particular charac¬ 
teristic of that good is an experience characteristic or a search char¬ 
acteristic. Since these disdncdons have become very popular in the 
literature it is important to understand their limitations. 

Finally, the comparative statics associated with the characterization 
of the consumer’s optimal strategy will show that the theoretical 
foundations of much of the empirical work on duration of search and 
turnover in labor markets and marriage markets are unsound. 

A number of specific empirical predictions are also derived. It is 
shown that quality is more likely to be a search attribute (1) the lower 
is the cost of search, (2) the lower is the cost of inspection, (3) the 
higher is the price of the good, and (4) the larger is the variation in 
utility due to quality reladve to the variation in utility due to price. 
Similarly, it is shown that quality is more likely to be an ex(>erience 
attribute (1) the higher is the cost of search, (2) the higher is the cost 
of inspection, (3) the lower is the price of the good, and (4) the smaller 
is the variation in utility due to quality relative to the variation in 
utility due to price. 

It is also shown that regardless of whether quality is a search 
attribute or an experience attribute, increases in search costs decrease 
the duration of search. In general, increases in inspection costs have 
ambiguous effects on the duration of search, but it is argued ibat if 
the variation in utility due to quality relative to the variation in utility 
due to price is low, increases in inspecdon costs are likely to increase 
the duration of search (at least by one measure). 

Finally, considerable attention is focused on turnover. Again, in the 
most general setting results tend to be ambiguous. This is especially 
true regarding the relationship between search costs and turnover. 
However, it is argued that if the variation in utility due to quality 
relative to the variation in utility due to price is low, increases in 
inspection costs are likely to increase turnover. 

This paper is organized as follows. Section II introduces the basic 
model. Section III considers the three cases mentioned above: (a) in 
which quality is always observed before purchase; (6) in which quality 
is observed before purchase for some prices but is observed after 
purchase for other prices; and (c) in which quality is always observed 
after purchase. Section IV discusses the empirical implications of the 
model in the labor market and the marriage market. Section V pro¬ 
vides a brief conclusion. 
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II. The Model: Notation, Definitions, and Assumptions 

In this section the basic model will be developed and an analytical 
expression for the price of experience derived. Section III provides 
comparative statics for the three cases mentioned in the introduction. 

Assume the good which is sought by the consumer has a lifetime of 
one period. Let U{p, q) be the total net value to the consumer of 
purchasing and consuming the good characterized by price p and 
quality q, where U is differentiable and bounded on R+ x with 
auldp < 0 and dU/dq > 0. Let <f>{p, q) be the consumer's subjective 
estimate of the market density of P and (X For rnatliematical conve¬ 
nience, assume <!> is stiictly positive on x R^. Define/(/>) as the 
marginal density ofP andg(^ | p) as the conditional density of () given 
F = p, both based on 4>. 

The cost of drawing an observation at random from (p is r<, where 
Cs 3= 0. The cost of observing the true value of Q prior to purchase is 
Cr> where 3= 0 (both c,s- and c-r are measured in the same units as U). 

The consumer can sample as many observations as desired from (p 
at the beginning of each period. Any number of inspections are also 
allowed. However, the consumer demands precisely one unit of the 
gootl each period.^ The consumer’s objective is to maximize his or her 
expected discounted utility of consumption net of search costs. Sam¬ 
pling is assumed to be without recall, the horizon is infinite, and the 
discount rate is )3, where 0 < y3 < 1. 

Now suppose the consumer has sampled a good priced nt p. Three 
reactions are possible: sample again without observing quality; inspect 
quality and then either buy the good forever or sample again; or 
evaluate iiuality and then either buy the good forever or sample 
again. Let v(p) he the expected value of drawing an observation ofp 
and then proceeding optimally. Then 

td/i) = -Cs-+ max {L,B(/)),/’(/>)}, (1) 

where F is the exjtected value of search (i.e., the expected value of 
ul/i] taken with respect to f),B(p) is the expected value of buying the 
good without observing quality and then proceeding optimally, and 
T(p) is the expected value of testing quality prior to purchasing the 
good and then proceeding optimally. 

To define U(p) and T(p) analytically, observe first that, once quality 
is known, the value of the optimal policy is the same whether quality is 
observed via inspection or evaluation. In other words, once quality 
has been observed, it is irrelevant how it was observed—any asso- 


^ Search and inspection are assumed u> be timeless in order to avoid confounding the 
direct costs of these activities with the opportunity cost of delaying the purchase 
decision. 
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dated costs will by then be sunk costs. Define the value of the optimal 
policy once quality is known as k{p). Then 

k{p) = yG[<,*{p)\p]-^( g(q\p)dq, (2) 

where q*(p) is that quality level which makes the consumer intliflercni 
between consuming the go<td characterized by [p, q*(p)] and search¬ 
ing again for a new good from <f). I'hat is, q*(p) is dehned by U[p, 
q*(p)] ~ ^^{1 ~ l3)- T he logic of (2) is that if ^ < q*(p), then the 
consumer rejects the good and samples again, receiving V (the ex¬ 
pected value of search). This happens with probability 6'[^*(/>) |/)]. If 
q S' q*{p), then the good is acceptable and the consumer receives the 
conditional expected value oi U(p, q)/(\ — (i), given q ^ q*(p)- 

Using A(/7), both B(p) and T(p) are define*! straightforwardly. It the 
consumer buys the good priced at /», he or she receives the expected 
utility of consutnption, given quality is unknown, plus the discounted 
value of an optimal policy once quality is known. Hence 

B(p) = EU{p.(l) + (ik(p). (3) 

If the constnner tests for quality prior to purchase, he or she pays c-p 
and receives the undi.scouiited value of an optimal policy once quality 
is knowti. Hence 

l'(p) ^ -Cp + k(p). (4) 

from (3), by adding and subtracting A(/t) and rearranging,can 
be rewritten as /t (/t) = —[(1 — ^)k{p) — EU(p, Q>)] + k(p). The term in 
brackets is a cost which is directly comparable to Cp. I'hat is, one can 
define 

f„(p) = (\ ~ I3)k(p) - EU(p,(l). (5) 

Then B(p) = -c„(p) + k(p). 

Equation (5) provides an analytical expression tor Nelson’s price of 
experience. Furthermore, it has a natural interpretation. Recall that 
k(p) is the expected value ot an optimal policy once quality is known, 
given the observed price is p. When cjiiality is observed via actual 
consumption, this value is not obtained for one period (since in this 
model the good lasts for preci.sely one period). Hence (1 — (i)l<{p) is 
the gross opportunity cost of consuming the good, given quality is 
unknown. But consumption of the good yields utility, in this case 
^f^ipy Q.)- fhc net opportunity cost of consuming the good, given 
quality is unknown, is the difference between these two quantities. 

It) analyzing the optimal policy it will be convenient to make a 
transformation of variables in the definition of k{p). Since it is ulti- 
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mately final utility which matters to the consumer, the focus of (2) can 
be shifted from the conditional distribudon of quality given price to 
the conditional distribution of utility given price. That is, let | p) 
be the conditional distribudon of utility given P — p. Then k(p) 
becomes 

k{p) = yny(i -^)\p] + 


where z(p) = lim 4 _,<„t/(/>, g). Integradng (6) by parts gives 


k(P) = (1 - P) 



h p> 


'l'(u) I p)dw 


( 7 ) 


This form of k(p) is intuidvely less appealing than (6), but it is more 
powerful. For example, using (5) it is easy to show from (7) that 

^(w\p)dw. (8) 

Z<P) 

where z(p) — U{p, 0). Expressing the price of experience in this form 
is useful analytically since it makes the comparison between B(/)) and 
T(p) easier. Conceptually, it helps idendfy factors which might affect 
the decision whether to observe quality before purchase or after 
purchase. For example, suppose the cost of search, c^, increases. Then 
surely the value of an optimal policy will fall, that is, dVIdcs < 0. 
Equation (8) suggests that the price of experience will then fall as well. 
Hence the higher is the cost of search, the more likely it is that 
evaluation will be optimal. These and other results will be formalized 
in the following section. First, however, a few more preliminary as¬ 
sumptions will be needed. 

Using the definition of Cnip) introduced above in equation (5), the 
functional equation (1) can be rewritten as 

v(p) = -Cs + max {F, k(p) - c„(p), k{p) - Ct}.^ (9) 


The next step in characterizing an optimal policy is to compare V,k(p) 
— Cg{p), and k{p) — Cy. Unfortunately, without more structure on the 
joint distribution of price and quality, any number of things can 
happen. Nelson recognized this problem as well, stating the following: 
"Prior to using [a] brand, all the consumer knows is its price. But this 
knowledge provides only the roughest sort of guide to choice, for the 
consumer must assume a generally positive relationship between price 


“ It is convenient at this point to assume that the functional equation (1) has a unique, 
bounded solution. In this case V is unique and well defined. Existence of a bounded 
solution to (1) is straightforward. Uniqueness can be established along traditional lines 
il search and inspecuon are not assum^ to be timeless (see Wilde 1979) or by appealing 
to the appropriate generalization of MacQueen and Miller (I960). 
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and quality. In the absence of any other information, the consumer 
would not know if he were better off experimenting with low- or 
high-priced brands” (1970, p. 313). To get around this problem, 
Nelson converts the joint distribution of price and quality to a dis¬ 
tribution of net utility and proceeds under the assumption that evalu¬ 
ation is always used to observe quality. In the present analysis, since 
the decision whether to observe quality before purchase or after 
purchase is endogenous, a more formal approach is needed. The 
standard assumption is that d'V{-u)\p)ldp > 0, This assumption im¬ 
plies that, on average, higher price is associated with lower utility even 
though higher price may well be associated with higher quality. Be¬ 
cause this assumption is discussed at length in Wilde (1980), it will be 
assumed to hold here without further rationalization.^ 

Several implications follow directly from the assumption that 
d'p(w I p)iap > 0. First, it implies that the price of experience is 
increasing in the price of the good. To see this, simply take the 
derivative of (8) with respect to p: 


<^a(P) = f. 




J -V(1 
2( p) 


(P) 

V(l-fi) 


[d'F(a/1 p)/dp]dw - 'lt'[z(p) | p]i'(p) 


since M'[z(p)lp] = 0 by definition. Second, d'F(u'|p)/dp > 0 implies 
that observing quality before purchase becomes a less desirable alter¬ 
native to sampling again as the obsei vcd price increases. That is, using 
( 6 ). 

J hp) 

[d'F(ii; I p)/dp]dw « 0. (10) 

Moreover, B'{p) = k'(p) — c'gip) s T'(p) since c’g{p) ^ 0. Hence 
observing quality after purchase also becomes a less desirable alterna¬ 
tive to sampling again as the observed price increases, and it does so at 
an even faster rate than observing quality before purchase. 


III. The Optimal Policy 

It turns out that three qualitatively distinct forms of the optimal policy- 
are possible. If Ct is low enough, then inspection always dominates 
evaluation. If Ct is somewhat higher, then evaluation dominates in- 

* Even in the case where cy is infinite, some formal structure must be placed on Mr. See 
Wilde (1980) or Hey and McKenna (1981) lor more details. 
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spection for one set of prices and inspection dominates evaluation for 
another set of prices; and, if cr is high enough, then evaluation always 
dominates inspection. These three cases are analyzed next. 

A. Inspection Only 

In comparing the expected value of observing quality before pur¬ 
chase to the expected value of observing quality after purchase, the 
crucial parameters aie Cj- and ci,(p), the respective costs of these two 
activities. It is clear I torn (3) and (4) that/!(/)) %T{p) ds.cr § Cn(p). But 
r„(p) is increasing in p so that ffl(O) c-i implies Ci,{p) ^ Cy for all p ^ 0. 
Hence the expected value of observing quality before purchase will 
always be greater titan the expected value of observing quality after 
purchase when c„(0) > Cy. This is obvittusly most likely to be the case 
when Cn(0) is large and Cy is small, liquation (8) suggests that Ch( 0) is 
most likely to be large when K(1 - (3) is signihcantly greater than z(0). 
But V(l — fi) = (/[(), (/*(0)1 and z(0) = f/(0, 0). Hence Crt(O) is most 
likely to be large when ^*(0) is high. In other words, inspection is 
likely to dominate evaluation for all prices when the cost of inspection 
is low or when few cpiality levels are acceptable even at low prices. The 
latter is the case, for example, when the cost of drawing observations 
I rom (p is low or the variation in utility clue to (juality is high relative to 
the variation in utility due to price.’’ 

Iti the remaitider of this subsection it will be assumed that fn(0) 

Cy; that is, it will be a.ssumed that 

r yn 01 

I A'(w I 0)du' ^ fy. 

J ?((» 

This implies that/I(/i) ^ T{p) for all p ^ 0, in which case B(/>) can be 
ignored conijiletely; characteri/ing the optimal policy ieduces to 
comparing '/'(/») and I'. I'wo po.ssibilities arise. In the first there exists 
a unique finite price, say p*, such that observing i]uality prior to 
purcha.se is optimal for p ^ p* and sampling again is optimal for p 9 
p* (see lig. 1). The critical price is defined by V = T{p*).*' In the 
second, K is strictly less than T(p) lor all5= 0 so that ob.serving quality 
prior to purchase is always optimal.^ 

■’ rhcrc IS no giiaraMlc’c that ((,(/») > 0. In pai ticulai, it niijrlil he the tasc that f/,(0) = 
0 li tan lx,- shown, howeser, lh.it there exests e ^ 0 such that f«(0) > 0 it i,, < c. 

“ .Since \p}/dp > 0 only implies T'{p) « 0, it is possible that the ccination Tip) = 

V does not have a unique solution. However, it this is the case, then -{/i] Tip) = T} 
(|)*, X), wtieie pf = ini {p { T{p) = T}. U p* i.s defined in thi.s fashion, it satishes the 
tormal icquirenieius stated m the text. 

' Formal proots ot these assertions have been omitted since they are trivial. It is 
apparent, howevei, that sinc;c r«(0) ^ 0, there is always a small enough value ot r, 
(possibly 0) such that Bip) s Tip) for all /) 2^ 0. 
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Fu,. 1.— Dcfiniiion of/>**! 


Assume that/)* exists and is finite. Then p* andi/* partition R + x R + 
into three sets (see fig. 2). In region l,p > p*, so the good is re;jected 
outright. In region l\,p ^p* but^ < q*{p), so that quality is observed 
prior to purchase but the good is subsequently rejected (i.e., nt)t 
purchased). In regioti III, p ^ p* and q ^ q*(p), so that quality is 
observed prior to purchase and the good is subsequently accepted 
(i.e., purcha.sed). 

How do changes in Cs and cj. alTect this partition? To answer this 
question one needs to know how changes in Cs- and Ct affect />J and q*. 

I’he tollowing results are straightforward but tedious and can be 
found in the (unpublished) Appendix to this paper. It is shown there 
that dppdcs > 0 and dppdcf < 0. Of course V lalls as either Cv or 
rises. Since U[p, q*(p)] - V{\ - /3) by definition and U is increasing in 
q, this implies dq*{p)ldcs < 0 and dq*(p)/dfr < 0. In other words, an 
increase in the cost of drawing observations from (p will make inspec¬ 
tion an optimal strategy for more prices while an increase in the cost 
of inspection will make insjtection an optimal strategy for less prices. 
An increase in either cost will make more quality levels acceptable for 
any given price. 

Next, consider how changes in c.v and Ct affect the number of 
observations which must be drawn from <P (whether or not quality is 
inspected) before ati acceptable gcKtd is found. Define 



tie.. 2,—I’vice-quality combinations, case .I; I = re)ecl outright, 11 - inspect and 
subscejuentiy reject, ill = inspect and .sub.sequciidy aciepi. 
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h={'^{\-G[q*{p)\pMf{p)dp 

0 

and 

h- = \^G[q*(p)\p]fip)dp. 

•f 0 

Here /f is the probability that a random price-quality combination will 
be inspected and rejected, and Ig is the probability that a random 
price-quality combination will be inspected and accepted. It is a^ain 
shown in the Appendix that the following hold; dis/dcg > 0 and 
dlf/dfs § 0, while dljdcr § 0 and dfi.-ldcr < 0. 

Consider first an increase in the cost of drawing observations. Since 
p* increases, inspection becomes an optimal strategy for more prices. 
Furthermore, since q*(p) decreases, more quality levels are acceptable 
for any given price. Hence the expected number of ob.servations 
which must be drawn from 4> before an acceptable good is found 
{Ills) fall.s. The effect of an increase in Cs on Ip is ambiguous, how¬ 
ever, because the decrease in q*{p) counteracts the increase in p* 
rather than reinforcing it. 

Precisely the opposite happens when the cost of inspection in¬ 
creases. Since/ij; falls, inspection becomes an optimal strategy for less 
prices. Furthermore, since ^*(/>) still decreases, fewer quality levels are 
acceptable for any given price. Hence /^- falls. Since the effect on Is is 
ambiguous, it is impossible to assert that an increase in the cost of 
inspection reduces the expected number of observations which must 
be drawn from before an acceptable good is found. 

B. Inspection and Evaluation 

It was assumed throughout Section IIIA that c^jO) 3= Cj-so that inspec¬ 
tion dominated evaluation for all prices. Recall that 

'*'(«’ I 0)dur. 

z(0> 

It is apparent from this equation that Scg(0)/dc,v < 0 and dcg{0)ldcr < 
0 since V is decreasing in either cost and 'I''(w | 0) > 0 for w close to 
z(0). Hence increases in either or Cf make it less likely that inspec¬ 
tion will dominate evaluation for all prices. 

Assume that Cg(Q) < Cr- Then two p>ossibilities arise. In the first, 
inspection is optimal for one set of prices and evaluation is optimal for 
another set of prices. In the second, insjiection is never optimal. The 
first possibility will be analyzed now and the second will be considered 
in Section IIIC.* 

* Again, formal proofs that ranges of Cg and Cj- exist such that both possibilities occur 
have been omitted. The text following identifies two necessary conditions for inspection 
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Kig. 3.—Delinition of pSr and pf 


In order for inspection to be optimal for one set of prices and 
evaluation to be optimal for another set of prices two conditions must 
be met. Assuming Cg{0) < Cj, the first can be stated as follows. 

Condition 1; The cost of inspection must not be so great that 
evaluation dominates inspection for all prices. 

When condition 1 holds, since B'(p) =S T\p'\ ^ 0, there exists a finite 
price p%T such that evaluation dominates inspection for p « ptr and 
inspection dominates evaluation for p 3= p*r- The critical price is 
defined by Tipm) = Bip'Sr) (see fig. 3). 

Assuming condition 1 holds, the second condition can be stated as 
follows. 

CoNnnioN 2: The expected return to search must be low 
enough that for some prices inspection dominates drawing 
another observation from <^. 

Condition 2 requires that for some/? > Pbt< T(p) > V. However, as in 
case d, it might be that T{p)> V for all p & Assume this does not 
hold. Then there exists a finite price/?* such that inspection is optimal 
for/? E [pJr,/?*], and drawing another observation from <i> is optimal 
for/? /?!. Again, as in Section IlIA, the critical price is defined by 
T(/??) = V (see fig. 3).» 

Overall, the situation dealt with in this case is the most interesting of 
the model because it shows that quality can be a search characteristic 
for some prices and an experience characteristic for other prices. 
That is, the configuration of udlity, search costs, inspjection costs, and 
the joint distribution of price and quality are such that inspection, 
evaluation, and drawing another observation from d> are all optimal 
strategies for various prices. In general,/?};,/?)} 7 -, and q* partition x 
R+ into five regions (see fig. 4). As in cased, region 1 includes prices 


to be optimal I'or one set of prices and evaluation to be optimal for another set of prices. 
There seems to be little value in making these more formal. 

'Again, since d'if(ui\p)/dp > 0 only impliesB'(p) « 0, the soluuon to the equauon 
T(p) = B{p) may not be unique. In this case {p\T{p) = B(p)} “ {p%T- “) where/iJr = tnf 
{p\T(p) = B(p)}. Ifptr is defined in this fashion, it sausfies the formal requirements 
stated in the text. 
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Fk'.. 4.—Price-quality combinations, case B- I = reject outright, II = insjjecl anti 
iilDmaiely reject, III = mspett and iiluinalely accept, IV = purchase once and only 
once, V = purchase once and forevermore. 


for whicli rejecting the good outright is optimal. In region < p 

« p* btit q < q*{p), so that quality is observed prior to purchase but 
the good is subsequently rejected; and in region llI,/)* 7 - < p ^ p^, but 
q ^ q*(p), so that quality is observed prior to purchase and the good is 
subsequently accepted. I here are two additional regions, though. In 
region IV, /» « par and q < q*{p), so that the good is purchased 
without quality having been observed but is not repurchased. In 
region V,p « p*r and^ ^ q*(p), so that the good is purchased without 
quality having been observed and is repurchased in all subsequent 
periods. 

How do changes in Cs- and Cy al'lect this partition? As before, to 
answer this question we need to know how changes in c.v and Cy affect 
p*T, p*’ q*{p)- As in case A, dppdcs > 0 and dppdcr < 0. Also, 

dptpdcs > 0 and dp^Pdcr > 0. Finally, dq*(p)ldcs < 0 and dq*(p)ldcT 
< 0 . 

An increase in the cost of drawing observations will increase the set 
of prices for which evaluation is an optimal strategy. Furthermore, it 
will increase the set of prices for which either inspection or evaluation 
is an optimal strategy. An increase in the cost of inspection will 
increase the set of prices for which evaluation is an optimal strategy, 
will decrease the set of prices for which inspection is an optimal 
strategy, and will decrease the set of prices for which either inspection 
or evaluation is an optimal strategy. Finally, an increase in either Cy or 
fy will make more quality levels acceptable at any given price. 

It is again useful to consider how increases in Cs and Cy affect the 
probability that a random price-quantity combination will fall in any 
given region. Define 

fi>hr 

Es =1 {I - G[q*{p)\p]}f(p) dp, 

0 

Ey = I G[q*(p) I p]J{p)dp, 

J 0 



INFORMATION COSTS 


>*35 


and 


Is 


If 


r i>J' 
•'/'Sr 



{1 - G[q*(p)\p]}J(,p)dp, 


Glq*{p)\p]f{p)dp. 


Here If, Is, Ey, and Es are the probabilities a random price-quality 
combination will fall in region II, III, IV, or V, respectively. It can be 
shown that 


a/. 


dcs 

Furthermore, 

dis 

dcT 


i 0, 


aif dEs -4^ 

oc.s de.s- dcs 


io, 


S 0 . 


_|^<0,-4^>0, and -4^ 

OCf OCx OCt 


so. 


Finally, 


t^ sl>o and ^<^-s + g Q 

dcs dCf 

While the majority of these partial derivatives are ambiguous in 
sign, a number of interesting observations can still be made. First, the 
probability that a random price-quality combination will be acceptable 
(whether quality is inspected or evaluated) is given by Ig + Eg. Hence 
the expected number of observations needed to IcKate an acceptable 
good is \/{Is + Es)- As before, this quantity is decreasing in Cs and 
ambiguous in Cf. Second, both dEsIdcs > 0 and dEsldCr > 0. That is, 
the probability that a random price-quality combination will be pur¬ 
chased without quality having been observed and subsequently repur¬ 
chased is increasing in either cost. This is ljecau.se when either rj or cj. 
increases, the set of prices for which evaluation is optimal increases 
and the set of quality levels which are acceptable for any given price 
also increases. However, this necessarily implies dEf/dCs and dEfIdcj- 
are ambiguous in sign. This last observation is important. When a 
price-quality combination falls in region IV, the good is purchased 
without quality having been observed but is not subsequcntlv repur¬ 
chased. This "brand disloyalty” is analogous to a job quit in the lalxir 
market or a divorce in the marriage market. It is crucial to recognize 
that the likelihood of these events does not appear to be systematically 
related to either search costs or inspection costs. This point will be 
discussed in more detail in Section fV. 
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Kig. 5.—Definition of p* 



Fk.. 6.— f’rue-quality combinations, caseC: 1 = reject outright, IV = purchase once 
and only once, V = purchase once and forevermore. 


C. Evaluation Only 

The final case is one in which Cb(0) < Cr, but one of the conditions of 
case B does not hold.*® Under these circumstances evaluation will 
always dominate inspection for any acceptable price.” Both this case 
and case A are, in a sense, spiecial cases of caseB. Here the relevant 
critical price, ps, is defined by = V (see fig. 5). The elements of 
the associated partition correspond to regions I, IV, and V of case B 
(see fig. 6). Ot course Cr has no effect on this partition. The effects of 
c.s- are identical to those in case B (where pS now replaces pgr)- 

IV. Applications 

The model analyzed in this paper has obvious analogues in the labor 
market and the marriage market. The product market has been used 
as the setting to this point because much of the relevant literature 
deals with consumer behavior (e.g.. Nelson 1970; Lippman and 

This will certainly be the case when large enough, since V is bounded. 

'' That IS, B(p) ^ T{p) for any p such that V < max {B(p), T{p)}. 
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McCall 1979a; Wilde 1980; and Hey and McKenna 1981). However, 
many of the important qualitative implications of the model emerge 
more sharply in the labor market than in the product market. 

The labor market analogue concerns an unemployed worker 
searching for a job. This individual pays a search cost in order to 
sample vacancies, but only the wage rate is observed. Nonwage char¬ 
acteristics can be observed either by paying an inspection cost or by 
taking the Job. 

Two aspects of the labor market analogue are of primary inter¬ 
est—unemployment and the quit rate. Unemployment is related to 
the expected duration of search. The quit rate is related to the 
probability that a Job which is accepted will subsequently be rejected. 

Suppose that the distributions of net utility associated with Jobs, 
conditional on the wage rate, are stochastically decreasing in the wage 
rate. Then, in the most interesting case, there will be a range of low 
wages for which renewing search is optimal, a range of intermediate 
wages for which inspection is optimal, and a range of high wages for 
which evaluation is optimal (see Sec. IllB above). 

The effects of an increase in information costs on the duration of 
search seem straightforward. An increase in the cost of search makes 
both inspection and evaluation more desirable alternatives. Hence the 
duration of search should fall. An increase in the cost of inspection 
will likely have ambiguous effects on the duration of search because it 
makes evaluation a more desirable alternative, but it also makes in¬ 
spection a less desirable alternative. 

The effects of an increase in information costs on the quit rate are 
less obvious. The argument would seem to go as follows: A quit 
requires that two events occur; (I) a wage rate is observed for which it 
is optimal to take the job without observing its nonwage component 
hrst and (2) the nonwage component turns out to be too low so that it 
is optimal to quit and renew search once it is observed. But an increase 
in either search costs or inspection costs makes evaluation optimal for 
more wages. In particular, there are lower wages for which evaluation 
is now optimal. These Jobs must have higher nonwage components in 
order to be acceptable; that is, for them the probability of turnover is 
higher. Hence, the overall probability of a quit should increase as 
either cost increases. 

Unfortunately, these heuristics are incomplete because neither con¬ 
siders the fact that an increase in either c.v or Ct will decrease the 
return to search, making lower values of the nonwage component 
acceptable at any given wage. While this reinforces the argument 
regarding the relationship between information costs and the dura- 
fion of search, it weakens the argument regarding the relationship 
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between information costs and the quit rate. In fact, an increase in 
either cost could either increase or decrease the quit rate.** 

To justify these assertions formally the model of Section III must be 
used. Consider first the duration of search. There are really two 
measures which are of interest. First, Eg + Ig gives the probability that 
a random price-quality observation will ultimately be acceptable (re¬ 
gardless of how quality is observed). Second, Eg + Eg + Ig gives the 
probability that a random price observation will cause search to cease 
(although perhaps only temporarily). It was stated in Section HI that 


d(£.v + Is) ^ Q 
dCg 


and 


d{Es + Is) ^ ^ 
dcr 


It ran also be shown that 


d(Ef + Eg + Is) ^ Q 
dcg 


and 


d(Eg -f Eg -P Ig) g ,, 
dCg 


Purchases of goods for which quality has not been observed are really 
part of the search process. Hence l/(£'i- + Ig), which might be called 
the "pure duration of search,” is the proper expression for the ex¬ 
pected duration of search. However, empirically it would often be 
impossible to differentiate between observations which fall in region 
IV and observations which fall in region V. In other words, the 
observed cfuration of search would often correspond to \i(Eg + Eg + 
Ig), which might be called the "effective duration of search.” 

Sitnilat problems arise with respect to turnover. In fact, there are 
three measures of turnover embedded in this model, one ex ante and 
two ex post. I he ex ante measure is simply Eg-, it gives the probability 
that a random price-quality ob.servation will be purchased once and 
only otice. One ex post measure is what might be called the “pure 
failure rate for evaluation,” = Egl(Eg + Eg)-, it gives the conditional 
probability that a good will be rejected given that it is purcha.sed 
without quality having been observed. However, empirically it would 
often be hard to differentiate between observations which fall in 
region 111 and observations which fall in region V. Hence the other ex 
post measure is what might be called the “effective failure rate for 
evaluation,” Tf. = Eg!{Eg Eg -t- /a); it gives the conditional probabil¬ 
ity that a good will be rejected given that it is purchased (regardless of 
whether quality is observed before purchase or after purchase). It 
turns out that none of these measures is systematically related to 
either .search costs or inspection costs. 


See Uppman and McCal! (I979A) for an extensive discussion of these points in 
related model. 
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The third application of this theoretical framework is to the mar¬ 
riage market, d’he marriage market analogue concerns an unwed 
individual searching for a marriage partner. This individual pays a 
search cost in order to sample potential partners, but only some 
characteristics are observed. Other characteristics can be observed 
either by paying an inspection cost or by getting married. 

The aspect of the marriage market analogue which is of primary 
interest is dissolution. The most complete analysis of the relationship 
between information costs and probability of dissolution is provided 
by Becker, Landes, and Michael (1977). These authors consider two 
cases, one in which remarriage is impossible and one in which the 
remarriage market is identical to the marriage market. 

Consider Hist the case in which remarriage is impossible. When 
remarriage is impossible, the value of dis.solution is a constant. In the 
model analyzed in this paper an analogous assumption is that the 
value of not repurchasing a good is a constant, say V. Then instead of 
(3), B(p) would be defined as 

B(p)=E[U{p,(l)-t- ^(p), (3') 

where 

hp) VG[q*(p)\p\+ f t 

and q*{p) is defined by U[p, q*(p)] = L(1 - fi). The definition ofT(p) 
would remain as in (4). I'his modification aflects the comparative 
statics of the model in a straightforward way: forp « p%-rtd^*(p)l<icx — 
0 = dq*(p)/dcp. Hence as before, dEs/dc^ > Oand dEsIdcj > 0, but now 
dEf/dc^ > 0 and dEfIdCr > 0. Furthermore, as betore, dEsH^ts ^ 0 
and dEyldcj > 0, but now dEs/Bcr < 0 and dEf/dcr > 0, However, it 
remains true, even when the value of not repurchasing a good is 
constant, that F, and E^■ bear no systematic relationship to informa¬ 
tion costs. 

These results are, on the surface, consistent with those ol Becker, 
Landes, and Michael. These authors a.ssert that because “the proba¬ 
bility of entering into a mismatch” would be greater, “an increase in 
either the cost of intensive or extensive search would increase the 
probability of dissolution” (1977, p. 1150).''* The definition ot “pi ob- 
.ibility of dissolution” which these authors use to arrive at this 
theoretical conclusion is apparently The problem is that Ey is 
unobservable (it isA'^ which is observed), and even when remarriage is 

Intensive search (orrespoiid.s to inspeuioii .mcl extensive seaiih loiiesponds to 
drawing another observation trom <t>. Heme the tost of intensive se.irih is r, .ind ihe 
cost ol' extensive search is r,. 
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impossible, there is no systematic relationship between and cither 

Cs or Cfl 

The situation is even more difficult when the remarriage market is 
identical to the marriage market. Here, just as in the labor market 
analogue, none of the partial derivatives relating information costs to 
turnover can be signed. 

These are, of course, somewhat negative observations. In addition, 
some positive conclusions can be drawn from this analysis. It has been 
shown that an increase in search costs always decreases the duration 
of search. An increase in inspection costs will generally increase the 
duration of search if the effect on q*(p) is small relative to the effect 
on p*T or p%. This will be the case when the variation in utility due to 
quality is small relative to the variation in utility due to price. 

I'here appear to be no general conclusions to be drawn regarding 
the relationship between search costs and turnover. However, as 
above, an increa.se in inspection costs will generally increase turnover 
if the effect on q*{p) is small relative to the effect on p%r orp*. This is 
again the case when the variation in utility due to quality is small 
relative to the variation in utility due to price. 

V. Conclusion 

This paper has establi.shed a number of strong results which go 
against the grain of the extant literature. These results obtain because 
goods are viewed as multicharacteristic composites in which indi¬ 
vidual characteristics have specific informational properties. 

A number of extensions are possible. An obvious one is to consider 
goods which are described by more than one nonprice attribute. In 
this case the consumer would need to decide which attributes to 
observe before purchase and which to observe after purchase. At each 
stage, reservation levels would be defined as functions of the actual 
levels of those attributes already observed. Finally, the consumer 
would also need to decide in which order to observe attributes, A 
model of this nature would have important applications in psychology 
and consumer research, as well as economics. However, one might 
argue that actual consumers do not use optimizing strategies as com¬ 
plicated as this model would require. Rather, consumers would use 
various satisficing strategies. In fact, one might apply the same argu¬ 
ment to the model analyzed in this paper. 

There is much to be said for this argument, but satisficing strategies 
are even less well understood than optimizing strategies. Furlhei- 
more, most empirical work on the duration of search and turnover is 
based, implicitly or explicitly, on simpler versions of the optimizing 
model developed in this paper (e.g., Becker, Landes, and Michael 
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1977). In any event, the issue offers a number of intriguing topics for 
future research. 
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Macroeconomic Policy/ Exchange-Rate 
Dynamics, and Optimal Asset Accumulation 


Maurice Obstfeld 

Columbia 


The papei devcloijs a model of cxchangc-raie and riirreni-account 
dcieiininaiiofi lor a small ctonomy peopled hy infiiiitely lived, 
ulility-maximi/mg households. In fhi.s selling, a ccnlral-bank pur- 
ehase of foreign exchange has no real effecis when cenlral-bank 
foreign reserves earn interesl at the world late and the proceeds are 
relumed to the public. In contrast, an increase m the monelary 
growth late does Itavc real effects, even m the long run. I'he model 
(ieveloped heie implies that an increase in gcjvernment spending 
inav lead to a surplus on tuireni actounl. The exiernni ailjuslmenl 
process predicted by ihe model is one in which eoiisuinption, real 
balances, and exlerntil assets all rise or fall sirmillarieously. 


Introduction 

This paper studies exchange-rate deterininatioii and the external 
adjustment process in a small economy peopled by infinitely lived, 
iitiliiy-maximizing households. The selling is similar to one proposed 
by Koitri (1976), in that agents are assumed to consume a single good, 
to hold domestic money, and to have access to a world market in 
consumption loans. Bui the approach departs from those prevalent in 
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recent balance-of-payments incKlels by basing saving behavior and 
money demand on explicit, imertemporal utility maximization.' 

The analytical framework is adapted from the seminal work of 
Sidrauski (1967) and employs the perf'eci-foresighl ec|iiilibrium con¬ 
cept explored by Brock (1974), Calvo (I979rt), and Fischer (1979). In 
contrast to these contributions, however, the model developed below 
allows agents’ subjective rate of time preference to be endogenously 
determined in a manner suggested by Uzawa (1968) rather than 
fixed.This modification permits the small economy we study to attain 
a stationary long-run equilibrium under conditions of perfect capital 
mobility.'' 

The plan of the paper, and its main results, are as follows. Section I 
describes the structure of the economy and the typical household's 
maximization problem. Particular attention is paid to the budget 
constraints of the central bank and government. .Section II closes the 
model by imposing continuous money-market equilibrium and en¬ 
dowing agents with perfect foresight concerning the future paths of 
the price level and government transfer payments. The resulting 
dynamic system possesses a unique stationary position and a single 
convergent path, along which consumption, real money balances, and 
the stock of foreign claims all rise or fall simultaneously. This conver¬ 
gent saddlepath is an equilibrium path and is the f<K'us of the analysis 
that follows. 

Section III analy/es the effects of some unanticipated mac¬ 
roeconomic policy actions, A central-bank purchase of foreign ex¬ 
change is Ibund to ha\e no real effects when official foreign reserves 
earn inteiest that is distributed to the public. In contiast, an increase 
in the monetary growth rate does have real effects: It occasions 
exchange-rate overshooting (in Flood’s (19791 .seiLse) and a tut rent- 
account surplus and leads, in the long run. to higficr levels of con¬ 
sumption and foreign claims. Finally, the model implies that an in¬ 
crease in government spending may induce a suiplus on ciiirent 
account, contrary to the usual presumption. A deficit is possible when 
government spending produces a public good having .i high marginal 
value to consumers. The explicit spccificalion ol agents' objective 


* Rcscaix h al(ing similar linos is icpoiied in jMpois l»\ Culvo {iy70/». 1980), Helpniaii 
(1979), Ra/in (1979), mul SdHkm.in (1989). ObsUcki (19891 tkwiiljos two-tnuntvv 
<^\tensi(>n ol this paper’s nioticl 

^ tnilogenous nine prclorcnce has flisnisscd in the iiaclc Inn atm o in papn s hv 
(^^alvo ami Kiiulliiy (1978) and b\ Vindkn (1978). Kouri (1989) .nKfpis a tonsuiiipiion 
hinciioii suggcstcci hy U/awa's (1998) iheoiy. I he present appubuh is applied in a 
somewhat different context in Ohstleld (1981). 

* T his (ould be accomplished aUo by allowing bonds (in addition to ical balaiiccs) it) 
enter consumers* utility tunctions, as in Calvo (1980). 
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functions is particularly useful in Section III, for it permits a natural 
welfare analysis of alternative macroeconomic policies. 

Section IV contains concluding remarks. Appendices supply the 
technical details justifying some of the paper’s assertions. 

1. The Model 

We consider an open economy inhabited by identical households, 
each maximizing its welfare over an infinite lifetime. As in Kouri 
(1976), there is a single, perishable, consumption good, and house¬ 
holds’ financial wealth is divided between domestic fiat money and 
internationally traded bonds denominated in foreign currency. The 
economy is small, and it can therefore influence neither the foreign 
currency price of the consumption good, P*, nor the world bond rate, 
r, both of which are assumed constant. The link between the domestic 
price level,/*, and P* is provided by the relationship/* = EP*, where 
E denotes the domestic money price of foreign exchange. This ex¬ 
change rate is allowed to float freely by the monetary authority. On 
the assumption that foreigners do not hold domestic money,£ adjusts 
to maintaiti equality between the real money supply and domestic real 
money demand at each instant. We adopt the normalization /**=!. 

The number of households in the economy is, for convenience, also 
taken to be 1. The household’s instantaneous utility is a separable 
function U{c,m) = u(c) + v{m) of consumption, c, and real money 
holdings, m, defined as nominal money holdings, M, deflated by the 
home price level, w ^MIP. T he functions u( •) andu( ) are nonnega¬ 
tive, strictly concave, and twice continuously differentiable, with the 
property that 

lim u'{c) = lim v'{m) — 00 . (1) 

r —*0 

Assumption (1) eliminates the possibility of corner solutions. It is 
clear, in addition, that the derivatives u'(-) and v'( ) are every¬ 
where strictly positive. 

The household seeks to maximize the discounted sum of future 
instantaneous utilities. 


F = f (2) 

Jo 

The discount factor A, is defined by 



( 3 ) 
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where 6, is the instantaneous subjective discount rate at time s* Fol¬ 
lowing Uzawa (1968), we postulate that 8, is a function of utility at 
time s, 

S, = S[t/(c„m,)]. (4) 

and that the function S( ) is positive and satisfies 

8'{U) > 0, 6"(t/) > 0, S(t/) - U8'(U) > 0. (5) 

The first two conditions listed in (5) are in no way implied by the 
hypothesis of endogenous time preference but are imposed because 
they are analytically convenient.’ The last of the conditions ensures 
that the present discounted value U18{U) of a stationary utility stream 
U is increasing in U. 

At each moment I, the household must allocate its income between 
current expenditure and the accumulation of real wealth. Real out¬ 
put, y, is taken to be exogenous and fixed. The other components of 
expected disposable income are interest payments from abroad, equal 
to the world bond rate, r, times the real value F, of the family’s net 
foreign claims; expected real transfers from the government, t,; and 
expected capital gains on real balances, equal to -TT,m„ where tt, 
denotes the expected inflation rate. Defining the household's real 
assets at time t (excluding the capitalized value of expected future 
transfers) as 


a, = ^ + m, + F,, (6) 

r 

the flow constraint linking asset accumulation to saving may be writ¬ 
ten as 


d, = m, + T, — c, — (tt, + r)m,, 


(7) 


while the intertemporal budget constraint takes the form 

/ •oo 

+ (tt, + r)m,]dt ^ a„ + I e ’''T,dl. (8) 

0 •'0 

Restriction (8) is important in the pre.seni setting of perfect capital 
mobility, for it rules out the possibility that the household can attain 
unbounded utility by borrowing arbitrarily large sums in the world 

’ When S, = S, a constant, for all 5, we have the usual discmini factor, A, = St 
“The nature of this convenience becomes apparent in applying the maximum prin¬ 
ciple to solve the household’s lileiime problem and in proving the existence of a st.ible 
perfect-foresight equilibrium path converging to the economy's stationary state. I'hese 
points are taken up again in context. A general discussion of the time preference 
concept IS found in Koopmans, Diamond, and Williamson (1964). 
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capital market and meeting all interest payments through further 
borrowing. As Appendix B shows, (7) and (8) confine the household 
to paths along which 


/ oo 


tor all t, so that household net worth is always nonnegaiive. 

The household’s problem is to find maximizing lifetime 

wellare V subject to the constraints (3), (7), and (8), given an initial 
level rt,i of real assets. The calculations are simplified by changing 
variables from t to A (“psychological time”) using the fact, implied by 
(3), that d\ = 8[U{c,,m,)]dt. The household's problem may then be 
expressed as that of choosing time paths for consumption and real 
balances that satisfy (8) and*' 


maximize 


r 


U(c,m) 

8lt/(c,m)l 


e-VA 


subject to 


da 

dA 


rg + T - c — (n + r)m 
(c,ot)] 


Necessaiy cttndiliotis ftir an optimal program are given by the 
maximtiiti principle.’ These require that r. and m be chosen so that fitr 
each value ot the discount factor A, the Hamiltonian 


„ _ U(c,m) [ra + T - c - (rr + r)m] 

d[U(f,m)] MUic,r„)] 

is maxitnizcd. In (9), \ = is the co-state variable, which is ititer- 
[tretecl as the shadow value, in utility tei tns, of real assets. The implied 
lirst-otder coiidilioii.s (at the interior maximum) are 

(1 — (87S){f/ + Xlra -t- t - t — (tt + r)m]})U,. = X, (10) 


(1 - (578){t/ + Xfiw + T -■ c - (w + r)m]})U,„ = X(7r + r). (11) 

All additional neces.sary contlition is that the shadow price of assets 
evolves according to the law. 



Equations (10) and (1 1) imply the usual necessiiry condition of static 
utility maximization, 

" Ttic |«y( liologKat time subst lipt, A, isomitu-d lor tin.- sake ol notational simplicity, 

’ See Arrow and Kin/ 1970. If U(rjn) is .strictly toneave and both coirsunipiion and 
real balances are normal goods, the conditions listed in (5) imply that the maximancl 
U (c.m)/6[(/(f ,m)] is strictly concave. 
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x(c,,m,) s U,„(ri,>nt)IU,ict,rn,) = tt, + r, 
while (12) implies that the time derivative of is given by" 

- r}. (14) 

It remains to specify the behavior of the government and the 
central bank. We consolidate the balance sheets of these two entities 
and work with a single public-sector budget constraint. A key as¬ 
sumption is that the stock of central-bank foreign reserves earns 
interest at the world rate r. 

The public sector consumes goods and makes (net) transfers, 
financing any deficit through money creation. Thus, all government 
debt is purchased by the central bank, none by private citi/'ens. Let¬ 
ting g denote the (constant) level of real government consumption per 
household, the public sector’s budget constraint isg = A4,/P, + rR ~ t, 
= {MtlM,)m, + rR — -Ti, where R denotes the (constant) level of 
central-bank foreign reserves, measured in terms of output, and t,, as 
before, denotes net real tiansfers. 

We shall assume that the central hank varies x, in such a way as to 
hold the rate of monetary growth M,IM, constant at /c." The level of 
transfers is then determined according to the formula 

Tj = p,m, f - g. (15) 

A litial assumption is that /x is always chosen .so that n + r > 0. 'I'his 
inequality guarantees the existence of a stationary state. 


II. Perfect-Foresight Equilibrium Dynamics 

fhe model of the previous .section is closed by the assumption that the 
path of the economy is a iterfect-forcsight equilibrium patli, m the 
sense of Brock (1974). Let {P,} be a (differentiable) price-level path 
and {P/Tt} = {pe‘"Afo + PirR — Pig} the associated path ot nominal 
transfer payments from the government, where .Wo denotes the 
nominal money stock at t = 0, Acting on the belief tliat these paths of 
the price level and nominal tiansfers will prevail, the household can 
calculate expected real transfers {f,} and determine optimal paths for 

^ Only necessary conditions (10) and (11) tidier horn whai the) would be in the 
standard intertemporal optimi/ation tramcwtn k, whcic 6j — 6, an cxo^ciums tonsLani. 
In the standard setup, (10) is ifplatcd by U, = A. But m the present context, an increase 
in today’s consumption aifc'cts not only today’s instantaneous unlit) but also the dis- 
toiint factor applied to ail utilities enjoyed in the luiure. Note also ih.ii d the instan¬ 
taneous subjective disiouut rate 6 were constant, necessary condition (14) would rule 
out the possibility ot a siattonary stale unless ii so happened that 6 and t w^cie equal. 

” This assumption is crucial tor the toicign-exc hange inierveiuion ’‘neutralit) ” prop¬ 
osition derived below and differs from the one made by Koiin (1976), who rakes 
government consumption to be the residual item. 
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its consumption rate and real balances, subject to (7) and (8) with 
initial real assets Oq = (y/r) + Fq + (Afo/Po)- The path {P*} is a 
perfect-foresight equilibrium path if the desired path of real balances, 
{ot,}, equals the actual path, that is, if 

m, = M,lP, = (16) 

for all t & 0. 

The necessary conditions derived in the previous section may be 
used to find the collection of price-level paths {P/} consistent with 
perfect foresight and individual optimality. The differential equa¬ 
tions (7) and (14), together with the conditions (6), (10), (13), (16), 
and 

ir, = P,/P„ (17) 

must be satisfied by any perfect-foresight equilibrium trajectory of the 
economy. We now investigate the restrictions these requirements 
place on the possible equilibrium paths of bonds F,, consumption Cf, 
real balances w,, and, by implication, the home price level. Recalling 
our earlier convention that the world price level P* = 1, we shall 
identify the path of the domestic price level with that of the exchange 
rate, E. 

Differentiating equilibrium condition (16), we Hnd that desired real 
balances must satisfy the relation 

P(/P, = fi - (rhilm,) (18) 

along an equilibrium path. Combining (17) and (18) with necessary 
condition (13) yields 

rh, = [ft + r - x(c,,m,)]mi. (19) 

The differential equation (19) must govern the evolution of real 
balances in perfect-foresight equilibrium. 

The flow constraint (7) may be written in the form rh, + F,= y + rF, 
+ Tt — Cl — TTim,. When we use (17), (18), and the public-sector budget 
constraint (15), this differential equation becomes 

F,=y+r(,F, + R)-c,-g. (20) 

the familiar relation between the current and capital accounts under a 
floating exchange rate. Equation (20), which equates the home coun¬ 
try’s foreign lending to the excess of domestic income over absorption 
by the domestic private and public sectors, describes the equilibrium 
path of foreign bond holdings. Note that because foreigners do not 
hold domestic money, the variable Ft is predetermined by the past 
history of the current account. It can jump instantaneously only 
through central-bank purchases and sales and must otherwise adjust 
slowly over time. 
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It remains to derive the differential equation governing movements 
in consumption. Using the necessary condition (10), we may solve for 
the co-state variable X, with the help of (15) and (17)-(19); 

X, = k{c,,m„F, + R) = 

_ [ 8 ( 1 /,) - _ 

8(U,) + {y + r{Ft + ft) + [/c -I- r - x(c,,jn,)]m, - c, - g}S'(f/,)n'(c,)' 

Differentiating X with respect to time and applying (14) yields X, = 

+ X^m, + kpFt = k, {8[(/(c,,m,)] - r}, which may be solved for the time 
derivative of consumption, 

c, = X;'(X,{6[(/(c„m,)] -r} - X„wi, - k^F,) = <p(c„m„F, + ft). (21) 


Equation (21) is the final differential equation of a three-equation 
system.'® Together, equations (19)-(21) must describe the evolution 
of the economy along any perfect-foresight equilibrium path. 

Not every path sadsfying (19)-(21) is a perfect-foresight equilibrium 
path, however, for the condidons from which these paths are derived 
are necessary for optimality, but are not, in general, sufficient. For tbe 
balance of this paper, we shall focus auention on paths that bring the 
economy to the stationary state (c,m,F), defined by the conditions 
ifi(c,tn,F 4- ft) = /t + r — x(c,m) = y + r(F + ft) - c - g = 0. As is easily 
verified, the stationary state exists and is unique when, as we have 
assumed, /x + r > 0. 

Appendix A proves that the system (19)-(21). when linearized in a 
neighborhood of the stationary state, possesses a single negative char¬ 
acteristic root, and so a unique convergent path. Appiendix B proves 
that this path is a perfect-foresight equilibrium path: The choices of 
real balances and consumption it dictates are optimal choices from the 
household’s standpoint, given the implied initial value of real assets 
and the implied anticipated paths of prices and transfers. We there¬ 
fore assume that for any value of the predetermined stock of foreign 
claims, the equilibrium price level and consumption rate are those 
placing the economy on the path converging to the stationary or 
long-run equilibrium." It should be noted that this convergent 
saddlepath satisfies the household budget constraint (8), for, along it. 



+ (tt, 4- r)m, - T,]dt = Oo - lim a,e 

f—*ec 


a„. 


( 22 ) 


To find the saddlepath, we let ff, denote the system's negative 
characteristic root and to = [to,,. <^ 21 - - an eigenvector belonging to 


The ecjuaticin givingr cannot be defined if Xp — 0 It is shown in Appendix A that Xp 
< 0 in a neighborhood of the system's suuoiiary sute. In fact, it can be shown that Xp < 
0 globally if hyperdeflations are ruled out. 

" It has not been demonstrated that pcrlcct-loresight equilibnum is locally unique. 
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Any convergent solution of the linearized system must take the 
form 

c, — c = oonk exp 
m, — m = 0 ) 2 ,A exp (8^t), 

F, ~ F = 0 ) 3 ,A exp (0|t), 

where A is an arbitrary constant determined by the value of F at time t 
= 0.'^ Differentiating, we find that 

c, = 6,(c, - c) = 0 ,(o)„/a) 3 ,)(F, - F), (23) 

rn, = d,(m, - m) = 6,{o}i,la> 3 ,)(F, - F), (24) 

F, = e,(F, - FI (25) 

when the economy approaches the stationary state. The saddlepaih of 
the system is simply the intersection of the two hyperplanes defined 
by equations (23) and (24). 

The importance of equations (23)-(25) is that they allow us to 
characterize the motion of consumption, real balances, and external 
asset holdings as the economy converges to long-run equilibrium. As 
is shown itt Appendix A, the components of the eigenvector tU must all 
be of the same sign, and an immediate consequence is that die three 
variables tnust always rise or fall together along the sacldlepath.” This 
is a key property, (or it allows us to trace the path taken by the 
economy in l esponse to various policy surprises. However, one par¬ 
ticular implication of this result is important enough to be mentioned 
before we turn to such exercises. When the current account is in 
surplus {F, > 0), real balances must be rising, and when it is in deficit 
(Ft < 0), they tnust be falling. Thus, the exchange rate’s depreciation 
rate will fall short of the rate of monetary growth when there is an 
external surplus and will exceed it when there is a deficit. This 
fundamental relationship between exchange-rate depreciation and 
the current account appears in models developed by Kouri (1976), 
Calvo and Rodriguez (1977), and others. But it emerges here from 
the hypothesis of optimizing behavior and not as a consequence of the 
assumed properties of consumption and money-demand functions. 

Figure 1 shows how the cconomj)’s long-run equilibrium is found. 
From (14), stationary-state utility 1/ is determined by the equality of 
the marginal rate of time preference and the world bond rate,'"* 


See Hirsch and Sniale 1974. 

C^alvo (l979o) provides a similar characterization of the saddlepath arising in the 
Brock-Sidrauski perfect-foresight model. 

“From (Ity, X = {16((7) - t78'(t7)]/6(t7)}u'(c). The last inequality in (5) therefore 
implies that X is strictly positive. This justifies condition (26). 
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HU) = r, (26) 

and so, stadonary-sute consumption c and real balances m must satisfy 
U(c,m) = U, as shown in the diagram. The second requirement on the 
point {c,m) is that it lie on the locus of points consistent with m = 0-, the 
latter is, by (19), the Engel curve associated with an opportunity cost /li 
+ r of holding real balances. It slopes upward from the origin. Given 
the value of c determined by 0 and /t + r, f is determined by the 
relation E = [(c + g - y)/r] — R. The locus labeled/' = 0 in figure 1 is 
vertical at the unique private consumption level consistent with 
current-account balance given stationary-state bond holdings F. An 
increase in F would increase long-run domestic income and so shift 
this schedule rightward. 

III. The Effects of Macroeconomic Disturbances 

This section studies the impact and long-run effects of intervention in 
the foreign exchange market, an increase in the rate of monetary 
expansion, and an increase in real government consumption. The 
macroeconomic disturbances are assumed to take the public by sur¬ 
prise, but they are permanent and lead to no expectation of future 
policy actions. The economy’s initial position is the stationary state. 


Foreign Exchange Inleri’ention 

The central bank intervenes in the foreign exchange market by pur¬ 
chasing foreign claims from the public with domestic money. I he 
transaction accomplishes a transfer of interest-earning assets from the 
public to the central bank as well as an increase in the nominal monev 
supply. But the ium F + R of privately owned bonds and central-bank 
reserves is unaffected by this transfer. 

Now, equations (20) and (21) involve only the economy's aggregate 
claims on the rest of the world; the distribution of their ownership 
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between the public and the central bank irrelevant. Thus, the 
long-run value of F simply declines fromF toF - Ai? as a result of the 
intervention, while c and in are unchanged. Because the stock of 
privately owned foreign assets is immediately at its new long-run level, 
consumption and real balances must, by (23) and (24), remain at their 
respective stationary-state levels c and m. The monetary authority’s 
action occasions a rise in the price level (a depreciation of the ex¬ 
change rate) exactly proportional to the increase in the nominal 
supply of money. It has no real effects. Money creation accomplished 
through a purchase of foreign exchange has the same impact as a 
“helicopter” money-supply change of equal magnitude. 

It is sometimes asserted that there« a difference between the.se two 
financial policies, in that an official purchase leads to a current surplus 
as domestic residents seek to restore their holdings of external claims 
to the original level.'® The difference disappears here because foreign 
assets held by the central bank continue to earn interest at the rate r, 
while households capitalize all transfers from the government (see 
Mundell 1971, chap. 1). By the government budget constraint (15), 
the stream of interest earnings taken away from the public through 
central-bank intervention is returned in the form of transfers. 
Households therefore suffer no fall in permanent disposable income 
when a portion of their bond holdings is transferred to the central 
bank. 

This neutrality result would clearly break down if, for example, 
reserves were held in some barren, non-interest-bearing form, or if 
increased central-bank earnings were used to augment government 
consumption rather than transfer payments. That intervention has 
real effects in these circumstances is to be expected, for it involves real 
fiscal measures in addition to a neutral change in the nominal supply 
of money. 


An Increase in the Monetary Growth Rale 

Figure 2 depicts the effects of an increase in the rate of monetary 
growth, fi. As (26) shows, the long-run instantaneous utility level of 
the representative household is unchanged by the disturbance. But as 
the rate of monetary growth and the inflation rate coincide in the long 
run, the disturbance raises the opportunity cost of holding money in 
the stationary state. Thus, the household economizes on real balances 
and consumes more in the new long-run equilibrium. This is repre¬ 
sented in figure 2 by the downward shift of the Engel curve along 


'“See, c.g., Kouri 1978. 
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which real balances are stationary. The rise in long-run consumption 
entails a rise in long-run bond holdings. Accordingly, the/" = 0 locus 
shifts to the right, so that it passes through the new asymptotic 
equilibrium point, (c',m'). 

Knowledge of the long-run effects of ihe policy shift allows us to 
infer its short-run effects using the properties of the convergent 
saddlepath described in Section II. Because the eventual level of 
external assets is higher after the rise in /x, consumption must initially 
decline to bring about the implied current-account surplus. And be¬ 
cause real balances rise during the course of a surplus and rh’ is lower 
than m, the policy also entails a sharp initial fall in real balances and so 
a depreciation of the exchange rate. The economy’s short-run, tem¬ 
porary equilibrium at t = 0 is thus at a point like A in figure 2. 

Both consumption and real balances rise from the levels given by 
point A as the stock of foreign claims approaches its higher, asymp¬ 
totic level. This implies, in particular, that the rate of depreciation of 
the exchange rate during the transition will fall short of the higher 
rate of monetary expansion. The exchange rate thus exhibits an 
“overshooting" pattern of adjustment, in that the impact fall in real 
balances exaggerates the long-run decline. This reproduces, in an 
optimizing framework, the result obtained by Kouri (1976), Calyo and 
Rodriguez (1977), and Flood (1979) on the basis of assumed aggre¬ 
gate relationships. 

It is of interest to note that the integral of discounted utilities given 
in equation (2) is lower along the transition path following a rise in /a, 
than it would have been along the original, unperturbed stationary- 
state path. This follows from the last inequality in (5) and ihe obser¬ 
vation that the household's instantaneous utility (alls the moment /a 
increa.ses, rising to its original level only in the long run. Thus, an 
increase in the growth rate of money, in the present setting, has an 
unambiguously negative effect on the wellare ol economic agents. 
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An Increase in Government Consumption 

We analyze, finally, the consequences of an increase Ag in real gov¬ 
ernment consumption. It is assumed initially that government con¬ 
sumption is wasteful, in that it does not enter into the household’s 
utility function. An example in which government spending produces 
a public good complementary to private consumption is analyzed 
next. 

In the absence of any change in private consumption from its initial 
level c, an increase in government consumption would entail a 
current-account deficit at the initial level, F, of foreign assets. This is 
shown in figure 3 by the leftward shift of the F = 0 locus to F' = 0, 
which is vertical at the lower level of private consumption compatible 
with external balance the moment after the increase in g occurs. 

The steady-state levels of consumption and real balances are not 
altered by the government’s action, however, for these variables are 
determined, independently of the level of g, by the two conditions, 
8[f/(f,/«)| = r, x{c,m) = /i + r. It follows that the stock of external 
claims associated with the economy’s new stationary state must be 
higher than the initial stock,/”': In the long-run equilibrium following 
the increase in public-sector expendittire, the private consumption 
level / can be sustained with no net borrowing from abroad only if the 
economy’s privately owned foreign assets rise by the amount Agir, so 
that additional interest payments from foreigners finance the addi¬ 
tional national absorption. 

Because the economy’s adjustment path is noncyclical, the current 
account must go into surplus when the rise in g occurs. Consumption 
thus declines by an amount greater than the increase in government 
spending. And because consumption, real balances, and foreign bond 
holdings rise together along the saddlepath, the exchange rate neces¬ 
sarily depreciates on impact, reducing real balances in the short run 
so that they may subsequently increase toward their long-run level, rh. 


m 
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The point labeled A in figure S again represents short-run equilib¬ 
rium at / = 0. 

When the monetary growth rate, yu, is zero, the government’s action 
is simply a tax-financed increase in government spending. The 
model’s prediction contradicts the standard Keynesian presumption 
that the accompanying short-run fall in disposable income must re¬ 
duce private saving along with private expenditure, pushing the cur¬ 
rent account into deficit. In the present framework, private saving 
must actually increase in order that the economy may attain, in the 
long run, the higher level of national income consistent with the 
predisturbance private consumption level, l.ooked at another way, a 
tax-financed increase in government spending entails a more than 
complete crowding out of private aggregate demand in the short run. 

The preceding discussion has been based on the assumption that 
the level of government spending does not enter the utility function, 
as it would if government consumption resulted in the provision of 
some public good. To illustrate how the analysis might change if this 
assumption were modified, we study a particular example. 

Assume now that the utility function has the form 

U{c,m^) = u(f^) + v(m), Ug, i/,.„ > 0. (27) 

According to (27), public and private consumption aie complemen¬ 
tary goods. The stationary values of consumption, real balances, and 
Ibreign assets are determined by the c(|uations 

8[t/(c,w^bl = r. (28) 

v' (rh)/u,.(c~ y + r, (29) 

^ = [(f+g-y)/r]-«. (30) 

In particular, (28) and (29) imply that r and m are no longer inde¬ 
pendent ofg, as they were in the case of purely wastelul government 
spending. _ 

The long-run derivatives off, m, and F with respect tog are; 

= -iD '(iv'" + r'iy + ')«,»)■ (31) 

^ = »D '(M + (32) 

(ig 

= D '[(«,. - Iig)x'" + i'’{y + >■)(».•,■ - «. J]. (33) 

where ZJ = rfii.iV' + (y + r)w,/''l < 0 and functions beneath oveibars 
are evaluated at the initial long-run ecjuilibiium. Only (32) tan be 
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unambiguously signed under the assumption > 0.'* In particular, 
if the marginal utility of private consumption is sufficiently low rela¬ 
tive to that of public consumption, an increase in g may, as (33) shows, 
occasion a current deficit, contrary to the earlier analysis. In this 
situation, private consumption falls by an amount smaller than Ag in 
the short run; however, there is a decline in long-run consumption 
that exceeds Ag. 

In the event that the level of public-good provision happens to be 
initially at the Samuelson static optimum, so that (32) and (33) 

imply that the increase in g must occasion a current surplus and a 
depreciation, as before. Why must stationary consumption rise above 
c — Ag in this case? If, in the long run, consumption were to fall by 
exactly Ag with m unchanged, the economy would be at its long-run 
utility level U, but with v'(m)lur{c — Ag,g + Ag) < /i. r, in violation of 
(29). Long-run equilibrium thus requires that real money balances fall 
below m and that consumption rise above c — Ag. 

The welfare analysis of an increase in government consumption is 
straightforward. We note here simply that when u^g > 0 and = Ug, 
an incr^.se ing must decrease the discounted sum of future utilities F 
below UIS(U), for it induces a current surplus and so a short-run 
decline in private consumption greater than Ag. The implication is 
that a level of public expenditure optimal from a static standpoint 
may no longer appear optimal once full account is taken of the 
dynamic repercussions of a change. 


IV. Conclusion 

rhis paper has developed an open-economy model based on house¬ 
hold utility maximization over an infinite lifetime. The model answers 
several questions concerning the effects of macroeconomic policies 
when exchange rates float freely. In some cases, the answers coincide 
with results derived previously on the basis of more or less mechanis¬ 
tic behavioral assumptions. But in other cases, the answers contradict 
these results. 

In particular, the model implies that money is not superneutral 
in either the short or long runs and places both the exchange-rate 
“overshooting” hypothesis and the association between depreciation 
and current deficits on a firm microeconomic foundation. However, it 


'* I'he following considerations imply that drh/dg < 0. Because > 0, the increase in 
g would entail an increase in the marginal utility of private consumption at the initial 
level of c. If m were to rise.v' would fall, and (29) could not be satisfied in the long run 
in the absence of a higher stationary consumption level. But with c, m, and g all higher, 
the household's long-run utility level would exceed U, contrary to (28). Thus, in declines 
and v'(m) rises. 
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casts doubt on the validity of inferences drawn from the Keynesian 
consumption function which, in some form, is central to the classic 
Mundell-Fleming model and most subsequent work. In addition, the 
approach highlights the importance of the government and central- 
bank budget constraints in discussions of the effects of hnancial policy 
measures. 

Appendix A 

This appendix provides the proof of local saddlepath .stability for the system 
(I9)-(2I) and characterizes the unique path leading to stationary-state 
equilibrium. 

The model involves two state variables, c and m, which may take (unantici¬ 
pated) discrete jumps, and a third state variable,T, which is predetermined by 
the past history of external asset accumulation. Thus, to prove saddlepath 
stability, it must be shown that when linearized about its stationary position, 
the differential equation system, 

. _ X{6[u(c) ti(TO)] - r) - \J_n 4 r - x(c,w)]>w - X,[y + r(F + R) - c - g] 

‘ K 

= i//(c,m, F + R), 
m = [/ji + r - x(c,m)]m, 

F = y + r{F + R) - c - g, 

possesses two positive characteristic roots and one negative characteristic root, 
implying the existence of a unir^ue path converging to long-run equilib¬ 
rium. Recall that 

^ ^ _ {S - [u(c) + t (w)]6'}a' __ 

6 -t- {y + r(F + R) + Ifi + r - x{c,m)lm - c ~ g}S'u' ’ 

x(c,m) ^ v'(m)/u'(c). 

At the system’s long-run equilibrium (c.rn.F), we have c=m-F=a = 
S[u(c) + v(tn)] — r = 0. and, linearizing in a neighborhood of this point, the 
system becomes 

c = iji^c - cl -I- ^„(m - m) + ^fiF - F), 
m = -x,.m(c - c) - x„m(m - rh), 

F = -(c -c)+ r(F - F). 

(Functions beneath overbars are evaluated at [c,mf].) Using the definition of 
long-run equilibrium, we find that 

ifi,. = (\/\,.)6'u' -t- {k„IK)^r’n -I- 

iii„ = (klK)h'x’' + (\JK)x„m, , 
i/t/' ~ —(Xis-fX^-fr, 


rs - ( u -1- u)fi'l[^" + 5'(«')'‘Xr'»] - (li + il(u')*^' 


where 


(Al) 

(A2) 

(A3) 
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[8 - (tl + i?)8'|(8'u'x„m — S'v'yH' — {u + v)u'v'66" 
8 ^ 

- (u + v)S']S'(u'f ^ y 
6 


(A5) 

(A6) 


Now Y, = —7‘'(m}u"(c)/lu’(e)p > Oandx„ = v"(m)hi'(c) < 0; and since U ^ 0, 
8 ~ LIB' > 0, and 8" > 0 by a.ssumplion, 

X™ < 0. (A7) 

rurther, substituting (or x,. in (A4) yields 

A _ [8 -- (u + f<)8'l(8 — B'v'm)u" - (u + v){ii'fSB" (A8) 

' 8 ^ 


Because a( ) and t)( ) are nonnegative and concave, we have, using (5), 8 - 
B'v'rh > B — eS' > 8 — (u + u)8' > 0, and (A8) implies iirunediately that 

A,. < 0. (A9) 

We note, finally, that because A ~ {8 — lu(<) + p(m)]8'}tt'/S, (Al) and (A6) 
imply that 

1^,. - (A,„/A,)x,i/i > 0, (AID) 

Using (A 10), the lineati/ed system becomes 


c 


A„,x,.ot 

A6'f'' -1- \„,x,„m 

-A.r1 






K 

K 

X. 



— r 

m 


~x, in 

-x„m 

0 


m 

- w 

f 

L J 


- 1 

0 

r 



- p. 


By (A 10) and ilie fact tbati,,, < 0, ilie irate of the system's matrix is positive. 
SiiKc ibis trace is also equal to the .sum o( the system's chaiactci islic roots, 
there must be at least one root with po.silive real part. 

On the othei hand, we may write the .system's determinant as 

= (Xrarn(Ac/Ar)r 1 + r(x, m){kB'v' -t- k„x„m)lkr - r(x„,m)(A„/Ar)x, m 
= |.r,„t«(A,./A,)rJ + r(x,.m)(\B'{'')/\,. < 0. 

Betause if is the product o( the system's characteristic roots, it must be the 
case that there are either three roots with negative real part tir one negative 
root. (Recall that complex roots must occur as conjugate pairs.) Since we 
already know there cannot be three roots with negative real jiart, there must 
lie exactly one negative root. J'his proves that in a neighborhood of long-run 
equilibrium, out system is .saddlepath-stable. 

It remains to provide a cbaracterination of the unitjue stable path. I.et 6, 
denote the system’s sole negative root, and let cS = fti>,„< 02 i,ta 3 ,J' denote an 
eigenvector belonging to 0,. As was shown in the text, c, m, and F must rise or 
fall together along the saddlepath if the components of to are all of the same 
sign. We now prove that this is the case. 

First, we note that if one of the components of 3 is zero, all must be zero, 
and so all components must be .strictly positive or negative. To .see this, 
suppose to,, = 0. Then, using the last equation of (Al I), -f ru„, = 0t<o„,, 
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and so ( 0^1 = 0. But using the second equation or(Al 1), we find that = 9 as 
well; and so cu = 0. Similar at ji;uincnls dis|X)se of the possibilities that oij, = 0 
and that tt).,, = 0. 

So suppose aini > 0. From (A 1 1), —to ,, + rw-,, = < 0, and so it must be 

the case that co,, >0. Again using (A11), we see that (-x^mlo),, + = 

61|IU2|, implying coji = [(-x, m)/(<f i + x„>n)]a»i,. Thus, it must be true that a> 2 i > 
0. In this case, thercldre, all comptments of e> have the same sign. 

In the case < 0, an identital argument shows that ta,,, oij, < 0 as well 
This completes the pnxif. 


Appendix B 


This appendix contains a proof that the unique convei gent path of tlie system 
(19)-(21) isa perfect-foresight equilibrium path. The minor technical issue in 
the proof is that the state variable a, may assume negative values, so that the 
suriiciency theorem lor optimal controls (sec .Arrow and Kuiv 1970, p. 49) 
appears inapplicable. We show that the intertemporal budget constraint (8) 
implies thatic/ must be bounded from below when consumers expect die path 
ol prices implied by the convergent solution to (19)-(21). 1 his fact, in the 
present context, implies that the sufficiency theorem remains applicable. 

Substituting the flow constraint (7) into (8) and inicgraling by parts, we 
obtain the inequality 

liin (i,f 0. (BI) 

Camdition (Bl) may be used to prove the inecpiality 


u, c'"* 0, for all (, (B2) 

where hatted values are those expected to prevail along the convergent path. 
.Assume (B2) lads at time !'■ Integration of (7) implies that lor I > «, must 

satisfy 


— ^ C) 


■/: 


" fr, • c, - (7T, -F I 


(83) 


for any path of toiisuinption and real balances. .Multiplying both sides of (B.3) 
by p " and taking limns as / —► ^ yields the equality 


lim (t,e " = e 




'It, - r, (it, + r)m,l(f.v 


( 84 ) 


Because coiisiiinptioii and real lialances are noiiiiegative. the right-hand side 
of (84) must be negative if (82) is false at t'.” This contradicts (Bl), showing 
that (B2) must hold at each instant. 

Define 


h = 



The number k is finite as a result of the boundedness ol the sequence {t,} 
along the convergent path. From (82), 

(i, + 0, lor all /. (B5) 


''Our assumptions ensure that if -r > > 0 at all times 
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It is now straightforward to apply the sufficiency theorem. Redefine the state 
variable for the household’s problem to be w,^ a, + k. The flow constraint (7) 
becomes ri), = rw, + f, — c, — {tt, + r)m, — rk. The differential equation system 
(I9)-(21) is unchanged as a result of this modification, and along its conver¬ 
gent path, 

lim = 0, lim XaC“‘‘ ^ 0. 

A —kga A —*00 

Because negative values of the state variable are infeasible, the sufficiency 
theorem implies that the choices of consumption and real balances implied by 
the convergent path are indeed optimal choices, given the associated paths of 
the price level and real transfer payments. The convergent path is therefore a 
perfect-foresight equilibrium path. 
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Trade Unionism: The International 
Typographical Union 
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I Ilf wages and employment ol' typographers are examinetl to see 
wlicthei they can he useliilly eliaraeteii^ed as the outcome ol a 
piocess hy which the nmon maxinn/es an ohjettive riinetion coii- 
tannng wages and employmeiu and is constrained hy a trade-oil 
hetween these tw'o variahlesas represented hy the employei’s labor 
demand function. Our lunctional form assumptions permit investi¬ 
gation of some familiar special cases of union Iteliavior. I he 
paranietei implications ol Ixith the wage bill maximisation hypothe¬ 
sis and tfie lent maxnni/alion hypothesis provide interior expla¬ 
nations oi the moyement ol wages and eniploymem ol tlu'scwoi kcis 
compared with our inoie geneial formulation. 


I. Introduction 

Although empirical studies tneasuritig the effects of trade utiions 
have multiplied in the last few years, the behavioral underpinnings of 


We arc iticlebtcd to James N. Ros.se lor shaiing with us his deep knowledge of the 
newspaper iiidiisiry. Also, we have tienelitcd Iroiii coiiiiiientson a preliminary diatl ot 
this papei icceived from Oi ley Aslierileltci, Mciiry Karlier, Michael Hurd, Katfiie 
Krumm, I'om MaCurdy, Rolierl Michael, Daniel Mitchell, Sam Pell/man, Robert 
Solow, Finis Welch, and an anonymous referee of this Journal. Support fiom the Sloan 
Foundation lo the Dcpartiiieiit of Economics at Stanford University is gratefully ac¬ 
knowledged. 

\Journnl of f^olutral Ffonomy, 1981, vol 89. no (>) 

1981 by I he University ol Chicjgo «a22-8H98/HI/H906-(MM)5$«l **>0 


11 Gil 



TRADE UNIONISM 


i,(>3 

unionism remain poorly understood. This is because the issues raised 
in the literature on trade union goals and wage determination have 
been mired in fruitless methodological debates or simply have not 
been pursued at an empirical level. Consequently, although a variety 
of plausible models of'unionism have been proposed, an evaluation of 
them in terms of their correspondence with observed behavior would 
rest today on barely more reliable information than was available to 
Dunlop (1944) when he wrote his seminal work almost 40 years ago. 
This dearth of research is all the more remarkable when contrasted 
with the abundant theoretical and empirical work by economists on 
the behavior of fiirns and of families. And yet, as Farber (1978) ably 
demonstrated with his analysis of the Dnited Mine Workers, there is 
nothing about the iratlc union as an institution that makes its central 
features impossible to characterize in a framework analogous to pur-' 
posive models in economics.' 

The model of the trade union analyzetl in this paper is by no means 
original; it is usually associated with the work of Kellner (1947) and 
Cartter (1959), among others. It characterizes the union as maximiz¬ 
ing a function in which wage rates and employment are arguments 
and as being constrained by a trade-off between these two variables 
repte.setited by the employer’s labor demand function. I'he novel 
feature of this paper is that we have identified a setting that seems 
suitable for the operation of the model and have estimated the wage 
and employment functions that re.sult from this characterization. 

I’hcrelbre, this represents the fit si empirical application of a model 
that has been widely discussed in the literature. Moreover, several 
models of trade union behavior are nested as special cases in our 
framew'ork so that we are able to provide formal tests of some populai 
hypotheses. Our purpose is not to champion a specific model but 
simply to examine whether, in fact, the particular puiqwsive model 
that is common in the literature provides a viable framework for 
analyzing union behavior. Only through a careful documentation of a 
number of empirical studies wtll the profession be in a position to 
evaluate the relevance of this model. 

In this paper we first describe some basic features of the newspaper 
industry and of the International Typogiaphical Union (I 1 V), which 
is the union with which this paper is concerned. .As is well known, this 
is a powerful, closed-shop union that ojierates in an industrv whose 
technological features have changed considerably over the last 10-15 
years. This paper focuses on a period (from 1946 to 1965) before the 
drastic changes in the nature of newspaper composition, and during 

' This opinion differs from ihe one widely held ih.n, as espressed In Rcder (1952). 
“. . . the behavior of linns .ynd ronsiiiners can be e.isilv iiiierpieled a.s 'maximizing 
while that ol a union cannot” (p. .S t). 
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these years the model is less likely to have experienced structural 
change. We provide reasons for believing that bilateral monopoly is 
not an apt description of the market for typographers, but we cannot 
claim that our results vindicate this judgment since a bargaining 
model is not outlined and the estimates are interpreted in terms of the 
Fellner-Cartter model. Since this model seems to have escaped formal 
de.scription, we offer such a description in Section III and also intro¬ 
duce our particular functional form assumptions. In Section IV we 
present our experiences with estimating the model, first with an 
excellent data set from Cincinnati and then with data on seven other 
unions. Some conclusions follow in Section V. 


II. Institutional Setting 

The organization, structure, and traditions of the ITU suggest several 
features relevant to an analytical model of union behavior. The 
structure of the I I'U parallels that of the printing industry, which 
mainly consists of geographically segmented product markets. Today 
there are approximately 1,000 chapters of the union which negotiate 
individual contracts with local printing establishments, the most im- 
poi tanl being newspaper firms. In contrast to many industrial unions, 
the membership of the ITU is remarkably homogeneous. All mem¬ 
bers undergo the same training, there is no observable prestige dif¬ 
ferentiation, and education levels and demographic characteristics 
are quite uniform.^ 

Officers of the II U on both the national and local levels appear to 
be extremely responsive to the wishes of their constituent typog¬ 
raphers, and the union is frequently described as the most democratic 
trade union in the United Stales. Two active political parties compete, 
and a rather frequent turnover of officers occurs through the electoral 
process.^ The union leaders have few incentives to entrench them¬ 
selves in office. Financial remuneration, even at the highest level, 
differs little from journeyman wages, and local officers are often 
workers who perform union duties in their spare time. Negotiated 
settlements are as likely to affect their own financial status as that of 

' l.ipset. Trow, and Coleman (1956) wnte that . . all of the member.s of the I TU 
share a roughly common income and status. . . . The I TU is thus, insofar as this is 
possible, a community of equals, and there is consequently no ‘underprivileged’ group 
in the unions ... all subskills contribute roughly equal proportions of aaive leaders and 
one finds no tendency for a high-status group to monopolize and dominate internal 
politics" (pp. 142-43). 

* Inde^. a survey of the largest ITT local, New York no. 6, revealed that over 
one-half of the mentbership held some union office at one time or another (see Lipset et 
al. 1956). Kelerenda are conducted on all sorts of issues, with voter participation 
reaching 60 to 70 percent even when the outcomes have marginal impacts on individual 
members. 
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the “rank and file.” The dichotomy between the interests of officers and 
of members that assumes importance in some unions appears less 
relevant to the ITU. Local contracts are constructed and negotiated as 
prescribed by union laws which have existed for decades and which 
require membership participation and ratification at each stage. 

Over the years, the central organization of the ITU has been con¬ 
cerned with broad issues which have relevance to the union as a 
whole.'* In contrast, issues concerning remuneration are negotiated at 
the local level, and the wage scale negodated is applied to all jour¬ 
neyman printers. There is evidence that the regular employees are 
not indifferent to the welfare of the “marginal” or substitute workers.’ 
Indeed, as early as 1890, the general laws of the ITU contained 
provisions which attempted to distribute work evenly to all em: 
ployees.* More recently, the practice has been to fill all vacancies from 
substitute lists of irregularly employed workers. 

Therefore, the ITU is a union with a remarkable democratic tradi¬ 
tion, with no clear divergence of interests between different groups 
within the union, and with an apparent concern both for the em¬ 
ployment of its members and for their remuneration. A further 
attribute deserves mention; Although strikes in the newspaper print¬ 
ing and publishing industry as a whole are not uncommon, those 
involving the ITU are infrequent.^ Indeed, for the IFU locals whose 
data we use below, no strikes occurred during the 1946-65 period.* 
The relative unimportance of strikes is consistent with those models 
that account for strikes in terms of incomplete information either 

* I'hesc have iocused on the union's jurisdiction over printing txcupations and the 
proprietary right of members to employment. Most rccenily, the ITU has resisted the 
adoption of new technology which permits electronic and rompuierizcd composition of 
newspaper product except in cases where junsdictioii over new skills and mainirnance 
of employment is permitted (see Kelber and Schlesinger 1967). 

‘ For instance, all time accumulated in excess of the amount of hours constituting a 
regular daily or weekly shift is posted as overtime. When this overtime becomes 
equivalent to a day’s pay, the regular is requited to lake a day oil and engage a 
substitute as long as one is available. This philosophy of “work sharing" probably 
reached a zenith during the Great Depression when a national convention adopted a 
referendum vote approving a law directing all situauon holders to put a subsuiute on 
work at least I day per week. 

“See International Typographical Union 1964, pp. 312-13. Union membership is 
sometimes viewed as a legacy to be passed fiom generation to generation so that family 
ties among the union members provide a further reason for the typical jouinevman 10 
take account of the employment effects of his union's policies (sec Lipset el .il. 19.56, p 
206). 

’ For instance, in 1945 (the year prior to the period examined below) only 32 of 1,000 
locals engaged in any strikes, and all but one of the struck hrms resumed publication 
within a few days. This information is from the industry's weekly magazine. Editor and 
Publisher (1946, p. 76). 

•Therefore, it involves not too drastic an assumption if the costs of negotiation and 
the role of strikes which sometimes figure in models of union behavior (see Mitchell 
1972; Atherton 1973) are neglected in this study. 
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within the union or between the union and the employer: Rates of 
concession are unknown and the negotiation process reveals infor¬ 
mation about what to expeci from the opposition. However, in the 
newspaper industry, negotiations have been taking place between the 
same unions and newspaper firms for many decades, the rfU trade 
journals regularly publish wage scales for each of the 1,000 locals, and 
what is sometimes described (Reder 1952, p. 39) as a “mature” collec¬ 
tive bargaining relation obtains. 

The rarity of strikes reflects in large part the relative vulnerability 
of the local newspaper firm to a prolonged interruption of produc¬ 
tion. In particular, the firm’s product, the prompt delivery of up-to- 
date information, cannot be stockpiled to withstand a shutdown so 
that advertisers and subscribers are apt to turn to alternative media as 
a result. Once a firm closes down, it is expensive to regain former 
customers, especially advertisers, who provide an average of about 75 
percent of all newspaper revenue and who can turn to ready alterna¬ 
tives (television, weekly newspapers, direct mail, billboards, radio) for 
their purposes. In contrast, the i rU maintains a defense fund to aid 
striking local unions, and members have the opportunity to work 
elsewhere in the event of the newspaper firm being driven out of 
business. Hence, with the newspaper firm facing a number of im¬ 
mediate competitors in its product market and with the typographers 
able to draw on a sizable strike (und and to arrange for work in other 
cities, the setting is not a confrontation between two equally powerful 
adversaries: The union is clearly in a dominant position vis-a-vis the 
firm. I’his provides .some support for interpreting the pattern of 
typographers’ wages and employment over time as tracing out the 
union’s preferences. 


III. The Conceptual Framework 

A General Statement 

From a description of the institutional setting, we turn now to a 
formal statement of the union maximizing model. The union is 
characterized as behaving as if it possesses a twice continuously differ¬ 
entiable, strictly concave, objective function. 



where w measures the wage rate, p the price level of commodities 
consumed or the “cost of living,” and L union employment. This 
union is assumed to be involved in producing such a small part of the 
economy’.s total output that any effect of its decisions on the overall 
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price level {p) may be neglected. Hence/> is taken to be exogenous to 
the union, while id and L are endogenous. The union consists of many 
members so that an issue arises of whose utility function is repre¬ 
sented by equation (1). This has been a recurrent theme in the 
literature on trade union goals, though formally the problem is no 
different in kind from economic models of family decision making 
where the family consists of more than one individual or from the 
theory of the firm where there is more than one owner.® The typical 
ITU local has already been described as one with an unusually 
homogeneous membership and where conflicts of interest between 
the union leadership and the rank and file are not pronounced, 
therefore, the objective function above is “the” union leader’s, and 
“he” is assutned to integrate the welfare of all the union’s members. 

The employer’s cost-minimizing labor demand function is given by- 

/- =/(iD,r„r„.. .,r„,X), (2) 

where r, represents the unit price of input i and X is the level of output 
produced with n + 1 inputs by the firm.'® As is well known, L is a 
negative function of w, L must be increasing with respect to at least 
one other input price, L increases with output (provided labor is not 
an inferior input), and this labor demand function is homogeneous of 
degree zero in all input prices. 

'I'he union is assumed to .select w and /. to maximize eiiualion (1) 
subject to the labor demand function given by equation (2). This 
yields the following first-order condition: 



dw ' 


( 3 ) 


where g| = dUld(wlp) > 0 andgj - dUldL > 0: The marginal rate of 
substitution of wages for employment in the union’s objective func¬ 
tion equals the cost-of-living normalized slope of the employer’s labor 
demand function with respect to the wage rate." This first-order 


“ The problems involved m .iggregalmg over preterenres .ire disiussed in Samuelsoii 
(1956) and Bcckei (1974). A latioiiali/aiioii ot eq. (I) in lerins at a . union as a 
political inslilulion run by a bureaucracy whose goals arc nol necessarily idenlical wiih 
those of ihe incmbership” appears in Kolown? and foiies (1974). 

‘"Our assuniphon wiih the newspaper firms below is that their output is determined 
primarily by the demographic char.ittciislics ol the Icxal media market which are 
exogenous lo a single firm. In fact, we mvestigaie below- the possibility’ that output 
should be treated as endogenous (see also Rosse 1970 and Dertouzos 1979), 

“ As is well known, this tangency solution between the union's indifference curse and 
the slope ol the firm's labor demand funtlioii tioes not lie on the comrati turve (see, 
e.g , Leoniicf 1946 or Fellner 1947). Phis feattiie may lie unrcalisttr in some 
emplover-union sellings, but it is nol clear that 11 is so lot typrigraphers. 1 he second- 
order condition concerns the relative rates at which the union s isoquant and the 
employer’s labor demand lunction arc changing: p ‘g-,, + 2p 'gn(SJIdxv) - 1 - gt(d’f/dxx'^) 
+ ga{dffdu>)* < 0 . 
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condition together with the employment demand function (2) may be 
solved to derive the reduced-form equations of the system: 

w = <i){p,r„r 2 , . . . ,r„,X) (4) 

and 

L = (5) 

Since an increase in the cost of living reduces the real value of the 
wage rate without disturbing the employer’s structural labor demand 
hinction (2), the effect of an increase in p on the wage rate will be 
opposite in sign to its effect on employment. Other than this, without 
further restrictions on the form of the union’s objective function and 
the labor demand function, these wage and employment equations 
are devoid of any unambiguous sign implications.*^ This dearth of 
qualitative predictions seems to have escaped notice in the extensive 
literature on this simple maximizing model. 

If the reduced-form equations (4) and (5) are substituted back, into 
the union’s objective function (1), an expression is derived that relates 
the trmximum value (call it K) of the union’s objective function to the 
exogenous variables of the system: 

y =^V(p,r„n,...,T„,X). (6) 

Through the envelope theorem it is straightforward to show that V 
decreases with respect top and with respect to the price of any input 
that is complementary with labor and that V increases with respect to 
output and with respect to the price of any input that is substitutable 
for union labor. Moreover, the trade-off in the union’s optimizing 
objective function (6) between increases in the prices of any two inputs 
is simply represented by the ratio of the derivatives of the employer’s 
labor demand function with respect to each of the input prices:'® 
(dy/dri)/(dyidrj) = (df/dr,)/(df/dr,). Since these derivatives of the 
labor demand function are related to the question of the degree to 
which this labor is substitutable for (or complementary with) other 
inputs, this illustrates the relevance of the production technology in 
constraining the union’s opportunities. 

The sort of resti ictions on cqq. (1) and (2) that generate refutable sign implications 
may be illu.strated a.s follows. If eq (1) is strongly separable in wages and employment 
(or, less restrictively, if gij ^ 0), if labor is not an inferior factor, and if an increase in 
output does not reduce the slope of the labor demand function (i.e., if d'fldwdX S' 0), 
then an increase in output or an increase in the price of a substitute input results in a 
higher optimal money wage rate. 

“Also (aF/ar,)/(dF/3X) = (a//3r,)/(d//dX). 
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Particular Functional Forms 

In order to implement this framework, at some stage in the analysis 
some explicit functional forms have to be introduced. These lunctions 
should be consistent with the institutional features of the newspaper 
industry, they should be straightforward to interpret, and they should 
be computationally manageable. The approach in this paper is to 
spiecify particular forms for the union’s objective function ( 1 ) and for 
the firm’s employment demand function ( 2 ) and then to use the 
first-order condition (3) to solve for the corresponding reduced-form 
system (4) and (5). Thus the union’s objective function defined over 
real wages and employment is specified as follows; 

This function, the Stone-Geary, claims an excellent pedigree in em¬ 
pirical work on esdmating systems of consumer demand equations, 
though this represents its first application to union behavior. In fact, it 
possesses a number of attractive features for our purposes. Though it 
is not necessary to adhere to this interpretation, following Samuelson 
(1948), y and 6 may be thought of as “reference ” or “minimum” or 
“necessary” values with the function being homothetic to the point 
(y, 8 ), and {wip) - y and - 8 represent “supernumerary” real wages 
and employment, respectively. The value of 0 indicates the relative 
importance attached to supernumerary wages versus supernumerary 
employment; only when y = 8 = 0 may 0 be interpreted as measuring 
the relative weight of wages and employment in the union’s objective 
function. 

One appiealing feature of equation (7) in this context is that it nests 
other models of union behavior as special cases. For instance, Dun¬ 
lop’s model of “maximization of the wage bill for the total member¬ 
ship” (1944, p. 44) is equivalent for the closed-shop I FU to specifying 
6 = .5 with 7 = 8 = 0 in the objective function.Or the rent 
maximization hypothesis (Rosen 1970; de Menil 1971) implies 9 = .5, 
8 = 0, and 7 equal to the competitive wage. Dynamic elements may be 
introduced by relating 7 and 5 to lagged values of real wages and 
employment .so that the union’s objectives are those of wage growth 
and employment growth. Where a union designs its wage policy with 

Clearly, with y and 5 equal to zero, a value ol » of 1 yields an objective funriion that 
consists simply of the wage rate (a model of union goals .sonieumes attributed to Simons 
[1944]), while a value of of 0 implies ih.it only cmplov mein matlci s 

” Cf. “The bargaining piotess is almost alwavs directed toward an upw.nd or down¬ 
ward change in a pre-existing wage rate and the gre.atest iiilei e.st often centers on tlie 
magnitude tif the change" (Ross 1948, p. 46). formally, this particular extension of the 
.Stone-Geary was propo.sed and analyzeef by Stone tl966) .ind Poliak (1970) 
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reference to what another union has attained or to a nonunion wage, 
then y assumes the value of this comparison wage rate.'* Hence the 
Stone-Geary function can accommodate a number of alternative 
hypotheses that have received attention in the literature on union 
objectives. 

The union’s opportunity set is defined by the newspaper firm’s 
labor demand function, the expression for which is as follows: 




( 8 ) 


where, as before, the r’s indicate the prices of nonlabor inputs and X is 
the firm’s output. The negatively sloped labor demand function re¬ 
quires «, < 0 and, if labor is not an inferior output, then a-^ > 0 . 
During our estimating period (1946-65), there was considerable 
merger activity in the newspaper industry;so, to allow for such 
mergers causing a shift in the labor demand function, equation ( 8 ) 
includes a dichotomous variable, D, that lakes the value of unity when 
a newspaper merges with another. This parameterization is consistent 
with a merger resulting in an entirely different cost function for the 
newspaper, different in its intercept and different in its slope with 
respect to each of its arguments. This linear-in-the-parameters labor 
demand function, equation ( 8 ), generates tractable estimating 
equations and yet conforms to what is known about the production 
technology in the newspaper industry.'* 

The hrst-order condition fora maximum of (7) subject to equation 
( 8 ) is tf/(l - ^) = ~{(ui - py)ai]/[(L - S)r,], which may be solved for 
the reduced-form wage equation expres,sed in real terms as 




(9) 


where /3„ = (1 - ff)y, Pt = ffa, '(S - ««), Pi = P 3 = ', 

and 184 = Correspondingly, the reduced-form employment 

equation is 

L = p3 + + /3hX + (10) 


I'he impoitancf ot such comparisons in af'fecnng the goals of trade unions lias 
been stressed by Ross (1948). Again, formally it involves a siinilai sort of extension of 
the Stone-Geary function as that analyzed by Poliak. (1976). 

'' Almost 8 percent ot all cities with daily newspapers had more than one newspaper 
in 1948. By 1968 this had fallen to less than 3 percent (see Dertouzos 1979). The effect 
of a newspaper merger on union preferences was also investigated, atid the results arc 
reported below. 

More information on the newspaper production technology is contained m Rosse 
(1970, 1977). Also estimated with the Cincinnati data were lalxir demand functions 
whose slopes and intercept changed with a merger, but the results did not differ 
materially from those reported below. 
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where fir, = «„ + 6(8 - ao),/3g = «,>(! - 6 ), = (I - 6 ) 02 , fin = (1 - 

0 )a2, and /Sg = (1 — Observe that, since 0 < 8 < 1, a;, > 0, and a, 

< 0 , these equations imply a positive correlation between wages and 
employment, on the one hand, and firm size, on the other hand.'“ 
These particular functional form assumptions yield reduced-form 
employment and real wage equations that are homogeneous of de¬ 
gree zero in/), r,, and rj. By substituting equations (9) and (10) into the 
union’s objective function (7), we derive an expression that relates the 
maximum value of this function to the exogenous variables of the 
system: 

V(p,ri,r2^JD) = 

- ^ -j [axyp + («0 - S)r, + + aiiXr, -I- a 40 r,], (11)' 

which is also, of course, homogeneous of degree zero in p, r,, and rj. 

With the equations of the system described, we turn now to the 
estimation of the model, first with a data set on the ITU in Cincinnati 
and then with other union ItK'als. 

IV. Empirical Results 

Estimating Methods 

The structural parameters of our system were estimated by two pro¬ 
cedures. file first exploits the fact that the cost of living,/), enters the 
system through the union’s objective function (7) but is excluded f rom 
the firm’s structural demand equation ( 8 ). Thus, using/) as an instru¬ 
mental variable to predict values for (ic/r,), a conventional instru¬ 
mental variable procedure was applied to equation ( 8 ) which provides 
estimates of the a’s. These a’s thus estimated were then imposed in 
estimating the reduced-form wage cqu.iiion (9) which yields point 
estimates of the parameters of the union’s objective function (namely, 
8 , y, and 8 ). The second estimating procedure involves the simultane¬ 
ous estimation of die stochastic fbi 111 of equations (8) and (9) by full 
information maximum likelihood.'^" Conditional on the specification 
of the model being correct, this estimator is consistent, asymptotically 
efficient, and asymptotically normally distributed. In fact, the point 
estimates derived from applying the two procedures were invariably 
similar, and consequently only the maximum-likelihood estimates are 
reported. 

‘•For ovidence on the ass<x:ialion b<-tweeii wages and (inn size in ihe newspaper 
iiidnstiy, sec Dcitouzos (1979). 

■"> In parliciilai, we employed ihe Beindl, Hall. Hall, a«d Hausman (1974) eompma- 
tional pifK'ednre. 
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These procedures were first applied to an excellent data set collected 
lor a newspaper, the Cincinnati Post, for the years from 1946 to 1965.** 
In estimating the stochastic form of equations (8) and (9), w, is 
measured by the hourly wage scale for journeyman printers em¬ 
ployed by the Post in year t, Z-, by the number of full-time typogp"aphi- 
cal workers in the Post composing room,/), by the BLS consumer price 
index, X, by the amount of advertising linage (in thousands) sold 
annually by the Post, r, by the price of newsprint (in hundreds of 
dollars per short ton) quoted in national markets, and by the BLS 
wholesale price index f^or machinery and equipment.** In 1958, the 
Post merged with one of its two local competitors, the Cincinnati 
Times-Star, and, as a result, a larger and more efficient plant was 
occupied. The ITU members of the Times-Star'% chapel were absorbed 
into the Post composing room so that the employment of typog¬ 
raphers rose from 145 averaged over the years 1946-57 to 232 
averaged over the years 1948-65. Thus the merger dummy variable, 
D, in equations (8) and (9) takes the value of zero from 1946 to 1957 
and of unity for the later years. The full information maximum- 
likelihood estimates of the structural parameters of equations (8) and 
(9) are given in row a of table 1. 

With respect to the parameters of the union's objective function, 
the estimate of 6 is significantly less than .5 and greater than 0; the 
union values both employment and wages. The estimate of y is 
significantly greater than zero, suggesting that it is not merely the 
level of wages that matters to the union but, rather, the excess of 
wages over some reference level. In fact, the estimate of y of around 
2.1 ranges from 26 to 47 percent below the real wage actually received 
by these workers over these years. In contrast, the point estimate of 6 
is negative though insignificantly different from zero. According to 
the estimates in row a, the elasticity of the maximum value of the 
union’s objective function with respect to wages (.55) is slightly greater 
than the elasticity with respect to employment (.45) when evaluated at 
the sample mean values of the variables. The implied elasticity of 
substitution of wages for employment at this point is .69. Using a 
conventional likelihood ratio rest, the null hypothesis that the non¬ 
wage arguments of the labor demand function (namely, r„ X, and 
D) are jointly insignificant in the real wage equation (9) may be 
rejected at the 5 percent level of significance. These results are con- 


In the course of performing consulting activities for the Cincinnati Post, one of us 
obtained employment data normally not available through public sources. 

” In some estimates, the machinery price index was replaced by a measure of the 
user cost of capital, but the results were essentially unchanged. 
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sistent with the notion that the union takes account of the employ¬ 
ment effects of its wage policy. 

As for the parameters of the labor demand function, the implied 
wage elasticity of employment ranges from about —1.8 to —1.0 with 
an elasticity of —1.23 at the mean values of the variables.The 
elasticity of employment with respect to output is estimated at .38 at 
the mean values of the variables.^'*The estimates imply that newsprint 
and machinery are each substitutes for labor for all observed values of 
the variables. At sample mean values, the elasticity of the demand for 
labor with respect to the price of newsprint is estimated to be .77 and 
that with respect to the price of machinery to be .46.“ The estimated 
coefficient on the merger dummy (d^ = 87.9) is virtually identical to 
the difference between the average number ot typographers em¬ 
ployed after the merger and before the merger (namely, 87).“ 

At this point, our estimates were contrasted with those corre¬ 
sponding to two popular hypotheses of the objectives of unions. First, 
under the wage bill maximization hypothesis, 0 = .5 and -y = 8 = 0, 
and the estimates of the labor demand function fitted under these 
constraints are given in row b of table 1. A conventional likelihood 
ratio test finds that the unconstrained estimates in row a represent a 
significant improvement (at the 5 percent level) over those fitted 
under the wage bill maximization hypothesis in row b. Second, a test 
of the rent maximization hypothesis requires an interpretation of y as 
the wage rate that typographers would earn if they were not or- 


It IS not suaightforwaid to compare these csiiniales of lahoi dem.md claslitilies 
with others in (tie litei.ttiire suite these relate to a stnall gtoup of workeis in a sttiglc 
hiin, wtiile previous estimates have been at a much higher level of aggregation. 

The assunipliim that output is exogenous in these e<|uatiotis was consideicd. For 
nislant e, errors in measuring output may arise because our index of newspaper output, 
iiarnclv, advertising Image, ignores ilimensioiis of output such as quality and the space 
devoted to news. For this reason, some inslmmenial variable estimates weie deiivctl 
where the instrumcnls lor newspaper output consisted of the city-cone household 
population per year, aggregate U.S. advertising expenditures per household in real 
dollars, and the share of aggregate advertising expenditures earned by television. 
Cleatly, these variables arc exogenous to the individual newspaper tiriii. The resulting 
estimates ot the parameters ol the labor tiemand fiiiKiion were very similar to those m 
tow a ol table 1. 

“ This hnding that newsprint and iy|M)graphers are substitute inputs is consistent 
with the stylized facts in the newspaper industry. For as the relative price ot newsprint 
rose, firms found it optimal to conserve on newsprint by producing zoned editions of 
newspapers, and typographers were required to compose advertising copy lor each 
edition's separate sections. 

“ Augmenting the labor demand funtiion with a linear time trend variable produces 
similai results to iho.sc reported m rowa (table 1), while the estimated coelticieiu on the 
tiend variable is about one-quarter its estimated standard error To be specific, the 
maximum-likelihood estimates of this system are as lollows (with estimated SEs m 
parentheses)-e = .ifOl (.05-1), y =‘Z. 126 (.880), 8 = - 130.02 (283..58), a„ = 196.4(91.8), 
o. = -80.99 (27.59), a, = 1.087 (1 451), a, = 3.315 (2.653), a, = 85.14 (15.89), and 
with the estimated coefheiem on the lime trend variable .290 (1.302). 
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ganized in a trade union. This alternative, nonunion, wage rate is not 
observed, of course, but a plausible specification is that this alterna¬ 
tive wage is a linear function of what some existing nonunion workers 
receive. In particular, suppose that, in the absence of the I'lTJ, these 
workers would earn a linear function of the real average hourly earn¬ 
ings of nonsupervisory workers in the retail trade: y = yo + 

With this specification for y, the rent maximization hypothesis implies 
the restrictions 6 = .5 and 8 = 0. Ihe estimates corresponding to this 
formulation are presented in row c of table 1 with the unconstrained 
estimates in row d. A comparison of the values of the likelihood 
function in rows c and d results in a rejection (at the 5 percent level) of 
the null hypothesis that the unconstrained estimates are no different 
from those implied by the rent maximization hypothesis. Therefore, 
the objectives of the n U at the Cinrinnati Post are not adequately 
described by the maximization of the wage bill or by the maximization 
of rents. 

The estimates iti row d of table 1 correspond to a specification in 
which y is a linear function of an alternative real wage rate, while 8 is a 
constant. A natural generalization is to |>ennii 6 to vary, and, on 
investigation, it was found that 5 appeared to take on a higher value 
after the 19.58 merger which resulted iti the ITU members of the 
I'mes-Star chapel being absorbed into thePm^'s composing room. I'he 
final two rows of table 1 augment the specification of y = y^ + y,(i'lp) 
with the specification of 5 = 8o + SiD in rowe. This more general for¬ 
mulation yields the highest point estimate of 6 (namely. 490) that we 
encountered for Cincinnati. However, it is not clear what importance 
should be attached to this because the implied estimates of y and 6 are 
greater than the corresponding mean values of real wages and em¬ 
ployment, respectively, so the objective functicsn is not defined at 
these values, fhis particular outcome prcjbably results from too re¬ 
strictive a specification of the movements in y and 8, but, unfortu¬ 
nately, we are handicapped in our investigation of this matter bv 
insufficient degrees of freedom provided by this dat.i set. In general, 
of all the coefficients estimated, we found the point estimates of the 
employment parameter 8 to be the most scn.sitive to slight alterations 
in the specification of the model. 

Estimates that are more consonant with the Stone-Cearv specifica- 

torcsLall any coiitu.Mon, ihc* a.ssuiTn>lion here is iiol lliat. it they wcic not 
unioiii/cd, ihc’.sc Ivpographci's would Ix" workers in lelail ti.itle 1 he .issiiinpiion is thai 
it ihe typographers were not iinioiii/ed and remained is|><)gi aplici s. their wage over 
time would be some proptiruon ot what nonunion woikcrs in retail trade rcteive. The 
V series used relates to the United Slates as ,i wtiole .\n hourly earnings series tor 
nonsiipci visoi V woi kers in the lelail ir,tde in Ohio is available oiil\ alter 19.S2 and in 
Canrinnaii alter 19.S7 However, lor the years liom 1953 to 1966. the Clhio series and 
the U.S. .series move very clo.selv togelher. 
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tion are given in row / of table 1, but they are obtained only after 
arbitrarily restricting the paranneter 80 to a value of zero. According to 
the results in row/, the elasticity of substitution between real wages 
and employment in the union’s objective function is .305 at sample 
mean values of the variables. The elasticity of the maximum value of 
the union’s objective function with resjject to wages (.97) is virtually 
identical to the elasticity with respect to employment (.98). The esti¬ 
mate of y, is consistent with the hypothesis that the ITU’s reference 
wage, y, rises dollar for dollar with increases in the wages of non¬ 
union workers in retail trade. Tlie merger raises S by almost exactly the 
amount by which the merger shifts the labor demand function (as 
given by «.,). At sample mean values, the elasticity of employment with 
respect to wages is — 1.00, that with respect to output is .53, that with 
respect to the price of newsprint is .55, and that with respect to the 
price of machinery is .44. 

Finally, to illustrate the use of the union’s indirect objective func¬ 
tion equation (II), we formed the index f'/Fo, where Vq and F, 
indicate the values of the indirect objective function in the year 1946 
and in each subsequent year/, respectively. This ratio, F,/Fo, may be 
interpreted as an index number of the union’s objective function 
given the observed values of/>,r,,rj, andX in year / relative to its value 
in the base year 1946. Calculating this welfare index with the 
parameter estimates from row/of table 1 yields the following results; 
for 1946, F,/Fo = 1.00; for 1950, 1.46; for 1954, 1.50; for 1958, 1.57; 
for 1962, 1.55; and for 1965, 1.76. This registers a 76 percent in¬ 
crease in the union’s welfare over this period. In contrast, the real 
wages of these typographers rose by 41 percent between 1946 and 
1965 while employment grew 118 percent. 


Other Union Locah 

To examine the generality of this framework of union behavior, data 
for the 1946-65 period were collected for seven other local chapters 
of the ITU. Unlike the data for the Cincinnati Post, observations on 
firm employment are not readily available so that for these seven 
locals we used data on local union membership instead.*® The result¬ 
ing seven union locals range in size from Fond du Lac, Wisconsin. 


“ Data on local union membership may be obtained from the union's publication 
Typographical Journal. These union membership data include those employed in com¬ 
mercial printing establishments as well as those in newspaper firms. We were obliged to 
use the union membership data; though, to minimize measurement problems, large 
cities with substantial printing industries (such as New York and Chicago) were ex¬ 
cluded from our analysis. For most union locals, commercial printing employment 
constitutes a very small fraction of the membership. 
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with a mean journeyman membership of 41 over the 1946-65 period, 
to the Columbus union with average membership of over 600. In each 
city but one, the wage and advertising linage variables relate to the 
single newspaper firm in the city.** The stochastic forms of equations 
(8) and (9) were fitted to the time-series data for each of the seven 
unions with estimates given in table 2. 

As far as the union’s objective function parameters are concerned, 
first observe that the estimates of y are greater than the mean values 
of real wages for Elmira and Fond du l.ac and the estimates of 8 
exceed the mean values of employment for Augusta, Elmira, and 
Fond du Lac. As was the case for the estimates for Cincinnati in row e 
of table 1, this is a consequence (we surmise) of not accounting 
adequately for movements in the reference levels, y and 8.” For the 
other four unions, the estimates of y are significantly gieater than 
zero in all cases, while for Columbus only would the estimate of 8 be 
evaluated as greater than zero at conventional levels of significance. 
The point estimates of Q range from .15 in Dubuque to .79 in Colum¬ 
bia. Except for Dubuque, the estimates of d are significantly greater 
than zero and smaller than unity.®' Comparing the results for Colum¬ 
bia, Columbus, Dubuque, and Memphis, the tendency is for the 
estimates of the “reference” parameters, y and 8, to be greater for the 
larger unions. These larger unions are kKated in bigger cities where 
opportunities for alternative employment (especially in commercial 
printing) are greater and where wage levels are higher. In particular, 
the significance of y suggests that the union’s wage policy is deter¬ 
mined with reference to some alternative or comparison wage rate, 
and models that ignore this aspect of union behavior are neglecting 
what appears to be a consistent finding in this study. In this regard, 
using a likelihood ratio lest, we are able to reject (at even the 1 percent 
level) for all of the unions the hypothesis that the unconstrained 
estimates are no different from those implied by the wage bill 
maximization hypothesis (under which 6 = .5 and y = 6 = 0). 

The estimates of the parameters of the labor demanil function are 
probably best understood in terms of their implications for the elas¬ 
ticity of demand for labor which are given in the final three columns 


” Columbus IS the single exception. In this city the <i.Ha arc aggiegatetl over several 
newspaper firms. We follow Rosse (1978) in treating agemv shop uewspaiiers (i.e., 
"enterprises with separately owned and piodiiced editorial pioducts but with joint 
mechanical production and joint sale of advertising and subscnplions" in the same citv 


[p. 3]) as a single firm. , . 11 t- 

*011 occasions* permitting the reference levels y and 6 to vary resulted in a likeli¬ 
hood function with no pronounced niaxiinuin. 

Evaluated at each union's sample mean values, the clasuciu ol stibsiiuitimi l^tween 
wages and employment in the union’s objeciive function is esiimaied at .70 lor t.oliiiii- 
bia, .04 for Columbus, .49 tor Dubuque, and .20 for Memphis 
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requires more research concerns the intertemporal aspect of the 
model: Clearly, the objectives of unions are defined over the future 
values of variables as well as their current values, while the employer’s 
immediate response to a negotiated settlement may well differ from 
his longer-term response. Any serious attempt to model trade union 
behavior must eventually come to grips with these issues. What has 
been described in this paper is only an initial step toward that more 
satisfactory model. 
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The paper provides a theoretical flcstnplion ol' hedging in fulures 
markets that accounts for tlie behavior ot a broad class of agents. 
Specific optimal det ision i tiles are derived for agents concerned with 
the mean and variance of profit. These rules are used to evaluate 
how opliitial cash and futures positions are related to price ex¬ 
pectations, the production possibilities, and the number of futures 
markets available. 


1. Introduction 

Futures markets in commodities, foreign exchange, and Hnancial 
assets are thriving with increased participation by a broad spectrum of 
hedgers and speculators. Despite this I'act, there is no satisfactory 
systematic account of the optimal use of futures markets by hedgers 
and of the way real output decisions are affected by the availability of 
futures markets. Conventional descriptions (such as that of 
Hieronymus [1971]) seem to propose a view of hedging as taking a 
position in a futures market that is equal in size and opposite to a 
predetermined position in the cash market. Such a simple view does 
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not square well with the observed diversity and sophistication of 
futures markets participants. This paper is an attempt to provide a 
theoretical description of hedging that could better account for the 
behavior of a broad class of hedgers. The following facts and results 
are underlined; (1) Optimal cash and futures positions are normally 
determined simultaneously. The availability of hedging opportunities 
affects production or trading decisions. (2) The optimal hedge can be 
viewed as the sum of a risk-minimizing position and a speculative 
position. (3) The traditional “routine hedge” (which prescribes the 
futures position to be the exact opposite of the cash position) in 
general is suhoptimal in that it cannot be identified as a risk- 
minimizing position or as an optimal speculative deviation from the 
minimum-risk position. (4) In the absence of transaction costs and of a 
“perfect hedge,” hedgers will take advantage of the better risk-return 
performance associated with simultaneous trades in a multiplicity of 
futures contracts. (5) When a limited number of contracts seems 
appropriate (as a result of transaction costs or indivisibilities), the 
question of which contract or combination of contracts gives best 
hedging results arises, I his can be answered by making use of avail¬ 
able statistical information. (6) When dealing in a good for which no 
futures exists, a “cross hedge” may be appropriate; the best cross 
hedge may be calculated in exactly the .same way as a standard hedge. 
(7) The allocation signal to be used by a hedger for his production 
decision is a linear combination of his expectations and the relevant 
futures prices. 

More specifically, we derive operational foimulae lor individuals 
concerned with the mean and variance of their profit. We acknowl¬ 
edge the fact that cash and futures contracts are rarely perfect sub¬ 
stitutes, or that the relation of the cash price to the relevant futures 
price at the time ol delivery may not Ije predicted with certainly (this 
is known as basis risk); and we recognize that the rarity of a perfect 
hedge may justify a portfolio approach to hedging, whereby risk 
reduction (and speculation) is achieved through dealing in multiple 
contracts. We derive criteria for choo.sing the best ctintracts wath 
which to hedge and evaluate the benefits associated with the existence 
of hedging opportunities. 

Previous literature had raided some of the deficiencies of the routine 
hedge prescription. Thus Working (1962) emphasized that the exis¬ 
tence of basis risk prevents the elimination of all risk and was led by 
his observation of hedging practices to the notions of anticipatory 
and “selective” hedging (where deviations from routine hedge are 
based on price level expectations), lelser (1955), Johnson (1960), 
Stein (1961), and Peck (1975) have analytical models of optimal 
hedging that share .some of the characteristics of ours. Johnson and 
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pectations may substitute for cash price expectations in the output 
clccisiorts. In the special case where the multiple regression coeffi¬ 
cients Slim to utiity (/3i = 1), cash price expectations disappear com¬ 
pletely from the decision. 


V. Adding Futures Markets 

In this section we examine how an agent's decisions can be altered by 
extending the range of available futures markets. Since we posit no 
fixed costs in tran.sacting in a market and no costs to formulate a 
probability distribution for the futures prices, it is clear that exletiding 
the list of futures markets will not decrease expected utility.’ Fur¬ 
thermore, the introduction of new futtires will induce a restructuring 
of the entire portfolio so that the effect on the speculative positions is 
difficult to predict. CJonsecjuently, we concentrate our attention on 
how the optimal cash position may be affected. 

Specifically, we consider whether the cash position will increase or 
decrease in si/,e wheti the list of futures is extended. The basic result is 
that when we move from w — 1 futures to m futures, the absolute 
value of the cash position increases it the lull vector of futures prices 
is not expected to chatige from period 0 to period 1. This result allows 
us to argue by induction that incieasing the list of futures from m to 
m* (m* > m) will increase real production when all these futures 
present no incentive lor speculation. The reason for excluding incen¬ 
tives for speculation is that, when such incentives exist, the additic:>n of 
liuures markets may induce a riskier futures position such that the 
lequirecl risk |)reniiuin in the casli good first-order condition in¬ 
creases. 

fo formali/'e the result, let us partition the lull vector ol m futures 
according toy' -= (y[,/,„), where/, is an i/i — 1 vector. .Similarly, let/)/' 
= {ff'u, j/rnt), and let the variance/covariance matrix ot {p,, 1 ) l^e 


ihe (ahli [JiKc and ilic [>ikc oi llic hetiger's “inulu.il inner’ (see c<j. L^])- 1 hen ihc 
piaiininj^ prict’ ()ctoin('S I his is the < ui rcnl pi ice oi the lied^ei s iniKiiai 

iunfi jujiiijs die expetfed gcnerdli/cd basis. J’his iiuerpiet.inoii JS llieureliially dllicU- 
livc. Hosvcver, dchned in diis way, the expecuxl basis cannot be intcrpTetcd widi 
expectations on price dilierc'iices only. Hence it does not leldin die pr.uucal ciiMaciivc- 
nvss ol the n.sujl nioiion ol die basis hIiicIi is due l<> obsei vaimn that pi ice dil iei enc es 
are more stable than puce levels. 

’’ It should \)c cniphasi/ed that the analysis ot this sec tion takes ilie LUireni prices and 
the distributions of fututc puces as j^iveii. (icneraJ equilibrium cficcts ol adding 
tuiuies iiiai kets cannot be iiaecd out in this contexl. In a general ccpnlibriuin aiiiilysis, 
Hart (1975) shows that adding mote lututes markets does not necessarily increase 
wellare 
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wliere o-„o, f^oHn and arc scalars; X,, is (m - 1) x (?« - I); and ^o, 
and X,„, arc 1 x {)n - 1). Wc assume i is nonsinj^ular. I hcn net 
levenue is 

n = PJ - f(v) - (pf, - pf,yf, - - /4»)/„r (10) 

It we consider maxinii/ing a linear mean variance tnnetion of n 
with respect toy,/,, and J,„, the necessaiy conditions arc, respectively: 

pi ~ ^ iy^ X( 3 'tr,ii, — i.()i /1 - cr„„|/,„) = 0, (1 1) 

P'iii-P'u - X( -y^n, + lul, + = 0, (12) 

and 

pmo - P'mi - x(-><ro.„ t + CT= 0. (13) 


l et y**,/f*, and /f„* he optimal with a lull \cc.ior of tuinres; then 
they solve (1 1)-(13). l et v* <tndyf he optimal with a resiiicted list of 
futures; then iliey solve (11) and (12) wiih/,„ = 0, We will show that it 
p(i - piQ iincl=/t,^i,,then | v** | «= |v*| witfi strict inequality when 
or (T(],„ is non-null. 

Under titese assumptions, (12) imphes/f = Inserting this 

into (11), we see V* s.uishes 

pi - r'iy*) - V*X('-^oo - i«|liil^,«) ^ 0. (1-f) 


Sinnlarh, using (12) and (13) to sohe loi /f” and Jf,p .ind inserting 
these into (11), we can see that v** must satisiv 


p, - c'(y**) - y*X' 


rr - V V-IV 
^ IK) —H —10 


—ml —M —Iii)"l 


where z = [1 “ (1/(7,ini,,„]. Now u can be shown that z 0/* 


■ t —1 I —l»M I 


- ltd (i:,, -X'n 

'll -.1" 

= >ltl ( "" 1 ■ <r,„,„(td i,, 

' i..., ir,„„ 

Moif wo have lopo.Uo'dly luado uvo ot ilio <»l ilotoi inin.iiio ot [),ii iitioiuxi 

mall u os (mo Din y mos D)70 jjp. 570-71) In th<‘ last o\|>i twsHni aiul iho loi m 

111 paionihosos aio .ill Illinois ot i:, .i nonsnij^nlai x.»iiaiu<''tu\ai laiuo iiiiiiis l)\ .is- 
sumpuon. rims the last oxpiossion i\ ^liioU posiino 
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Then comparing (14) and (15), we see that if o-om or 2,o is non-null, 


p\ ~ c'(y*) - y*-A(Taa - Soi2,!2,o - - - i<Tam ~ 

L 

(16) 

is positive ifjy* > 0 and negative ifj;* < 0. Given the properties ofc(-), 
wc see | v** | > |>* |- If both <rom and 2m are null, expression (16) 
reduces to (14), and y** = y*. 

As an interesting illustration of the above, consider passing from 
zero to one futures market. In this case)*** satisfies 

(1 - /3)^ + I3(pi - B) -c'(y**) - x>**o’oo(l “ P'^) ^^0 (17) 

(here we allow the possibility that ^ p{,). Similarly, y* satisfies 

h - c'iy*) - xy*(r(io = 0. (18) 

Using die properties of c( •). it is indeed easy to see that, provided that 
(p{, - B) does not differ too widely from p and that futures and cash 
pi ices are correlated (p # 0), ly** | > jy* |- The introduction of 
futures trading leads to increased real production. However, if fu¬ 
tures and cash prices are sufficiently out of line, the reliance on /»{, for 
production decisions may suggest more conservative cash positions. 
On the other hand, if p = 0, the futures market is of no help for 
hedging purposes, and (18) reduces to (17). The original risk of the 
hedger’s real activity is intact (oco); futures tiading is limited to plac¬ 
ing speculative side bets. 

In this latter case, it is dear that the utility derived from futures 
trading is entirely associated with the speculative opportunities of¬ 
fered. If, however, we abstract from that aspect of futures trading hy 
assuming = p^Q, it will be clear that utility increments also result 
Irom the hedging opportunities offered by futures markets; the size 
of the increment depends on the degree of correlation. I'o see this, 
substitute (5) info (2) with 2ii, 2oi, w and/ being scalars and differ¬ 
entiate with respect to y and cToi to get 

d(EU) = p^~ c' + -^^(p'o - p\) - ‘h 

+ (/>(>- 

I 0-11 O’!! 
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Under our assumptions, this reduces to d(EU)ldcro, = Xy'HfroJfr,,). 
Expected utility is increasing in | <ro, |. 

Finally, we derive a monetary measure ot the benefit of l>cing able 
to trade futures. For simplicity we derive this for a single futures case; 
similar expressions are possible for the multiple futures case. Lety* 
andy** be interpreted as above and suppose that the agent is able to 
trade in futures only by paying a fixed amount k. Let A* be the amount 
that would just leave the agent indif ferent to tiading futures or not 
trading futures; k* can be interpreted as the maximum fixed charge 
that the agent will be willing to pay for the right to trade futures. 
Within our framework we find 


k* 


pt(y** ~ y*) - [c(y**) - f(y*)] + 


+ 


(p^<^ - p{)y** 


_ X. 

<r,i 2 


y**^(<Too - 


ipi ~P'iV 

XO’n 

tTit / ■ 


As an example of how this expression could be used, we might 
imagine a broker who wishes to assess the service charge a potential 
hedger would tolerate in order to deal in futures. If it is thought ifiat 
the hedger is unlikely to perceive tfie speculative opportunities (/?{, = 
p^i), the amount the hedger is willing to pay is composed of two parts; 
p\(y** ~ >*) “ [fly**) ~ r(y*)J IS the expected increment of net 
revenue on the cash market attributable to dealing in futures, and 
{y**‘^[vno ~ (u'oi/cr||)l ~ is the increment of risk in the futures 

regime versus ilie nonfutures regime. J'his will lend to be posiiive if 
I troi I and c" arc relatively small and negaiive when | (t„i | and c" are 
relatively large. I'o sum up, the amount the hedger is willing to |)ay is 
his increased expected cash revenue less his increase in risk weighted 
by his degree of absolute risk aversion. 


VI. Restrictions on Stochastic Relations between Cash and 
Futures Prices 

Singular Price VariancelCovariance Matrix 

When S is singular, knowledge of all futures ]>rices is ec|ui\alent. in a 
certain sense, to knowledge of the cash price. I bis has interesting 
implications for both the optimal casli and futures decisions 

First, recall that from the jsroperties of partitioned matnees, if L,, is 
.nonsingular, 

det ^ = (troo ~ tlel 


— (1 tiet E||. 
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It ^ is singular, dct 2 = 0 and we sec o-oo = 2oi2rl2|o or, equivalently, 
/?^ = 1. riius when (he vai iante/covariance matrix of prices is singu¬ 
lar, the cash price is perfectly correlated with a linear combination of 
futures prices. 

This restriction is closely tied to the notion of the routine hedge. 
For in this case equation (5) implies = (l/x)Xoi^ii(po ~ p\) + 

VcriK,. The covariance weighted sum of futures positions is decomposed 
into a speculative componettt that is proportional to the regression 
weighted sum of expected futures price changes and a pure hedging 
lomponent that is proportional to the cash position. The restriction 
implied by the singularity of 2 stands out more clearly if vve dimen¬ 
sion (as we ate free to do) the cash and futures goods so that cr„a — o-,, 
= . . . = (T„„. rhen when there is no incentive to speculate we have 
— V, where pj is the simple correlaturn coefhcient oi'p{, and p,. 
fh.it is, the correlation coefficient weighted sum of futures must 
eqttal the cash position. In this sense, a singular 2 is associated with a 
“generali/.ed routine hedge.” If there is only one futures contract, 
II = 1, the jnire hedge is in fact a “propei” routine hedge: / = y. 

The implication of a singular 2 for the cash position is seen im- 
mt'dratcly troirr equation (7) for, makiirg use of (9), it reduces to 

(\- fit)p,+ (i{pi-B,)^c'{y)^0. (19) 

Here the risk premium has disappeared, indictiiing ih;it the cash 
derision is tio longer affected by tire level of risk accepted in the 
fiUutes |rorlfolio. 


A'ri llfhit Hhk 

We have already noted that the interest in the basis is a result ol the 
fact that the dilleieute ol prices irray be more stable than the juice 
levels. As a limiting r ,ise, the cash aird futures jjrites may be random 
wlrile the basis is not. Koi example, this would be the case if at the lime 
ol the futures market expiration the costs of transportation to the 
cash market are noiistochastic. 

If the vector ol bases li, is a vector ol known constants, cr,,, = cr,, = 
(Too ‘Uul thus Pi = 1 lot all i. (iash price is jrerfectly correlated with 
each futures price. In this case 2 is obviously singular, so that equation 
(19) holds. We now have the luilher restiiclion that the icgressioti 
coefficients sum to unity. For we have cr„,, = 2o|2|l2|o ^ jS^io or 1 = 
/32io( 1 ^ f-foo) = /3(p] . . . pv)' = fit" Using this in (19), we see that when 
there is no basis risk, the optimal c ash position is given by ^ip{, ~ li,) 
~ f’iy) = 9- 1 be planning price is entirely a f unction of current |u ices 
and basis expectations; the expected cash jjiice does not enter the 
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decision. For the sitiffle cross-hedge case, this cotidition, first derived 
in Danthine (1978), leads- f'( 3 i) = 0. 

VII. Basis Speculation 

Working (1962) and others have expressed the view that hedging and 
speculative behavior are distinguished not by the one being risk 
avoiding and the other i isk taking bin by the kinds of risk accepted in 
the two cases. Speculators take pi ice-level risk, while hedgers take 
price dilterence (basis) lisk. 1 bus hedgeis are said to speculate on the 
basis. 

Resliicting ourselves to the « = I case, we can give a basis specula¬ 
tion interpretation to the hedger’s problem by assuming c(y) = p(,y: 
that is, the hedger is ,i storage company with costless stoiage. When 
we rewrite cejuation (1), net revenue' becomes 

tr = (/i, - pn)y - (p{- pi)/ 

= (Ha - ^,)V + (pi - p,)'‘, 

where s = / — >' is the excess ot the futures position over the routine 
hedge. I'he agent speculates on the basis by the amount y and on the 
price level by the amoum .'. Working's pure basts speculator arbitrar¬ 
ily sets i — 0. Our agent jointly selects y and s, thus, in general, 
attaining a better lisk/return combination W'hile suboptimal in this 
sense, pure basis speculation may be defended if the agent possesses 
intbrmation about B, and var B, but not the moments ot the under¬ 
lying pi ices. 

VIII. Conclusion 

Oui descriptions of optimal hedging decisions .n e spetibc enough to 
be implemented atid their performance to be ev.iliiated. f lowever, the 
question of empirical applications raises .sevei.il issues that we have 
not addressed exjilicirlv. 

First, we have ignored the fact that futures contracts ate 
standardi/.ed as to quaiiliiy and must be traded in integer multiples. 
For a small user this discreteness is a seiious concern and ma\ elimi¬ 
nate the possibility of multiple cross hedges. Keen tor the large 
hedger the discreteness of tmutes markets implicitly may limit tlic 
number cst markets that should be coiisideied in the poitlolio. 

A more important concei n is the tact that we have assumed that the 
agent has knowledge of the relevant moments ot the piobabilitv 
distributions of prices. In jsractice, a considerable ettort inav be re¬ 
quired to estimate these monicMits, patlicularlv d they arc' subject to 
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change over time. The stability of moments (or of some more general 
structure from which the moments can be derived) is, of course, 
fundamental to our results, as indeed it is to the notion of hedging. 
The limited ability to identify such stable relationships is likely to be 
the most serious constraint in developing very extensive multiple 
cross hedges. In Anderson and Danthine (1980) we address some of 
the problems of estimation and illustrate the approach with empirical 
applications. 

While we feel that we have provided an account of hedging in 
futures markets that is more satisfactory than any other we know, we 
recognize it is not fully general. One restriction is that we have 
assumed a nonsttKhastic production process. Judging from Anderson 
and Danthine (1978) and Rolfo (1980), which treat production un¬ 
certainty, the results of the present paper would not be seriously 
altered by the relaxation of this assumption, f'inally, in assuming only 
two trading dates, we have suppressed the possibility of changing the 
futures position many times during the period of holding an open 
cash position. In Anderson and Danthine (1981) we consider the 
problem of hedging in a single futures market but with many possible 
trading dales. 
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The Implications of Competition among 
Jurisdictions: Does Tiebout Need Politics? 
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The paper invc.stij{atcs whclhcr competition among local jurisilic- 
tions is, by itself, sulficient to ensure elhcicnt provision of local 
public goods. Jurisdiction.s have fixed boundaries, and each has an 
entrenched government with the power to tax and supply the public 
good. Residents tan move costlessly among jurisdit tions. It is shown 
that competition among numerous jurisdictions is not sulficient to 
guarantee public sector cfheienev. Though residents can "vole with 
their feet, ’ land is immobile. Ileiue. governments tan usurp some 
land rents for their own ends. Increasing the number ofjuiisdiriions 
limits bur cannot completely eliminate the abilii) to exercise di.s- 
cretionary governmental power. 


I. Introduction 

Ciovernmerits are alleged to be loo big, inefficiently operated, and 
unresponsive to the public. The assumption that government activi¬ 
ties are undertaken to serve the public has fallen into disrepute.' In¬ 
stead, positive analyses of public activities increasingly t.ike the gov¬ 
ernmental objective to be that of satisfying a buicaucraiic goal such as 
expenditure maximization or maximization of the utility of the head 
of the government agency.* Such objectives have been assumed to 
characterize local as well as national govcrnineiiis.* Indeed, the itlea 

We would like to thank the pailicipants in the Poliiual Ecoiioinv W’oikshop at the 
University of Pennsylvania, the MicuKHonoinics Woikshop at the L'niceisitv ol \ n- 
ginia, the Public Choice Workshop at Viignna Pobtechnic Insiiiuie, and .111 .inons mows 
referee for helpful comments. We thank Roix'il Inman bn' his lommenis and tor 
suggesting the title for the paper. 

' See, e.g.. Downs (1957) .ind Sngler (1971) loi seminal aigiimrms 
’See Posner 1974; Tullcxk 1974; and Niskaiien 1975 
’ .See Wagner and Weber 1975. 
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that local governiiients may pursue various goals is implicit in analyses 
of' local fiscal variables which assume that the values of these variables 
are dependent on or influenced by the process of collective choice; the 
characteristics of the decisive voter(s) as well as the nature and institu¬ 
tional form of local governments are presumed to matter.'* 

At the national level, it is often argued that competition via the 
electoral process leaves the bureaucrat considerable latitude because 
voters have limited incentive lo learn about candidates or to vole their 
preferences. At the local level, however, it has been argued that 
competition among jurisdictions restricts a bureaucrat’s ability to pur¬ 
sue policies which do not reflect the resident.s’ desires. In fact, I'iebout 
(1956) argued that jurisdictional competition would guarantee the 
same efficiency in the prttvision of local public goods that competition 
among firms assures in private goods markets. I'iebout’s argument 
thus appears to imply that the ballot box is unnecessary, that efficient 
provisioti of local public services will arise because residents can vote 
with their feet. If, in fact, jurisdictional coitipetilion leads to efficient 
local goods provision, then analyses of the political choice process 
would appear to be applicable primarily in those cases in which the 
relevant set of juiisdiclions is small. 

fn a recent atlicle, Cour;mt, Ciramlieft, and Rubinfcld (1979) study 
the choice of fiscal variables by a local government that has been 
captured by its bureauciacy. I'lieir analysis, whiclt focuses upon deci¬ 
sion making in a single jurisdiction, investigates the extent lo which 
the ability of residents lo leave the jurisdiction limits the power of its 
bureaucracy. Their paper provides an interesting analysis of the 
trade-off faced by a public employees’ union in a single jurisdiction 
wfiose residents are tnobile. 

Our analysis fijcuses on the equilibrium among jurisdictions. We 
seek to cletettnine how changing tfie number of juristlictions (i.e., 
changing the degree of competition amtsng jurisdictions) constrains 
tax and expenditure policies of individual goveinments. We posit 
alternative assumptions about governmental objec tives to examine the 
consequences of jurisdictional compelilioti.’* 

In particular, we cjucsiion whether ccnnpeiition among a large 
number of jurisdictions forces governments to implement policies 
that are in the best interests of their re.sidenls. To answer this (jues- 
tion, we deliberately consider a rather extreme version of government 


^ .See ll.itr.uid Davis 1966; Boi diet ding ami Deacon 1972, llergsiroin and 0(««linan 
1973; Denzau .ind Mackay 1976; Inman 1976; and Romer ,nid Rosenthal 1978. 

* t^ouranl and Riihinfeld (1978) explore benefit nie.isuicnienl issues in a model with 
equilibrium among juristlictions However, neither jurisdiilional conipetilion nor an 
investigation of implications of alternative goveinnienlal objcciivcs is the hx us ol then 
pajier. 
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entrenchment. 1 he government (the mayor, the bureaucracy, or 
however dehned) in each juristliciion is a.ssumeci to choo.se a lax rate 
and level ot government spending to maximize government profit. 
The government is limited in its choice of tax instruments by a 
constitution over which it has no control, but there is no political 
mechanism by which residents can affect the lax rate or the level ot 
government spending. Residents can, however, move freely among 
jurisdictions. We do not claim that local giwernmenls actually maximize 
profits; we do not wish to argue that local governments are entrenched and 
unresponsive to their electorate',. Resident voters, via intrajurisdictional pro¬ 
cesses, may indeed restrain governmental beluivior to he in accord with then 
ouJti desires. Without denying this possibility, this paper addresses the question 
of whether interjurisdictionol conipetilion can, in itselj, accomplish the same 
end. Although increasing the number of Jurisdictions reduces each 
government’s ability to levy taxes m excess ol expenditures, we dem¬ 
onstrate that competition among Jurisdictions, even with very many 
Jurisdictions, is not sutriiiem to prevent individual governments Irom 
ptirsuing policies which are not in the interests of their residents. We 
thus deinonstiate that the right of citizens to t boose among many 
communities cannot completely eliminate goveinmemal monopoly 
power; that is, competition among governments is not equivalent to 
cotnpetition among firms. A further implication is that the political 
choice piocess can matter. 

■fhe above results arc established in a model in which every gov¬ 
ernment seeks to iiiaxinii/.e goveinmemal profit and in which each 
Jutisdiction is equivalent in size and location. Once this resvilt has 
been established, it is natural to ask whether governmems in different 
jurisdictions can coexist while pursuing different objectives. This 
question is answered af firmativelv even when the number of jui isdic- 
tions is very large. In answering this cjuestion, we shed additional light 
on the “capitalization hypothesis" ami funher clantv the interpreta¬ 
tion of tests of the I iebout hypothesis, finally, we explain what would 
happen if all jurisdictions were not homogeneous in size or locational 
desirability or both. One ot the implications is that the presence of a 
laigc number of jurisdictions in one portion ot the inetiopolit.m area, 
say, the suburbs, limits the monopoly power ot jurisdictions in that 
portion ot the metropolitan arc*a Inn has little etteci on the monopoly 
power of jurisdictions in other portions ol the metropolitan area, sav, 
the central city. 

In the .section which tollows, the model is introduced and some 
comparative statics are derived for use in the subsccpient sections. In 
Section 111, governments are .tssctined to maximize profits, and the 
implications of jurisdictional competition are examined, 1 he conse¬ 
quences of jurisdictional comjjetitican when objectives differ among 
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governments are examined in Section IV. In Section V we investigate 
the case in which jurisdictions of differing size occupy differing por¬ 
tions of the metropolitan area. A summary and conclusions are pre¬ 
sented in Section VI. 


II. Equilibrium with Multiple Jurisdictions* 

In a metropolitan area with homogeneous individuals each of whom 
can freely choose the jurisdiction in which he wishes to reside, no 
jurisdiction can unilaterally create a level of well-being for its resi¬ 
dents different from that available to residents of the remaining 
jurisdictions (assuming tnore than one). If, for instance, one jurisdic¬ 
tion chooses a Hscal package (i.e., tax rate and level of government 
services) which resident.s firul relatively unattractive, they will respond 
by nioving to other jurisdictions. As a result of this movement, hous¬ 
ing prices adjust to compen.sate for differences in the desirability of 
fiscal pac kages. In this section, we will set forth more exactly the 
implications of this assumption that individuals can choose their resi¬ 
dential location. The primary purpose of this section is to derive 
properties of the equilibrium. These properties are used in the 
analyses in subsequent sections. I hc analyses in this and the following 
.section arc kept relatively simple by the assumption that all jurisdic¬ 
tions ate alike both in terms of their natural physical amenities and in 
terms of their fiscal variables. 

(Consider, therefore, a metropolitan area with a fixed total land 
area, L, in which there are J jurisdictions.’ The land area is divided 
equally among the jurisdictions, so that each has a fixed boundary 
encompassing land area /- = L!J. Housing is produced by competitive 
firms in the jurisdiction from land and nonland factors via a 
constant-returns neoclassical production function. The price of non- 
land factors is a.ssurned fixed and unilorm throughout the met- 
ropttlitan area. The housing-supply function in the yth jurisdiction 
tan then be written as;" 

= Lh>,(PU), (1) 


^ 111 this and Mibsecjucnt sections of the paper, we assume an equilibrium exists and 
we investigate the pioperties of the eqiidibiiiiin. 

^ In Se(. V we show that our results arc not changed qualiUitively if it is assumed that 
additional land is available at (he urban fringe 

® 'I'he housing-supply function is obtained by assuming that housing produceis 
choose the ratio of nonland to land inputs to maximize profits. The production 
lunction is assumed to exhibit constant returns to scale. The ratio of nonland to land 
inputs will be an increasing function of the price of housing, and it follows that the 
amount of housing supplied fJer unit of land will increa.se as the price of housing 
increases. 
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where Pf, is the price of housing in the^th jurisdiction and h’, is the 
supply function of housing per unit of land. 

The jurisdiction supplies (P units of a local public good, each 
resident is assumed to receive an equal nonshared amount. I he good 
is produced with constant returns to scale from nonland factors, and 
its unit cost will be taken as unity. Thus, the total cost of the good is 
proportional to the number of residents. I'he jurisdiction hnances 
this good by assessment of a Hat rate property tax, V, on the market 
value of housing in the jurisdiction. Determination of the levels oi CP 
and P will be discussed in the following sections. 

There are N identical individuals in the metropolitan area and in 
the jth jurisdiction. Each individual has a utility function over gov¬ 
ernment services, housing, and numeraire goods (H), U(C',, h, B), and 
faces the budget constraint, / = Pi,h(l + 1) + B. Individuals in each 
jurisdiction choose housing quantity to inaximi/e utility, taking CP and 
B as given. Let K[G^A’//(1 + denote the indirect uttlity function that 
results. That is, 

|/[GJ,P/,(1 + Idl = max U[(PMJ - P/,Ai(l + 1^)1. (2) 

hit 

The demand function for housing by an individual in jurisdiction 
given the decision to live in jurisdiction j. that results Irom this 
maxiniization is: 

IP^ = h;,\(P.Pl,(l i r)\. Cl) 

Note that expresses housing demanded per pci son, unlike /t,. 
which expresses housing supplied pet unit ol land. 

Each individual is free to move among jurisdictions, and his lot.i- 
tional choice is that which maximi/es his utilitv. In etjuilibiiuin, no 
individual has an incentive to nxave. I htis, utility must be the same in 
all jurisdictions: 

y{cp.p‘„i\ + C)i = Pi(P.Pj,(i + p}]. ( 1 ) 

All /V-' individuals in each jurisdiction must be housed, so 

X'/iJ, = /./d,, Ct) 


and all individuals in the itietropolitaii .irea must live in some jui is- 
diction, 

j 

Si ^ .vc (6) 

>=1 

Conditions (5) and (6) can be combined to vield: 




1 202 


JOURNAI, OF POLITICAL ECONOMY 

We will assume that the government of each jurisdiction, whatever 
its objective, adopts a Cournot-Nash strategy, choosing its and 
treating all other jurisdictions’ choices of G and t as parameters (i.e., 
no attempt will be made to investigate strategies in which one gov¬ 
ernment attempts to determine how changes in its tax rate or service 
level will affect the choice of lax rates or service levels in other 
jurisdictions). In the remainder of this section, we derive the com¬ 
parative static response of and Al-* in each jurisdiction when l'‘ and 
G'^ are varied in any one jurisdiction. For noiational convenience, we 
will denote jurisdiction 1 (i.e., ft = 1) as the juiisdiclion whose tax and 
spending levels are perturbed.” 

If jurisdiction 1 dilferentially changes its lax rate while no other 
fiscal parameter is altered, individuals would move into or out of 
juiisdiclion 1 and out of or into the remainingy — 1 jurisdictions. 
Ohanges m housing prices will ensure that e(|uilibi'ium, as described 
by conditions (4) and (7), is restored. (a>nsequently, totally differ¬ 
entiating (4) and (7), we obtain the response of pi ices to a change in 


-U'oh),i\ + l')dP'„ + UydAl + t>)dP^„ = V'„h],P),dl' , j 9^ ]."' 



djfPM) 

dPU 


dPi, = -/. 


dWhh) 

dt' 


(Ha) 

(8b) 


III order to sinifilify notation, for each jurisdiction, let rf represent 
the elasticity of housing demand with respc'ct to the gross-of-tax pi ice 


IV<hiiicallv, this iiuxlel is not "closed" lot iwo uasons. First, land has a positive 
pi ice III all )ui isciictions .itui the icliii ii on laiici wealth has not been im luderi as income 
lo a>Msinm’i s, Sc'crind, if ilie ^ovei nment olxains f.ix icveniic in excess ofexpendiruies, 
the profit should he rountecl as income to some agent. Koiinally incoipoiating these 
elements retjuires dealing with wcalili eliects—which au* a nuisance and do not shed 
any light on the issues ol iiiteicst «i iis I he leium on land wealth could be iricorpo- 
raicd Ml one of two ways 1 itlc to land could Ih! treated with each consumer-resident 
endowed with shaies valued at l/Nlh of the aggregate value of ihc land in the 
mcirojxihiaii area. I his would not change oui analysis or our icsulls in any essential 
respect Alternatively, the uiiliiy funciioii of roiiMiineis could f>e assumed to (ji,splay .i 
constant niaiginal utility ot the niimeiaiie comiiicKhty, i.e , lo be of the foim U ~- 
W{(f,h) ^ ^ Wilh this utility lunclioii, variations in wealth arising either from changes 
Ml land value oi from piohfs in the government sccioi would not affect (heequilibrium 
in the maiket for housing within or across jui isdiciions. The fact that wealdi eliects t an 
be introduced in these ways without alfeciing our cpiaUlativc resiiUs remtoiccs our 
assertion iliat they < an (x* assumed away. While closing the model by one or more c)f ihc 
devices mentioned alxne may Ix^ an appealing formalism, the added roinplcxiiy adds 
(iiinhei some detail wuhoui in any way illuminating the issues being addressed. 

In obtaining (8a) fiom (4), we have made use oi cq (2). Note that (4) and (2) 
together irnpiy that (fie direct utility function evaluated at the optimum choice of 
luiusing must lx* equal across jurisdictions. Hence we simply use the envcloj>e iheoicin, 
that residents’ housing demand optimally adjusts to the new gross-of-iax price of 
housing, /^w(I + /). in each community. 
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of housing (r) < 0), lei 6^ represent the elasticity of housing supply 
with respect to the nel-of-tax price of housing (6 > 0), and let y 
represent the elasticity ol housing tleinaiid with respet t to govern¬ 
ment services. In addition, let represent the functionally deter¬ 
mined density of population. 

Since in this section all governments are assumed to behave iilcnii- 
cally and adopt the same strategy and all governrnenis and resKlents 
are identical, vve assume that all governments will adopt the same G 
and I in ecjuilibrium. I'hus, when /' is perturbed, dP'ft = ... = (iP’n. and 
the levels of all endogenous variables are the same in all jurisdictions. 
Using these conditions and the elasticities and ratio dehned above, vvt 
may rewrite etptalions (8) as:" 


dP\, _ P,Av - (7 - m-v)] 
dP j(\+i){d--n) 


(9a) 


dPh _ PuO _ 

dP y(i+,)(0_^) 


/ = 2. J. 


(9b) 


It is apparent trom (9a) that an im reuse in the number of jurisdic¬ 
tions increases the sensitiviiy of P], to changes in (', (hat is, as/ 
inci eases, dP'nIdP Itecomes more negative, and in, (he limit ,\sj —» x it 
approaches - P„l( 1 + 1). I bis implies that in the limit the deritative of 
the gross-of-tax price, d{P]i( 1 + l')\ldp. .tppioat hes zero, so that any 
change in P is fully offset In a change in P'^. ,\s the number of 
jiii isditlions becoines large*, ihe price in other jiirisdiciions is unaf¬ 
fected by changes in d, as is seen l)y taking the limit of l9b) asy —* x. 

Under our simplilUng assumption of identital governments and 
residents, the comparative static effects on liousing pi ices caused bv a 
change in fd can be derived in the same manner .is the effects caused 
by change's in d. 1 hc'se comparative statics are: 


+_ ifji —: (1 Oa) 

dG' JUnhA^+l) JG{0-v) 

<lP‘ii ^ _ c_ lljl— (1 Ob) 

dG' /fV'd(l+0 JG{0-v) 

The first term in (lOa) reflects the increase in housing prices arising 
because jiii isdiction I becomes more dcsiiable lo consume is ,is (> 
rises, so that more consumers wish lo enter I he second Iciin indi¬ 
cates (he effect of the coin|)lc'meiii.uitv (y ^ 0) or subsiiiuiabiliiv (y -- 
0) of government services and housing. If ihev ,u e complemeiitai \. 
an increase in (P causes an increase in housing demand pci 


" ncl.iiled cMuimci.ilion <>t dir 'irp' m driivin.n dicv ic-ulis .iiid u aik> in suli 
sections is .ivail.ible (Hi icfjuesi. 
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fomumer —which in turn causes P'n to be bidden up.'^ The terms in 
(10b) are interpreted similarly. 

As the number of jurisdictions increases, equation (lOb) indicates 
that the effects of changes in G' on prices in other jurisdictions go to 
zero. As/ 00 , equation (10a) shows that any rise in G' will be offset 
by a rise in Pjf to keep the consumer-resident’s utility constant. 

The effects of changes in and G‘ on the population in jurisdiction 
1, /V‘, are derived by differentiation of (5) and by use of the results in 
(9) and (10). Performing the necessary differentiation and substitu¬ 
tions yields: 


dN‘ _ -(7 - iW'e ^ 
dt' /(l-t-0 

dN' _ -■y y (7 - l)f/f;(^’- tj) 

dG' [G JG JUsP,M^+l) 


(Ha) 

(Hb) 


These results are easily siimniai ized. An increase in the rax rate 
causes a decline in the jurisdiction’s population because the fall in the 
equilibrium net-of-tax price (see (9aJ) causes a reduction in housing 
supply. By contrast, a ri.se in government services makes the jurisdic¬ 
tion more attractive. As a consequence, the population of the jurisdic¬ 
tion will rise, except in the case where complementarity of housing 
and govetnment services cau,ses an increase in per capita housing 
demand large enough to offset the decline in per capita demand that 
results from the price increase exhibited in (10a). 

It should be emphasized that the comparative static results derived 
above do not depend on any particular objective function for the 
jurisdictions. They simply characterize an equilibrium in which iden¬ 
tical jui isdictions pursue identical Cournot-Nash strategies. They can 
thus be used to investigate governmental objectives such as maximi¬ 
zation of property values, the utility of the median voter, government 
expenditure, or “government profit." 


III. Profit-maximizing Government 

In this section we assume that each jurisdiction is controlled by a 
monopolist whose objective is to maximize governmental profit, 
defined here as the excess of tax revenue over government expendi¬ 
ture. We will neither defend nor deny the realism of this govern¬ 
mental objective; this choice of extreme objective is simply a conve¬ 
nient vehicle for determining whether jurisdictional competition can 


In theii study ol incidence, Polinsky and Rubinfcid (1978, p. 2.'57) find that this 
compleinentai ity nr substitutability is one factor determininjf how a change in govern- 
ineni services affects land prices and other variables. 
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eliminate the power of local governments to adopt policies which are 
not in the best interests of their residents. In practice, voting by 
residents might prevent such exercise of monopoly power. However, 
our purpose is to determine whether competition among jurisdictions 
is by itself sufficient to prevent the exercise of governmental power. 
Hence, we deliberately assume that the monopolist can set G and t 
without electoral approval. 

Profit to the government of the representative jurisdiction is:*^ 

n = lP„{t,G)H,\P„(t,G)] - N{t,G)G. (12) 

Superscripts have been dropped since all jurisdictions are identical. 

Differentiating (12) and using (1) and the comparative static prop¬ 
erties of the functions Pf/(I,G) and N(t,G), we can write the first-order 
conditions for profit maximization as: 



I0(\ + 0)1 ^ CiSI/ - 1)0] , 

7(0-7,) 7(1 + 1) 


(I.Sa) 


d-rr [[ U<;U - 1)6 , yS 

dG [ujp,M+t) jG(e--n) 

(j;5b) 

-I- 0) - G(0 - 7j) -I- (y - 1)| = 0. 

Canceling L, taking limits ,is J and rearranging the above, we 
obtain: 

tP,A,^G+^. (14a) 


i^[i+—— = 1 
Cfl P,A,( 1 + oj 


ll4b) 


From (14b) it is apparent that the marginal laie of siibsiiiution ol 
government services for numeraire goods departs Irom the ratio ol 
relative prices (which equals one). However, because ol the distortion 
that arises Irom the property t;ix on housing, even a wellare- 
maximizing governmetit would choose a level rrl public good suppiv 
such that the marginal late of substitution doe.s not equal the price 
ratio. 

From (14a), it is clear, utilike the Cournot result whim the numbei 
of firms is increased, that government monopolv profits are positise 
even when the number of jurisdictums is laige. lax leNenue per 
capita, tPi/fta, exceeds government spendmg pei' lapiia, (>, by a posi- 


'* We assume that the government d(x*s noi consume housing 
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live amount, related to the value of housing per capita, Pn^d, and to 
the housing-supply elasticity, 0. Thus competition among govern¬ 
ments encompassed hy hxed boundaties is not sulficient to eliminate 
those governments’ monopoly power. 

For an intuitive understanding of why jurisdictional competition 
does not eliminate governmental power, consider the following sim¬ 
plified version of the model. Suppose consumers get no utility Irom 
government services. Obviously, then, the bureaucratic monopolist 
will set C, = 0. The condition determining the profit-maximizing value 
oft is then (13a) with G = 0. Suppose fui ther that 6 and tj are constant 
over the relevant range. I’hen (1.3a) can be solved for t to yield: 


■ J((f - V) 

V) - ^ 

I'aking the limit ol (15) as t) it is seen that: 


(15) 


lim I = 

1 ’ » 


(9 ■ 


(1(3) 


riuis, when housing demand is perfetlly elastic, the prolit- 
maxnni/ing tax rate is determined solely l)y the elasticity of hoitsing 
supply independent ol the number ol jiirisditlions. With perfectly 
elastic housing demand, the price ol housing is exogenous, but bu¬ 
reaucratic moncjpolists can still profit by sharing in the rents of the 
lamlowners whose land is conlinetl to some given jurisdiction. Alter¬ 
natively, suppose that housing is in pcifectly elastic supply. I'hen, 
from (15): 


lim I 


-1 

V + I 


ify- 1 


(17) 


I 0 il / > 1. 

With only ()n<'jui isdit tion, the bureauc i atic monopolist can exploit 
the housing-demand elasticity. If there is more than one jurisdiction, 
housing can Ite expandetl indefinitely at constant cost in each jin is- 
diction, and the individual jurisdictions are unable to exjjloit the 
elasticity of the demand curve. 

Next, consider the effects of an increase in the number of jurisdic¬ 
tions in (15): 


lim I = 


2 

d' 


(18) 


(Comparing (16) and (18), we see that the effect of having very many 
jurisdictions is similar to the effect of having an infinitely clastic 
housing demand. Competition among jurisdictions prevents a bu¬ 
reaucratic monopolist from exploiting the demand elasticity but not 
the supply elasticity. 
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Finally, dilfercnliating / wiih icspt-tt to J in (13) reveals tliai 



(6> - T>)(1 + 0 ) 

0[J(0 - 7)) ~ + 6)Y 


( 19 ) 


An inn ease in the number of jurisdit (ions does nu^notonirally de¬ 
crease the tax rate. 

While these lesulis are surprising, they are easily explained. 1 he 
goveirnnent of each juiisdiciion is given the exclusive right to tax the 
property in that jurisdiction. Since boundaries are fixed, a landowner 
cannot move his land to another jurisdiction by a redefinition of 
boundaries. Fixity of boundaries pi events the tax on land from being 
competed away. Jurisdictions lompeie for residents, and competition 
among a large numbei of jurisdictions prevents exploitation of the 
elasticity of fiousing demand. Juiisilictions in our model beie do not 
compete for land; conseipienlly, tfie existence of a large number of 
jurisdictions does not prevent exploitation of' the elasticity of housing 
supply. 

Although the tax exploits only the elasticity of supply when there 
arc many jui isdictions, one shotild not conclude that the lax merely 
redistributes wealth from landovsiiei s to bineaiu rats. I'he tax reducts 
the unit price of housing received by suppliers, thereby reducing the 
supply of housing lelaiive to the no-tax e(]uiiibi aim. .\s a result, the 
gross-of-tax (nice of housing must be higher aiul per c<ipita housing 
consumption lowei than in the no-ia\ equilibiium. I'he only excep¬ 
tion would be if housing supply were perfectly inelastic. For this case, 
the tax would not affet t housing production or cisnsumpiion. 

Our results can be illustrated geometrically if we lonsidei .i sim¬ 
plified veision of the model in whkh6' is hxed at /eio for alljuusdic- 
tions. Each curve in the tight-hand portion of figure 1 shows tombi- 
nations of net-of-iax housing prices and lax tales that gcnei.iie the 
same gross-of-tax pi ice and hence the s.ime level of uiilitv. Fhe 
further to the northeast, the lowei is the utility level, l.et F" represent 
the highest utility att.iiiiable, that level which would occui it ( = (“ = 0 
in all jurisdictions and competitive mai ket forces detei mined ,i nei- 
of-tax price of E*?,. I'he housing m.irket is depicted m the lef t-hand 
diagram. Recall that the housing-suppiv function per unit of land,/i,, 
is independent of the number of jin isdictions and of the lax rate. Fhe 
housing-demand function per unit of laiul is a product of the housing 
demand per person,/irf[PH(l + OXH. and the population density, 5. 
Hence each jurisdiction exhibits the same housing prite, Ph. and the 
same population density, S”. 1 he zero lax equilihiiuin in the housing 
market is shown as point Z. 

Now consitler the situation in which .ill governments set the s.nne 
positive tax rate. t'. We show that this loweis utility .md results in .i 
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nei-ot-lax price lower than P?,. If the net-of-tax price were to decline 
sufficiently to reestablish the original utility level, at point W, the 
housing-demand function would be unaltered at 8 '%,/[Ph{ 1 + t®)]. But 
with the resulting lower net-of-tax price and an unaltered demand, 
there would be a housing shortage. With a tax rate off*, the housing 
market could clear only if the net-of-tax price rose above the level at 
point W. This l equires an increa.se in the gross-of-tax price, hence a 
deciease in utility below the T® level. If P// did not fall, the housing- 
demand curve (not shown) would shift downward since P?,(l -I- f') > 
P%{\ -t- f") and 8 is unchanged at 8®. This would create a housing 
surplus. Thus, the housing market will clear only ifP// falls below P?,, 
say, to P'h. Cotisequently, a feasible equilibrium with tax rate f* is 
showti as point X in the housing market and as point Y in the utility 
space. Finally, the tax base per unit of land, with f = <*, is found as the 
area OP\,Xh\. 

Now, suppose the points X and Y represent the equilibrium when 
there are many jurisdictions. We illu.sirate why it would not be in the 
interest of any of these jurisdictions to unilaterally depart from this 
equilibrium. We then illustrate why it would be in the interest of a 
monopoly jurisdiction to choo.se a higher tax rate. 

Suppose a competitive jurisdiction, say jurisdiction 1, were to uni¬ 
laterally choose a tax rale, say, (*><*.'■* Given residential mobility, the 
utility level cannot differ across jurisdictions. Thus, the gross-of-tax 
price in jurisdiction 1, P]^\ + l^), would equal P)^\ -I- <*), which 
implies P% < P\,. Con.sequently, fewer housing units would be 
supplied in jurisdiction 1. To restore equilibrium in the housing 


We assume in the diagrainmatic exposition that toinpctition among jurisdictions 
does not forte all jurisdictions to offer the same tax-service package. This issue is 
investigated in Sec. IV. 
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market, it ■would be necessary for some residents to move to other 
jurisdictions, lowering the population density in jurisdiction 1, say, to 
S' < S". The new hypothetical equilibrium for jurisdiction 1 is illus¬ 
trated by points S and T. Since it contains but a small fraction of the 
area’s land and population, migration out of jurisdiction 1 would have 
no effect on the remaining jurisdictions. Thus, the hypothetical 
equilibrium for the remaining jurisdictions remains at points X and Y. 

By increasing its tax rate, jurisdiction 1 decreases its tax base per 
unit of land to OP%Shi. Should the percentage decline in the tax base 
be greater than the percentage increase in the lax rate (as suggested 
by fig. 1), it would not be in thejui isdiction’s interest to increase its tax 
rate. Similarly, were any one jurisdiction to reduce its tax rate below 

the resulting increase in the tax base would be more than offset by 
the decline in the tax rate. Consequently, points X and Y describe the 
equilibrium for competitive jurisdictions, because there are no incen¬ 
tives for any of the economic agents, governments, residents, or 
housing suppliers to alter iheii behavior. Note that, in equilibrium, 
there are tax revenues colleiied by each jurisdiciion, even when G = 
0; jurisdictional competition is not sufficient to eliminate government 
profit. 

Note that any jurisdiction’s incentive to change its lax rate, and 
hence the equilibrium tax rate, depends on th«. elasticity ot the h, 
function, fhe lower is this elasticity, the greater is the incentive to 
raise the tax rate. This result can be indicated by considering the case 
in which is completely inelastic; in particular, assume that h, rises 
vertically through point X. If anv jurisdiciion were to increase iis tax 
rate, any decline in housing demand would not cause a loss of housing 
units; hence the jurisdiction’s tax revenue would increase as long as 
the percentage increase in the tax rate, Xt/t, is greater than the 
percentage decline in the net-of-tax price. \Pn/P «. .Since gross-ol-ta.x 
price is constant, {Ph - XT//)(I + t + Xt) = Pn{\ + t), aiul con¬ 
sequently ^Pn/PH ~ Xt/(I -t- t + Xt) < Xt/t. A similar incentive to 
increase t would face each jurisdiciion, and t would be coiuinuouslv 
forced higher. Bv altering the suppK curve through point X, we can 
easily show that the incentive to raise / deci eases with an increase in 
the supply elasticity and that a sulficient increase in elasticity cieaies 
an incentive to lower the competitive ecpiilibiium lax rate. 

Next, suppose that the metropolitan area has but one monopoh 
juristficlion. Also assume that the tax rate was initially set at /' and that 
the housing market and utilitv conhgurations were initiallv desenbed 
by points X and Y. If the monopolist were to choose a higher tax rate, 
such as this would increase the tax revenue per unit of land, unlike 
the situation for the single competitive juri.sdiction, because lesidents 
cannot escape the monopolist by moving to other jurisdic tioii'. Popu- 
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lation density throughout the monopolist’s jurisdiction would he un- 
thanged at S". An increase in the tax rate to has two effects. First, 
assuming momentarily that the net-of-tax price is unchanged at P]f, 
the per capita demand for housing declines, since P]i( 1 + 1^)>P],{\ + 
t'). But with population density fixed at 8“, this implies an excess 
supply of housing. Consequently, the second effect is that the tiet- 
oF-tax price must tall, say toFJ,. The resulting situation is depicted as 
points M and N. Note, though, that the decline in the monopolist’s tax 
base per unit ot land, to OP%Mhi, is substantially smaller tliati tbe 
decline previously indicated for a competitive jurisdiction con¬ 
templating tbe same tax rate increase, t hus, tbe monopoly jurisdic¬ 
tion would find it profitable to increase its tax rale, even though a 
competitive jirrisdiciion would not. A.ssummg tax revenue changes 
inonotmiically with (he number of jurisriit lions, it follows that tax 
revenue ()er unit of land, hence t<»tal lax revenue, is inversely relaled 
In the number of jurisdictions. With C, fixed, govei iiment profit is also 
inversely rehited to (he number of jurisdictions. 

IV. Differing Objectives among Governments 

As we indicated in the 1 nitoduclion, data for local jut isdiciions have 
been used extensively lor empirical studies of mirdels of government. 

1 be implicit presumption of these studies is that com|retition among 
jurisdictions does not prevent different governments from pursuing 
different goals or prevent them from olfering different tax and 
seta ice packages. In this .section, we demon.strate that this presump¬ 
tion is correct. In so doing, we provide a theoretical jusiihcatif)n lor 
viewing local jtii isdic lions as natural laboratories for studying models 
of government. At the saiiie time, we demonsiraie that interde¬ 
pendence among local jurisdictions should not be ignored when such 
studies ate ( oiulucted. Finally, tesults in this sec tion make it c lear that 
our result in the pieceding section, ilial competition does not pievcnt 
governmental exerc ise of monopoly powei, is not an ailifact of our 
assumjjiion of homogeneous govei mnenlal oirjectives. 

As in the jneceding section, we assume that an ecpiilibiium exists, 
and we characteri/,e the properties of that ecjuilibrium as the number 
of jurisdictions becomes large. It is first necessary to derive tbe com¬ 
parative static properties of an ecjuilibrium in which lax and service 
packages differ across jurisdiciic^ns. Conditions (1) through (7) still 
obtain. I'he comjraraiive static resjron.ses of |)rices in all jurisdicticans 
to a change in the t;tx nite in an arbitrarily chosen Jurisdiction can be 
derived from eejuations (8). I'here are / eejuations in (8) from which 
the / expressions dPyOt' can he derived. 'I'he derivation cliffeis from 
that employed in Section 11 only in lh;it jurisdictions are not assumed 
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to adopt idetidcal lax-scivict' packages in equilibi iuin. By similarly 
pertiiibing G in some arbitrarily chosen jiiiisdiction, say, I, we can 
derive the 7 expressions ior aPjjldC,'. rhiis, just as in Seciion II, 
comparative static responses ol |)rice in each jiii isdit tion to changes in 
its rjwn fiscal variables and to changes in other jurisdictions' tax rates 
and service levels lan be derived. 

By investigating the limiting pro|)enies ol these cermparative statics, 
we find that, when the number ol jiiiisdiciions in a particular subarea 
becomes veiy large, the level ol utility of' residents ol any til these 
jurisdictions is invariant to its choice of lax rale or level ot go\ eminent 
services. The .imount ol land in any of these jurisdictions is very small 
relative to both the total land area and to tlie laiul area of jui isdiciions 
having the same amenity level. Conse<]uenily, the number of residents 
migrating into or out of any one ol these jurisdictions in response to a 
change in its tax rale or let el of goternineni services is too small to 
affect the price of housing elsewhere. Hence, utility docs not change 
elsewhere. Rather, the migration induces an adjustment of the hous¬ 
ing price within the jurisdiction and that adjustment ensures that the 
level of utility within the jurisdiction continues to ecpial the utility level 
available elsewhere. By contrast, if a jurisdiction has a nonnegligible 
share of the land in the metropolitan area, migration induced b\ a 
change in its fiscal variables is sufficient to change housing prices and 
utility ill all jurisdictions. 

While an intlividual government cannot affect the etjuilibriiiin level 
of Liiililv of its residents when the number of junsdiciioiis is large, tax 
rates and service levels ate not similarly constrained. Wbethet 
changes in these v.iriablcs are desirable depends upon the govern- 
ment objectives pursued, but it is clear that local goxeiinnenis have 
latitude in c hoosing these variables. Put dif fei enllv. the “iiiarkei" lor 
local imlilic gocxls does not force the government to act as a "tax i .ite 
taker.This is illustrated in figure 1. Su))pose th.u there are a laige 
number of jurisdictions each ol which iKcupies a sm.iil fraction of the 
land in the metiopolitan area. Let V be the level of utilitv lealized in 
ecjuilibrium in all jurisdictions. Oonsider two of these )uristtittions .ind 
assume, for simplicity ot illtistiation, ifiat bcitb oiler the same level ot 
governntent .services, G. 1 he jut isdiciions of fer dilleiing tax rates, /' 
and but residents of both jurisdictions enjoy the same level of 
utility, V (.shown by the equilibrium points Y and 7j, and the same 

K,U h ot llic lll.lIIV |IU Isciu lions I.KC'S llu' COMSIl.llMIs .ts (lest I llx'cl l>\ (dl ji tit ) — 

(dPifhlC.') = 0 and Iw ecjc], (1). (2). .iiicl (l> II one. .nut iHiti.ips onh one, ol ihesc 
junsclirlions should Iw .i piohl-m.iMini/mt' )iiuvli(lion, .is deliiied m ihc piesions 
section, it is relaliielv e,is\ lo show dial suth .t luiivimion would tluMise iis tisi.il 
variahles such ih.il ((indiiions (11) .no .iii.iincsl Hence, choke ol 1 ,ind (> Ijs piohl 
inaxiini/.uioii is nol iiuonsisieni w’lih lutisdiciion.il conipeinion 
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gross-of-tax price as all other residents of the metropolitan area. In 
this equilibrium, jurisdiction 2 has a higher tax rate, a lower net-of-tax 
price, and a smaller population than Jurisdiction 1. 

By holding the tax rate constant, jurisdictions offering different 
levels of government services can be illustrated by a diagram similar to 
that in figure I. The diagram would differ only in that G would 
replace t on the right-hand quadrant, and indifference curves would 
slope upward. Jurisdictions offering high levels of G would attract 
more residents than jurisdictions offering low levels ofG. As a result, 
the price of housing would be higher in jurisdictions offering higher 
levels of government services. Competition will require that utility be 
the same in all jurisdictions, but equality of utility across jurisdictions 
docs not require equality of tax rates or equality of levels of govern¬ 
ment services. 

The finding that, in equilibrium, utility of residents of a jurisdiction 
is independent of its government’s fiscal decisions is a startling result, 
one which highlights an important concepUial distinction between 
residents of a jurisdiction and its landowners. For residents who rent 
housing, the choices ol' tax rate and government spending level are 
matters of complete indifference in equilibrium. Housing prices will 
adjust to maintain their equilibrium utility level. By contrast, the 
wealth of landowners is directly affected by the choice of tax rates and 
level of gftvernment service. A resident landowner would clearly 
suppoit a governmental objective of property value maximization. 
Such an objective would maximize the wealth of the resident as 
landowner, and, given his mobility, it would not diminish his utility as 
a consumer of govei innent services and housing. 

The results in this section also have implications for tests of the 
I’iebout hypothesis. In proposing his test of the 'fiebout hypothesis, 
Oates (1969) argued that differences in tax rates or in the level of 
public services |jrovided would be capitalized in housing prices. Kdel 
and Sclar (1974) argued precisely the opposite—that in a long-run 
T'iebout equilibrium, tax rates and government spending levels would 
be uncoirelated with housing prices. In the model developed here, 
Oates’s interpretation is clearly correct. In equilibrium, differences in 
tax rates and levels of government services will result in differences in 
housing prices across jurisdictions. This result does not depend on the 
existence of a large number of jurisdictions, nor does it depend on the 
objectives being pursued by individual governments.'® 


li does, however, depend on die ;i.ssuniption that governments do not directly 
intervene into housing markets (e.g., via /oning) to induce individuals to consume more 
housing than they would otherwise choose. Sec Epple (1980) for further di,scussion of 
this issue. 
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The test proposed by Oates may be interpreted as a test ot equilib¬ 
rium condition (4). In equilibrium, with homogeneous residents the 
maximum utility level Individuals can achieve will be the same 
everywhere. Denote this utility level P. Setting F[G'/’1,{1 t')] = (/ 

and solving for P}/, we obtain an equation of the form estimated by 
Oates: 


P‘h = 


f(PXP) 
(1 +/') 


( 20 ) 


In any given equilibrium, V is the same across jurisdictions, and hence 
it is impounded in the constant term in a regression equation. I'o 
empirically test this equation, it is ncce.ssary that one have obser¬ 
vations on at least two jurisdictions. However, the equation holds 
regardless of the number of Jurisdictions. The above result and re¬ 
sults in our previous paper (Epple, Zelenitz, and Visscher 1978) 
indicate the crucial role of assumptions about movability of jurisdic¬ 
tional boundaries. In our earlier paper, we argued that housing prices 
must necessarily be equal across jurisdictional boundaries when 
boundaries can he costlessly redrawn. By that argument, the hypothe¬ 
sis of Edel and Sclar is cot rect, but Oates’s hypothesis is correct when 
boundaries are fixed exogenously.’’ 

When the number of jurisdictions is large, changes by any one 
jurisdiction have no effect on V. Ihus, when the number of jurisdic¬ 
tions is large, differentiation of (20) provides equations that correctly 
indicate the marginal change in the net-of-tax price of housing which 
will compensate residents when their jurisdiction changes its tax rate 
or service level. When the number of jurisflictions is small, a change 
by any one jurisdiction affects V. In that case, the appropriate expres¬ 
sion for the price derivatives cannot be deduced from equation (20), 
and, as a result, the compensating alteration in housing price caused 
by a change in tax rate or service level cannot be deduced trom (.20). 


V. Jurisdictions Differentiated by Size and laxration 

In preceding sections we considered models in which land at all 
locations is identical and jurisdictions are identical in size. In this 


Nole again ihal the lest pioposcd by ttales is a lesi of eipiililirmm Londuiou yl). U 
individuals are costlessly mobile .iinongjiirisdiclions, it mdiMdiials have full nifoian.i- 
tion concerning jurisdictions' aliernalivc fiscal cnvirontnenls, and it juriMliciions do not 
restrict resideius' consumption bundles, iben. as Oates t Ie.ii Iv s.ivv, equihbninn tondi- 
tion (4) obtains. This is not, however, a test ol lielMiiii's li\[x>lhesis the l.nier is a 
normative statement, concerning [he achieveineni of an .iiiKuini ol the public gixKi, . 
which maximizes individuals' uiiliues. As is clear Iroin the .tn.ih sis m ihe ji.ijHo, there is 
no assurance that the uulity-inaxiniizing level ot (■' is attained tor aii\ mdivulual 
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section we f^enenilize the model to pct iiiit Jut isdit lions to dine/- 
both size and degree ol desirability. We argue that our preccdinff 
resijJfs concerning tompefiiion bold when jurisdictioris are dilfer- 
entiated by desirability of Joeaiioii. We also discuss how the preseim^ 
of a large number of small jurisdictions in one part of the metropoli¬ 
tan at ea affects the monopoly power of a large jurisdiction in the 
same metropolitan area. Finally, we discuss the effects of permitting 
new jurisdictions to form at the urban fringe. 

Since the method of analysis recjuiretl here is similar t(} that em¬ 
ployed in previous sections, our discussion is presented without 
benefit ol explicit mathematical arguinent or proof, technical details 
are available on retjuesl. 

To investigate the effects of differentiation ol' locations, we assume 
that the total land area /. is divided into A' subareas or locations 
indexed by k. Each subarea has amenity (or accessibility) in amount 
Alt. Variable A is assumed to enter the utility function directly. This 
specification permits investigation of the effects of varying either the 
number of juiisdictions at each location or the number of distinct 
locations. 

F.tjuilibrium conditions analogous to (2) through (7) continue to 
apply. Dillerences are that amenity level now appears in the utility 
and demand functions. In addition, supply and demand for housing 
must Ite eciuilibrated at each location. Comparative static results are 
deliver! by the same procedure used to obtain equations (9), (10), and 
(11), and governmental objectives are studied using the approach 
employed in .Sections III and IV. 

The implications of incteasing the miniber of jurisiliciions offering 
a given amenity level are fouiul to be similar to those derived in 
-Section III. I he presence of a huge number of small juiisdictions 
offering the same amenity level prevents any one of these jurisdic¬ 
tions from exploiting the elasticity of housing demand but not the 
elasticity of housing supply. I'he profii-maximi/.ing tax rate for these 
jurisdictions converges to the inverse of the housing-supply elasticity, 
a result analogous to tliat in (18), 'Fax revenue per unit of land will be 
higher in locations with a high because the price of housing will be 
higher at sucli locations. This is a consequence of the equilibrium 
condition ret|uiring utility to be constant across Jurisdictions. 

The presence of a large number of small jurisdictions in one part of 
the metropolitan area, however, does not prevent a large jurisdiction 
in another pait of the metropolitan area from exploiting both the 
elasticity of housing demand and supply—a result analogous to that 
in (15). 

To interpret these results, think of two locations in a metropolitan 
region, one the central city and the other the suburban area. The 
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above results imply that a large numliei ol jui isdiclions in the subuv 
ban region are suHicient lo prevent individual subtiiijs Irom exuU it 
ing the elasticity of housing demand. But the relatively large ntip.., 
tion ol metropolitan land area oaupied by the central cits enables its 
bureaucratic monopolist to exploit the elasticity „t housing .lemand 
even if there are a large number of juiisdictions in tbe suburbarl 
region. 

Instead of hypothesizing an increase in the number of jurischc tious 
offering a given amenity level, we might argue that each juiisdiction 
encompasses land with uni<iue attributes. The implications of in¬ 
creasing the numbei of distinct jurisdictions occupying a hxed land 
area are found to lie the same as those derived in Section 111. 1 he 
presence of a large number of small jurisdictions, even if eacb oilers a 
unique amenity, is siifbcieiu to prevent any jurisdiction from 
exploiting the elasticity of housing cleniaiid but not the elasticity of 
housing supply. 

Finally, it might he argued that our results in Section HI arise 
because we assumed a fixed amount of land avail.ible for urban use. 
I bis conjecture is incorrect. To demonstrate this point, suppose that 
locaticjns ate indexed in order of decreasing desirabilitv, so that juris¬ 
diction K has the lowest amenity level of the existing jurisdictions. 
I.anct with amenity level .1^ + , inav then be thought of as the urban 
fringe, which we assume to be in perfectly elastic supply. With these 
assumptions, the utility that can be achieved by choosing land with 
arnetiity levcd is constant at, say, C. 

There are two possible ccpiilibria. All individuals may find the 
existing Aw jurisdictions more attractive than the in ban fringe. In that 
case, the urban fringe is uninhabited anti all results arc unchanged. 
Alternatively, some individuals may live on the urban fringe. .Since all 
individuals must achieve the same level of lUilitv in ctjuilibi inin. the 
ulilit) level throughout the metropohian area is 0. However, as dem¬ 
onstrated in Sections III and IV, an cntrenclicd government that 
must provide its residents an exogenously specified level of utility can 
still profit by exploiting the elasticity oi housing supplv. I be existence 
of juiisdictions at the extensive iiiaigm iUk-s not eliminate land rents 
in infr.amarginal jurisdictions, nor does it prevent ihe taxation of 
ihose rents, rherefore, reg.irdless ol which outcome occurs, jurisdic¬ 
tional competition, even when residents can relocate on the uriwn 
fringe, cannot prevent an entrenched goveininent from exercising 
monopoly pc^wer. 

In summary, by the introduction of heterogeneous in ban lix aiions, 
we demonstrate new results .uicl establish that tlic rc'siilis ol Sections 
HI and IV hold in a mucli more geiiciai setting. Whether the focus tx* 
on one urbanized area in isolation or on all urbani/ed aieas within a 
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nation, whenever rents exist, jurisdictional competition does no), pre¬ 
vent an entrenched government from expropriating part of those 
rents. 

VI. Conclusions 

I'he major finding of this paper is that competition among many local 
jurisdictions is not sufficient to prevent local governments from exer¬ 
cising monopoly power. While we have derived this finding in a model 
in which government revenues are raised via property taxes, the 
result also follows if communities are empowered to levy a sales tax 
(i.e., a tax on the numeraire commodity), an income tax, or a combi¬ 
nation of these three taxes.'* 

The feature of our model that is primarily responsible for this 
result is the assumption of fixed jiiri.sdictional boundaries.'" A gov¬ 
ernment given taxing powers in a jurisdiction with fixed boundaries 
can exploit the immobility of land and share in the rents accruing to 
that land. Mobility of residents across a large number of jurisdictions 
can prevent individual governments from exploiting the elasticity of 
housing demand but not the elasticity of housing supply. 

These results vindicate researchers seeking to develop a positive 
political theory of local government behavior. Jurisdictional competi¬ 
tion does not predetermine the outcome; Tiebout does need politics. 
On the other hand, these results highlight the importance of consid¬ 
ering the economic as well as the political environment of local gov¬ 
ernments. As shown in Section IV, competition among a large 
number of jurisdictions establishes ettnstraints on the choices made by 
individual governments. 8uch constraints should not be ignored in 
modeling decision making by local governments. 
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Monetary Expansion and Real Exchange Rate 
Dynamics 
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Hehteiti Unn'ersity oj Jerusalem 


This paper exaiiiiiics the effect of a c hange in the rate of monetary 
expansion on the tlynatnics of the real exchange rate under fully 
(fexihle domestic prices and a floating exchange rate. The model is 
similar in structure to an earlier paper by Cairo and Rodrigues, the 
mam dif ference being that in our model consumers have an infinite 
planning hon/on and long-iun pel feet foresight. One of the mam 
findings is I hat an increase in the rale of monetary expansion has the 
impact effect of creating a real appreciation of the exchange rate and 
a deterioration of llie balance of payments, contrary to the results of 
Cairo and Rodrigue/. 


I. Introduction 

This paper analyzes the effects of increasing the rate of exp;nisioii ol 
ncrniiiial money, /c, on the rc;il exchange rate, e, defined as the ratio ot 
the nominal exchange rate, It, to tlie price of home goods for an 
economy operating tinder a floating exchange rate and fully fiexihle 
price level for lioine goods. 

rhis issue is analyzed in a recent paper by Cairo and Rodriguez. 
(1977) (hereaf ter CR) in tfic context of a small iwo-sccior economy 
(consisting of traded goods and fioine gtjods) whose domestic citizens 
hold hotfi domestic and foreign money, which are the only assets. 
I'heir conclusion is that an increase in fi will increase e, that is, cause a 
real tievaltialion. Conse<|ucnlly, the impact effect of an increase in 
monetary expansion is to induce an increase in the protiuction ol 

1 am iDdcbfccJ lo Jacob Frenkel for poiriiing out an error m an eailicr version of ihe 
paper and for various coniincnu which heljH*d lo improve ihe exposition. 

[JouTTuit oj HolUwat ttotumy, vtW #19, no I>| 
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traded goods and to cause an iiiiproveinent in the balance of pay¬ 
ments. 

In the present paper we reexamine the pioblcm within the 
framework of utility-maximizing behavior and long-run perfect 
foresight using an approach similar to that of Sidrauski (1967) and 
Brock (1974). It is shown that a central property of the (^alvo- 
Rodriguez model, namely, the long-run constancy of total assets, no 
longer holds in the present model. As a result, the impact effect of p, 
tends to result, under our assumptions, in a real appreciation of the 
exchange rate, contrary to the CR result. In the final section of the 
paper the two models are compared in some detail. 


II. The CR Model and Results 

It will he usef ul to stal l with a brief description of the CR model which 
deals with twet commodities—traded goods (/') and home goods (H ). 
The demand functions are 

c-r = Cj{e,a); r„ = C:,i(e,a), (1) 

where e — {EIP„) is the real exchange rate and a denotes real assets, so 
that 


<7 = m + ef-, m 



( 2 ) 


where A/ is nominal domestic monev and j is the stock of foreign 
currenev held b\ domestic ciii/ens. In a lull-einpl(t\meni economv 
output depends only on c, .so that 

= QHf): ('') 

Since the home-goods market is afwass cleared ,is a result of llexihle 
prices, we ha%e 

= Q,/(c). (4) 

Bv using (1) and (3) we can .tlso express the hal.mce of pasments 

^ = Qiie) - ( 5 ) 


as a function of c and n. 

Note that h\ (2) and (4) weean expiessct asa function of r.a = If'(c), 
W < 0, so that using (5) we oht.iin 

/ = (f>(c); <f)' ^ 6 

by the tisual assumptions. T he lund.imental le.mne of this lelaiion- 
ship, from the point of\iew of the suhsccpieiit .inahsis, is that it holds 
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independently of p.. This implies, of course, that the steady-state 
value of c, derived from = 0, is also independent of /x and hence a 

constant. 

The basic relationship of the assets market is 

where E = (dE/dl)IE (we shall useX to denote [dX/dt]/X), and L' < 0. 
Since E = e + tt (tt = Ph) and m = p, — ir, we can express (7) as 

'?=h(^) + m-p. (8) 

However, since a and e are functionally related, we can express wi as a 
funttion of c and/. By virtue of (6) and (8) this implies thatc can be 
expressed as a function of e, /, and p, say 

f (9) 

where p is dynamically constant. The assumption of perfect foresight 
permits identiheation of “actual” and “expected” rates of change. 

Equations (6) and (9) form a dynamic system in e and/ with p as a 
parametet. As is usual in monetary-growth models with consistent 
expectations, the dynamic system around the steady state corresponds 
to a “saddle point” solution. Along the saddle path of the linearized 
system / follows a linear first-order dif ferential equation, which in 
turn implies that/ converges monotonically to the steady-state solu¬ 
tion. 

Suppose we start at a steady state and increase p. Let us assume that 
this causes an increase in the steady-state value of/ (denoted /). We 
know, however, that in the course of adjustment to the new equilib¬ 
rium J must increase monotonically at a diminishing rate, as it should 
be for a linear dilferential equation.' Hence if/„ (the initial value of 
[djldt]) was zero before the change in p it must become positive as p 
and/increase. However, since/and e are functionally related by (6), 
the increa.se in J„ must be a.s.stxriated with an increase in e„, which is the 
result proved by CR. In economic terms, in order to support the 
acctjinulation of/through the balance of payments we need an initial 
real devaluation. This argument can be summarized by 

' T ile equation lor / is one ot the form/(() — f + X < 0, and b — to ~ f- Hence 
/ = and j„ - Kb. After the increase in /a we have/„ = X(/„ -/i) > 0 since X < 0 
and /, > /„. 
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Since the steady-state value of'e is independent of /u and since there 
is a unique relationship between a and e that is consistent with home- 
goods equilibrium, it follows that the steady-state value of assets is also 
independent of the rate of monetary expansion. I'he higher steady- 
state rate of depreciation that is associated with the higher implies 
that in the new steady state a higher proportion of the portfolio will be 
held in the form of foreign exchange so that/is raised. Consequently, 
/„ and e„ increase. 

The constancy of the total value of assets across steady states is 
central to the CR thesis, ft may be noted, however, that an increase in 
/A represents in some sense a reduction in the return on assets in 
general, which may lead in a different model to reduction in their full 
equilibrium value. In order to focus attention on this aspect we shall 
formulate a long-run perfect-foresight model where the relation be¬ 
tween the rate of return on assets will have to be balanced against the 
subjective rate of time preference, as in optimal growth models. 


III. The Long-Run Perfect-Foresight (LRPF) Model 

We assume that the supply side is the same as in the CR model. The 
difference is in the treatment of demand for commodities and assets. 
We wish to base the consumers’ behavior on utility analysis. In order 
to account for differential demand for m and / we shall follow the 
procedure of including the liquidity services provided by both assets 
in the utility function (in this respect, we extend the Sidrauski [1967] 
approach to two assets). 

An important assumption that we shall use is that the instantaneous 
utility f unction is separable^ as far as consumption of ordinary com¬ 
modities and liquidity services is concerned. It is this assumption 
which will enable us eventually to derive a relationship between / and e 
which is independent of fi, as in (6). The utility integral to be 
maximized by the representative consumer can then be written as^ 

f c *'[«(f„,C7.) + Ffw.cyjjrft, (11) 

0 

where u and F are strictly concave functions and 6 > 0. 

As in CR, there are only two assets —m and/. There are no bonds 
and hence no borrowing. I he only way the consumer can spend in 
excess of his income is by running down his stocks of m and/ (“bor- 


* t he sepaiabilitv m (II) is sliongt-r ihaii ne<cssaiv. VVhai is iciiuired is ih.ii the 
marginal rate of .substituuon between Cr and C„ in u( ) be independent ol the assets 
(i.e., separabiUiy in the Leoiuicf seiiNo). 

’K( ) can be expre.ssed more gener.illy as withoui allecting our results 
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rowing from himself”), so that there is an upper bound on his current 
consumption possibilities. This assumption is, in fact, required for the 
existence of a solution. 

The income receipts consist of the value of output^ = q,, + eqr and 
of transfer payments by the governments. We may then express the 
accumulation of assets (fi) as 

d = q + s ~ fhn - (fl - K)ef - (c„ + ecj), (12) 

where Hm and (II - E)ef represent “depreciation” terms. Cleat ly, when 
{E — IT) is positive we may regard (E — \\)ef, or e(f/), as an “income” 
term. In fact, in the present model e often plays a similar role to the 
“real rate of return on capital” in monetary-growth models. Using the 
definition a = m + ef wc may rewrite (12) as 

d = q + .s E ea - (e + !!)« - (r„ E ecr), (13) 

where e represents the “real interest rate” and (c -t- 11) represents the 
nominal one. 

The optimization problem is then to maximize (1 I) subject U) (2). 
(13), and initial assets («„). It should be noted that from the point of 
view of the maximizing individual e is a market parameter and the two 
assets are lully interchangeable at any instant. From the Kuler condi¬ 
tions we obtain 


Fm - y, . - 

+11. 

(14) 

u„ 


Ur 

— 1 — = e, 


(15) 

"// 


— ^ e - o — 

9 

(16) 


U,i 


where subscripts of u and V denote taking partial derivatives with 
respect to the indicated variables (e.g., u-r = fdu/dfT-J, Kf = [dE/d{ef)], 
etc.). 

We may then rewrite (14) as 

y - 

e = (-n) -I- ^^ , 

which means that along the optimal path the real rate of return on F 
must equal the real rate of return on m (i.e., —II) plus a lk{uidity 
premium. (Condition (16) is the usual condition of optimal growth 
models where the role of the “real rate of return on capital” is now 
taken by [(V^^lun) + e], which equals the real market rate of return on 
F plus the nonpecuniary liquidity element. 



MONKTARY tXI’ANSION 


I 


Lei us examine first wfiether in the present model we may still 
derive an autonomous relationship between /and e. Since in equilib¬ 
rium C,i = Qji(e), we can write (15) as 

-—i— = 

rl 

which can be solved to yield — Cr(^). From the balance of payments 
we then have 


y = (27-(«’) - CV(c) = 'F(c), (18) 

which proves that we still have a functional relationship betweeiiy anti 
c which is independent of /a. 

The si^n of is generally ambiguous. IT. however, w'e take 

the example of a Cobb-Douglas type of utility runttion, then (Uj/u„) = 
Pifulrr) (ft a constant), and fience by (18) Cr = l/fQ/yfcl/c] so that 
[dCfieyde] < 0. The same is true fiir any honiothetic u{ -) function. In 
any case, whatever the sign of the latter dcrivatne, we may a.ssume 
that the sign ol 'F' in (18) is dominated by Qflc), so that 'F' > 0 (clearly 
ir [dC-idde] < 0 this tendency is only strengthened). Thus the prop¬ 
erties of 'F(c) are basically the same as those ol 

Sitice in eciuilibriuin Ct t an be expre.ssed as a 1 mution ol e alone and 
since the same is true of Cn< follows that Un aixf Uj aie, in eqiiilib- 
riutn, functions of c only, say Uji((') and Uf(e). 1 his implies that (16) can 
be expressed as 

e ^ 0(ejn,j'). (19) 


By a similar consideration, and using U = n - (m/ni), we ran express 
(14) asG(w,w,c,(’,/,M) = 9. Substituting (19) m the foregoing relation¬ 
ship and solving tor lii we obtain 

m = -ylc.w./.fi) 


Equations (18), (19), and (20) lorin the dynamic system lore, m. and/ 
for the economy under cotiipetitice e(]iiilibrmin. 

When we consider the s\steni at some ‘'imtiar' point ol lime, then/,, 
is moinentarilv lixed. However, the iiiUMi \alues e,, and w„.ue lice to 
be detei mined by the market since all prices are llexilile. We assume 
that in the LRPF (long-run pei lect-loresight) niodel e„ .md ni„ ai e 
detei mined uniquely so as to set the economv on the 'saddle path. 

In terms of the lineal i/.ed scsteiii around the steacb state, we can 
express the solution for anv ol oui xaiiables. ,sa\ lor/, in the lorm 


J(l} =y + /hc*'' + I'f''- + <2 ’ ’ 

wheie the //s are determined bv the paiameieis ot the system .md bv 
the initial values,y„. >n,„ and c„. 1 he fact that c.. and ii.„ are determined 
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SO as to set the system on the stable trajectory leading to the steady 
state means that two of the three \’s are positive (or have positive real 
parts in the case of complex roots), and therefore their corresponding 
bi’s have to be set equal to zero by appropriate values of and mo- 

Furthermore, the fact that the “saddle path” is stable means that the 
remaining X (say Xi) is negative. Since on the saddle path 

/(/) =/ + i,e*i', (22) 

wc see thatft] is determined by/(0) —/. 'This leads to the fundamental 
conclusion that the convergence of/(/) is monotonic, as in the CR 
model. 

We turn now to consider the effect of an increase in p, on the real 
exchange rate (starting from a steady-state position). Since in a steady 
statey = 0, we see immediately from (18) that e is uniquely deter¬ 
mined, since 'F' > 0. 'I'his corresponds, of course, to the slope-of- 
transformation curve between// and T at the point of tangency with 
the indifference curves relating to the two commodities. Thus /a has 
no effect on the long-run real exchange rate. 

In order to analyze the impact effectof /xone„ we make use again of 
the fact that/ = '^(c) is independent of /x, and of the property of 
tnonototiic convergence of j. As we saw in connection with the CR 
model, this itriplies sign (dejdfj,) = sign (dfjdfi) = sign (djidix). I^t us 
then determine (dfldfi), which is ihe key to the impact eflect. The sign 
of this derivative depends critically on the sign ol the mixed derivative 
The usual assumption in these cases is that Vmt' > 0; that is, the 
two assets behave like “cooperant” factors of production in the theory 
of the firm. 

In steady slates c is a given constant, while conditions (14) and (16) 
reduce to 




Uif 


fx. 


(14') 


-^=8. (16') 

Dilferentiating this system with respect to m,J, and fx we obtain the 
following solutions; 

dm ^ TpyUH dj ^ m U„ -OQ, 

dfx A ’ dfx A e 

A = ~ yi„. 

Since < 0 and A > 0 by the concavity of F(') we find that (dm/dfx) 
< 0 as expected. We also find that under the “cooperancy” assump¬ 
tion (djld^x) < 0. Since this refers to/of steady states, it follows from 
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the earlier remarks that (dfjdfi) and (dejdfi) arc negative as well. 
Thus an increase in /x leads to an appreciation ul the real exchange 
rate contrary to the CR result. 

These results are illustrated graphically in ligurc 1, where .S.S rep¬ 
resents the relationship between / and / along the saddle [lath, while 
EE represents f — 'k(e). Note that e increases from right to left on 
the horizontal axis. As /a increa.scs we have a new /(/i) and a new .S''.S ' 
curve passing through it. However, since momentarily / = /„, we 
have/„ decreasing from zero toThis corresponds on the EE curve 
to a decrease in e„ from e to e'„. The dynamic paths are repre.sented 
by movements along AB and CD. Thus along the dynamic adjust¬ 
ment path we have real depreciation associated with a negative tiade 
balance, but e is still below e throughout. 

IV. A Comparison of the Alternative Models 

The key to understanding the dif lerence in the results obtained in the 
two models is in the different effect ol on the steady-state value of/. 
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The negative ef fect of fx on/ in the LRPF model and its implications 
can be explained as follows. 

In steady states the real rate of return on F, that is, e, is identically 
zero, while the real rate of return on m is given by —/i. Hence an 
increase in reduces the real rate of return on money and therefore 
reduces m. By the cooperancy assumption this reduces the marginal 
utility of/, and consequently its equilibrium value is reduced. Under 
long-run perfect foresight tlie decumulation of / must start im¬ 
mediately so thatyi, decreases. However, a deterioration of the balance 
of trade is compatible only with an appreciation of the real exchange 
rate. Hence (de,JdfjL) < 0. 

Since the cooperancy assumption is crucial for our result, we must 
discuss it in greater detail and interpret it, for the sake of comparison, 
in the context of the CR model. 

Note that and {Vyhi,,) can be interpreted as the .subjective 

rates of return on m and F, which will be denoteil by and py, 
respectively. Condition (M') can then be written asp,„ — = Pf (p. = 

ir), which indicates that the p^ must equal p„, after allowing for the 
inflationary depreciation of real balances. Thus W'c have essentially a 
(ondition of equality of the “net” rates of retin n properly defined. It 
can therefore be regarded as a jjortfolio allocation condition analo¬ 
gous to condition (7) in the CR model. 

Indeed, in (14') we have a relationship between m and F m steady 
slates that must be consistent with p (= F), as is the case in (7). 11 we 
accept this analogy between (14') and (7) it seems reasonable to 
require that for a given p, m and F should be positively related in 
(14'). Writing (14') as 


ymPn.l'') 

U;/ 


— 5 = p. 


(14") 


we see immediately that a jtosilive relationship between m anrl F 
implies V,„y > 0; that is, the factors must l)e cooperant. Thus the 
cooperancy assumption seems to be in line with C.R's equation (7). 

While (14') relates the rates of return to each other, (16'), which can 
he written as pf (m,F) = S, pins down the rate of return on F to the 
given subjective rate of time preference. Thus (16') helps to deter¬ 
mine the “scale” of assets. As p increases and m decreases, we have to 
determine what happens to the value of F which is required to main¬ 
tain py,. = 8. In this connection we note that if m and F are positively 
related in (14"), this must be the case for (16') as well (since V,„y > 0 
and V is concave). 

Note that while (14') plays a similar role to that of (7) in the (iR 
mf>del, condition (16') is the one which replaces their assumption of 
the constancy oI'ot -t- F across steady slates. In the present model the 
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tonsi.mcy of tlie value of toial assets is replatcd l)\ the comfition of 
constancy of tfie net rate of return, that is, 8 = = p,„ - II, which has 

quite cliff'erent iinplitatious. 

I fie cliffereute between ifie models can now be clarihed by means 
of fifcure 2. Suppose that the/M curve leprese as the portfolio bal¬ 
ance condition given bv (7) m the CR model and by (H") in the l.RPf 
model. To close the model, Calvo and Rodiiguez introduce the con¬ 
dition that )/i + F is constant as represented bv the Hli curve. 

In ific LRPK model the svsicm is closed in a different manner, 
nanielv, fiv the condition Pf = 8, that is, by (16’) as expressed h\ BB 
The latter curve must be upward sloping since ihe assets arc cooper- 
ant, and it is steeper tli.iii -‘f.-f as a result of the < onc.iv ity of P (■). Ihe 
infeisection ol .‘(.‘f and BB reinescnits the solniion coi 1 e^'pcmding to 
(1-4') and (16'). An increase in p, will then shift tlie long-run eiinilih- 
riuin to Q' in the CR model (where F increases) and to Q” m the 
alteriuitive one (where f decrc'.ises). I iilike the (,R mocfc'l, wheie total 
a.s.sets (w -I- F) are constant, we must li.ive .1 ledm tion ol .issets in the 
alter native LRPK model. 
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Economies of Scale and Barriers to Entry 


Richard vSchmalensee 

MfLS'iiirhu.seUs In\tilule of'i'echnoiogy 


Dixit has recently ptesentecl a inotlei in which established hrrns select 
capacity to discourage entry but cannot employ threats they would 
not rationally exet ute after entry. Kntry deterrence in a slight tnodi- 
hcatiori of this model involves the classic limit-price output. Under 
linear or concave demand, however, the capital cost of a firm of 
imiiiitiutii eUicietK scale is an upper Ixiuntl on the present value of 
the monopoly profit stream that cati be shielded from entry. It is 
argued that this suggests the general unimportance of entry barriers 
erected by stale econotnies. 


In his sciniiKil work on conditions of entry, Bain (1956) argued that 
the necessity for a firm to be large relative to the market in order to 
attain productive efficiency created a barrier to entry. The notion that 
large-scale entry can create a discrete difference between preentry 
and postenrry price and profit levels is clear enough, hut the details of 
Bain’s argument are still somewhat controversial. In assessing the 
importance of the scale-economics barrier, Bain introduced the 
limit-pricing model of entry deterrence, in which established firms act 
as a perfect cartel and potential entrants expect those firms to main¬ 
tain their preeiury levels of output even after entry. I'his model has 
been subjected to strong criticism, however, in large part because it 
may not be rational for the established firms to keep output constant 
after large-scale entry has occurred.' Moreover, Stigler (1968) and 
others have challenged the basic idea that scale economies can create a 

I am indebted lo Steve Salop, Roger Bohn, and Nalin Kulalilaka for helptul com¬ 
ments, though 1 cannot share blame for remaining defects with them. 

' Scherer (1980, chap. 8) provides a useful discussion ol the limit-pricing model and 
its ciilKs. 

[Jtmmnl of PoUhfal F.rorumnt, 1981. veil 89. no. 6] 
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meaningful emr, barrier. I hese crUic. have „re.„d ,l,e lac, ,ha, 
once an entrant has invested in an efficient plant, there is no dif- 
lerence (under the usual assumptions) between its position and that ot 
established firms. Without a postentry difference, they have argued 
there can be no real barrier to entry. 

In part this last argument concerns the most useful definition of 
barrier to entry. Spence (1977), building on insights of .Schelling 
(I960), has recently presented an interesting analysis of the economic 
issues involved in that argument.^The basic point that emerges is that 
established firms, assuming they can coordinate their actions, have the 
advantage of being able to make some irreversible decisions before 
new entry appears. In particular, they can select the level of capacity 
facing any new entrant. Even it entry txcurs and the established firms 
then wish to have less capacity, their preentry commitments may 
make a rapid reduction in capacity impossible. Recognizing this, en¬ 
trants may be deterred. In his formal analysis, Spence assumes that 
the established firms build enough capacity to produce nearly the 
competitive output. Before entry, they produce the monopoly output, 
but they threaten to use all their capacity if entry occurs. If this threat 
is believed, as Spence assumes it is, entry can clearly be deterred. But a 
threat to increase output after entry is surely no more ci'edible than 
the threat to maintain output that is the core of the limit-price model. 

In an important recent paper, Dixit (1980) explores the implica¬ 
tions ot restricting the established firms to threats that they would hnd 
it in their postentry interests to execute. In particular, he assumes 
that potential entrants expect the postentry market equilibrium to be 
that of Cournot doupoly. That is, they expect ihc established hrms to 
behave rationally but noncooperatively in response to their entry. 
Though this market model is far from peilect, it at least implies 
plausible profit-seeking behavior by both established sellers and new 
entrants, and it avoids building ad hoc behavioral asymmetries into 
the postentry situation.® Dixit finds that the established firms may still 
be able profitably to deter entry if they can commit to a level of 
capacity before potential entrants appear. 

In Dixit's simplest model, scale economies arise because ot fixed 
costs. In order to relate his anal)sis to the earlier limit-pricing litera¬ 
ture and to empirical work on stale economies, it is assumed here 
instead that the long-run average cost curve is L-shaped, so that the 
minimum efficient scale of operations is well defined.^ The resultant 


’ See also Dixit (1979). Speiue( 1979), and loi aKo<Kl discussion oi the basic ion, epts 
involved, Salop (1979). 

“In addition to ihe di.scossion ol ihc Coninol .issiimplion bv Dixit (1980), see Oir 
and MacAvoy (1965), whete it isarmicd ihat (aiurnol be!iavior is a plausible threatened 
reaction to cheating on a cartel agreement 

‘See, e g., Modigliani 1958 and Schcier 1980. rhaps 4 and 8. 
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model is analyzed in the next section. It is shown there that entry 
deterrence involve.s charging exactly the clas.sic limit price before 
entry and holding output constant if entry occurs. A basic dif ference 
between this analysis and that of Bain-Sylos, however, is that the 
threat to hohl output constant after entry is not always credible. It is 
shown that if the threat is credible, and if the market inverse demand 
function is linear or concave, the preentry present value of excess 
profits that can be shielded from entry cannot exceed the capital 
(start-up) cost of a firm of minimum efficient scale. In Section If, it is 
argued that this upper hound and widely accepted estimates of the 
importance of scale economies imply that the entry barriers consid¬ 
ered here are generally of little quantitative importance. I’he implica¬ 
tions of localized competition and long-lived advertising in this con¬ 
text are discussed briefly. 


I. Credible Entry Deterrence 

Following most of the relevant literature, the established firtns are 
assumed to be either a single enterprise or a perfect cartel and arc 
referred to as “the monopoly” in what follows. Only one potential 
entrant, called “the entrant,” i.s considered. I'he itionopoly is assumed 
to select and totislruct its capacity, K"\ prior to the entrant’s appear¬ 
ance. Both the monopoly and the entrant operate under the cost 
cftiiditions shown in figure 1. Economies of .scale are such that firms 
with capacity less tfian A,, have prohilritively high costs, so that A'„ is 
the minimum efficient scale in this industry. For levels of output, q, 
between A',, and ta[)acily. A', variable cost i.s v per unit. Capacity costs c 
per unit to install. It is assumed to last forever, so that the How cost of 
capacity per period is (rc) per unit, where r is the relevant discount 
rate. With capacity A' in place, the heavy kinked line in figure 1 is the 



Ko K q 

Fir.. 1.—Marginal cost after acquiMtion of iap.tcity A' 
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elTeciivc inarfrinal cost schedule. Bv selecting A'"', the monopoly de¬ 
termines where the kink in its marginal cost limciion orciiis. I he 
entrant, with no sunk investment in capacity, faces a constant maigi- 
nal cost ot / = /c + (or all output levels alxne A'„. All this follows the 
basic setup of Dixit's (1980) simplest model, except that he has = 0, 
and he allows nonzero hxecl setup costs. 

1 he first stej) in the analysis is the description of the monopoly’s 
posietitry Cournot reaction function. For illustiatice purposes, sup¬ 
pose that the iiidusity tttverse demand iunrtictti is gicen by 

P = P{(1) = a - h() = a - h(q'' • q'"). (1) 

where a and h are positive constants, P is pi ice per unit, (> is total 
industry output, q"‘ is monopoly output, and q'" is the entrant's output. 
If margintil cost were some constant. X, the monopolv’s reaction 
lunclioii would Ire simply 

q" = (rt - bq' - X)/2I>. (2) 

Things are more complex under the cost stiucture assumed here. 

The heavy kinked line in figure 2 shows the iiKrnopoK's postentry 
reaction liinciion given installed capacity A'"' and market demand 
given by (1). 

In figure 2, the line AfAl' is the leaction function ol a lirm choosing 
q'", taking!/'’ as fixed, with constant marginal cost ccjual tot, while A’.\" 
is drawn for marginal cost u. It q'' exceeds the addition of Ao to 
industry output would drive price below average variable cost, v, so 
that the monopolv's optimal output is zero even with c.ip;tcilv m place. 
For (/'■ Isetween A,, and U the monopoly optimally sets q'" -- A’„. If 
there were no lower bounil constraint on output, a sin,tiler yaliie of q"‘ 
would be chosen tilongNjV'. .As^' is reduced beloyv L,.. the monopoly 
expands output along ,V.V' until the capacity constraint is encoun¬ 
tered. I he horizonial section ol the reaction function in figure .f 
corresponds to the vertical portion ol the marginal cost schedule in 
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figure 1. If is sufficiently low, it may (depending on the initial choice 
of K’”) be optimal for the monopoly to add capacity and expand 
output, at a marginal cost of t per unit. As goes to zero, the 
monopoly’s optimal output goes to M, the unconstrained monopoly 
level, as figure 2 is drawn. If were chosen above M but below N, 
q”* would be set equal to K”' for q' = 0. Monopoly output never ex¬ 
ceeds N. 

If the monopoly had not acquired any capacity, it would not invest 

q^ were above U,, since the addition of Kq to industry output would 
drive the price below total unit cost, 1. For q^ between L, and Ut, it 
would be optimal to set q’” = K^, while for smaller values of 9 * it would 
be optimal to acquire more than the minimum possible capacity. 
Formally, the key quantity levels discussed above are given in general 
and in tbe linear case of equation ( 1 ) by 


P(L, + ATo) + K,P,AL, + /Co) = t; 


L, = (a - 2bKo-l)/b, (3a) 


PiU,+Ko)-l-, 

P(L,- + /Co) + KaP(j(L,. +/Co) = v; 
P(U, + /Co) = u; 


U, = (a ~ bKo - t)Jb, (3b) 
L^= (a - 2bKu - v)/b, (3c) 
U^={a-bKo-v)lh. (3d) 


The reaction function relevant to tbe entrant’s decisions is given by 
the mirror image of the schedule M/fB in figure 2. With no commit¬ 
ment to capacity, the entrant has a constant (long-run) marginal cost 
oft for output levels above/Co- Consideration of figure 2 then makes it 
clear that the entrant will be deterred if it expects the monopoly’s 
postentry output to exceed U,. From (3b), Ut is exactly the preentry 
output predicted by the classic Bain-Sylos limit-price theory, with 
P(U,) the corresponding preentry price. 

Figure 3 depicts a situation in which entry is deterred by monopoly 
choice of a capacity level, K’", above the limit quantity, U,. (Note the 
reversal of axes between fig. 2 and fig. 3.) Given its precommitment to 



Fur. 3 .—Credible enrry dcierrence 
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that capacity, the monopoly’s reaciion function is exactly as before; as 
long as does not greatly exceed K«, it is clear that the monopoly’s 
optitrial response is to utilize capacity fully. The entrant’s reaction 
function, fiap, is the mirror image of the ex ante monopoly schedule 
MAB. Note that with the monopoly’s capacity in place and the corre¬ 
sponding costs sunk, the conventional Cournot equilibrium point, <■, 
has no particular significance. 1 he only equilibrium isq"’ = = O', 

entry is deterred, and the monopoly always u.ses its installed capacity 
fully. 

A bit more analysis of the situation depicted in figure 3 makes it 
clear that in order for the monopoly to be able to tleter entry and still 
enjoy positive economic proht, two conditions are necessary. First, c 
must be positive; it must be possible for the monopoly to commit 
itself in advance to capacity, to purchase a lower marginal cost over 
some output range. If c = 0, theAfC = v reaction schedule A'W (which 
corresponds to the left-hand portion of NN' in fig. 2) collapses to 
M'M, and the monopoly cannot avoid the Cournot point, r, which 
involves entry. Second, must be positive; iherc must be scale 
economics as well as durable capital. It A.'n = 0, the entrant’s reattion 
function extends to the point ft' , and K'" must exceed ft' in order to 
cfeter entry. But an exatnination of the entrant’s reactioti function 
makes it clear that P(ft') = t, so that the monopoly can deter entry 
only by producing at least the competitive output in the absence of 
scale econotnies. (When <7'’ = 0, the entrant's marginal revenue is 
exactly the ttiarket price. Reference to [3b| makes it clear that U, < ft', 
as fig. 3 shows.) 

A somewhat stronger necessary condition for entry detet rente with 
positive profits is that the point /3 in hgure 3 must lie inside the X"N 
locus. That is, if marginal cost is v, it must be optimal lor the 
monopoly to produce at least U, in response to q'’ — Kg. If not, anv 
entrant can plan on producing K„ and earning |)osiiivc proht, re¬ 
gardless of the monopoly’s choice of capacity or preentry output. If 
the market marginal revenue curve slopes tlownward, this condition 
means that the monopoly’s marginal revenue at the ])oiiit ji must be at 
least V. 

P(u, + Kg) u,p^(u, + a:,.) 3 v. (d) 

Rearranging, noting ihm P(U, + K„) = / from (3b). and multiplying bv 
Kg, one obtains 

[-Pii{Ut + Kg)Kg\U, ^ rcKg. (.^) 

If the inverse demand function is either lineat ot lonc.ue, weak 
concavity implies 

P(U,) ^ P{U, -f- A'fl) + Pu(Ui + - iU, + Ko)]. (ti) 
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Substilufion from (6) for the bracketed term in (5) yields 


[P(V,)-t]U,^rcK^. (7) 

Since U^ is the monopolist’s minimum (preentry and postentry) deter¬ 
rence output, condition (7) says that the flow of’excess profit enjoyed 
by a monopoly that has credibly deterreil entry cannot exceed the 
pcr-period cost of capital employed in an enterprise of minimum 
efficient scale.“ This inequality clearly reflects the fact that both c and 
Kf, must be positive for this sort of limit pricing to be credible and 
profitable. 

It should be noted that it may not be optimal for tlie monopoly to 
install sufficient capacity to deter entry even if it is possible to do so. 
Depending on the details of the demand structure, the monopoly 
might prefer to allow entry and have the equilibrium occur at some 
point along cay3. It can, of course, guarantee itself any such point 
within N'N by its choice of AT'". If the monopoly does decide to admit 
the entrant atid the latter invests in capacity, inequality (7) serves to 
bound the total excess profit that the resiiltant duopoly can protect 
from entry by coordinated action.* In general, if entry is unattractive 
to outsiders, the fltrw of monopoly profits received by cooperating 
itisideis cannot exceed the flow cost of capital assets embodied in a 
firm of minitnum efficient scale, as long as industry demand is not 
strictly convex. 


11. Extensions and Implications 

The detivatioii of (7) rests on a number of strong assumptions that 
serve to overstate the ease of deterrence. First, it is at least arguable that 
the Cournot model overstates the intensity of rivalry to be expected in 

^ 111 lltis niodcl. ihcK* woiikl .seem i<> be m) geiieiiil way lo oblain a boiiix) oi> prohi, 
like (oiuhiioii (7), wiiliom using i oik avitv. (II deni.iiul elasiu Uy is a (onsiant, E < - I, 
f4] is siiiisfied for all posiiivc /vy it and only d v ^ ic\E — M ) Similar liounds liold in 
some relaie<l models of entry delenciice In the “simple illuslialive fotinal model" ol 
Schmalenscc (1978, pp. eg, if new biands do not expand loi.d dcm.md. 

[)er brand e\(ess piofi( in deterieiuc c<|uilibriuin cijuals bijind-spet ibe fixed (osj. With 
oijipul cx[)jn.sion, fixed cost fx'comcs an upper bound. See also ilic equilibria in ihe 
lefeicnces tiled m S(limalensee (1978, p. 313, n. 1.5) 

11 a given level of capacity is divided among several firms, each will see a larger 
Cournot iiiaiginal revenue at capacity output than would a single scllci. Kor any given 
enUani oiitpui, tapatiiy is thus inoie likely lo !)c fully uiih/ed with several csiablished 
fnins ihan with a single momipolisl. riii.s means dial a higher level of profiis can m 
piini iple lx* pioiecied against eniry by (^ouinot l>chavior than the monopoly analysis m 
the text implies, (nven the ilifhculty of c<K>idinatmg invesimcnl policies among seveial 
films, however, there seems little lo be gained by a detailed atialysis. U is suielv 
reasonable, though not necessarily correct, to assume that monopoly entry deieiTence 
involves higher preeniry piofits than feasible oligopoly dcterience strategies. 
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the postentry tluopoly.^ If the entrant is more optimistic, entry will be 
harder to deter, and the How of excess profits that can be protected 
will be smaller than (7) allows. Second, it was assumed that the 
motiopoly was able to invest in inhnitely durable assets. The recent 
work of’ Katon and I.ipsey (1980) snppotts the intuitive argument that 
the strength of the monopoly’s commitment is reduced as the assets it 
purchases become less durable, since the monopoly cannot generally 
commit itself in advance to invest after entry in order to maintain 
capacity. I bird, the assumption that average tost is effectively infinite 
for outputs below Kg is very unrealistic. As the presentations of Mo¬ 
digliani (1958) and Scherer (1980, chap. 8) show, entry deterrence in 
the Bain-Sylos limit-pticing model is more difficult the more slowly 
average cost rises when ovitput is reduced below K„. The need to deter 
entry at small scale lowers the proHtability of the limit-price point, 
which is exactly the point consideietl in deriving inequality (7). Fi¬ 
nally, tfie assumption ifiat the established linns manage to coortlinate 
their investment policies peifecily in the interest of entry deterrence 
is very strong indeed. I.ess than perfect coordination in deterrence 
would be expected to make entry more likely. 

Given the stretigth of these four assumptions, it would seem sensi¬ 
ble to use (7) as an upper bound on monopoly profit as.sociated with 
stale-economy entry baniers even in situations ,vhere industry in¬ 
verse detnand may be convex. 'I'<t explore the implications of this 
conclusion, let tis consider an industrv with (prtce-level adjusted, net) 
assets A, earning an accounting r.ite of return r* on those assets. 
Assuming awav me;isiirt*nient problems and winiifalls, accounting 
profits equal iioriiial rctitrns to capital plus monopols prohis. I-ct the 
competitise rate of return, the ratio of normal piofiis loA, be c. It is 
then .sensible to use r to go from snu ks to flows in this mdusii v as is 
done in (7). I'liat condition tan then be rewiiiieii as 

(r* - r)A ^ nA'o. ) 


which immediately laecomes 

(,* - r)/r 


cKJA. 


(H) 


Assuming that economies of scale are esseiitialK exhausted at 
capacity A'„, llie ratio cKJA should be a good approximation to the 
ratio KgKl, the ratio of minimum efficient scale (m output terms) to 
market deniaiul. For nmst industries, most sclmlars estimate this ratio 
to be less than 0.10. (Sclierei 11980. chap. 4] presents a large luunbei 
of such estimates.) Gondilion (8) then implies that it an mdiisti v has a 
return on a.ssets of, sav. 10 percent, entry is not taking place, and onb 


'Sec Ihc ai.s(ii.s.si<»i <>1 m suhiiei (l<m 0 . di-ni 5 ) 
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scale economies can serve to deter entry, the normal rate of return 
must be at least 9.1 percent unless scale economies are exceptionally 
important. Scale-economy entry barriers thus cannot account for a 
rate of monopoly profit of even 1 percent, surely not much above the 
standard deviation of the corresponding measurement error in most 
situations. 

An analysis of deadweight loss similarly suggests that entry barriers 
derived from scale economies are not generally important. If industry 
demand is given by ( 1 ), it is easy to show that the deadweight loss from 
limit pricing is b(Ko)V2. Letting demand elasticity, P/bQ, equal E, the 
ratio of deadweight loss to sales revenue becomes l{Ko/Q)^J2E]. Thi.s 
exceetls 0.005 only if economies of scale are of exceptional impor¬ 
tance or H E is less than one. 

I'he analysis so far, based on conventional wisdom about the im¬ 
portance of scale economies, supports Bain’s (1956, p. 212) conclusion 
that “economies of .scale iti production and distribution do not loom 
large as the basis of barriers to entry.’’ Two factors that might weaken 
this conclusion deserve brief mention, however. 

First, tnost U.S. estimates of the importance of .scale economies 
assume national markets and implicitly treat products as homogene¬ 
ous. If tt ansport costs are important and markets are regional, scale 
economies can becotne very important indeed. .Scherer (1980, p. 98) 
esiitnates that a cetnent plant of minimum efhcient scale would ac¬ 
count for about 40 percent of demand in a typical regional market, 
for instance. If products are differentiated, competition may be simi- 
lai ly localized in product space. As is discussed at length in Schmalen- 
see (1978), this sort of demand structure also serves to magnify the 
importance of scale economies in entry deterrence. 

.Second, spending on advertising generally has an investment com¬ 
ponent, though there is considerable uncertainty about the typical 
dvn ability of the corresponding as.set.*The analysis of Section I above 
implies that, by itself, the longevity of advertising’s effects on demand 
does not make it possible to u.se adverii.sing to protect excess profits. A 
number of authors have suggested that the use of advertising involves 
significant economies of scale, however. There is also considerable 
uncertainty about the importance of these scale effects.'* If it turns out 
in general or in .some industry that scale economies in advertising are 
important and that advcrti.sing has relatively long-lasting effects on 
demand, the analysis above would lead one to suspect, by analogy, 

* (Joriianor and Wilson (1979) survey the evidence on this point. 

’ See Schmalcnsee 1972, chap. 7; Comanor and Wilson 1979. Spence (1980) provide' 
a useful discussion ol the definition of economies of scale in this context, though hi 
docs not treat advertising a.s an invc.stment. 
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that advertising could be used like investment in plant anti equipment 
to protect excess profits from entry. In ortier to verily this suspicion, 
however, one would at least have to verify that at any instant past 
investment in advertising has the same sort of impact on marginal 
returns to changes in output that investment in capacity does. I'he 
first of the models discussed in Schmalensee (1974) illustiales that 
advertising does not have such effects in at least some plausible 
dynamic models.'® Rather than attempting to reason by analogy troin 
the treatmetit of investment in capacity here, it would seem preferable 
to construct explicit models involving economies of scale in advertis¬ 
ing and dutability of advertising’s ell'ects on demand and to sec if 
those models allow established monopolies to deter entry by means of 
investment in advertising. Such models may even imply simple 
analogs of (7), but that remains to be seen. 
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Having presented the model in the first section, we use it as the 
basis of our discussion of industrialization in Brazil in the years of 
the Great Depression. The key to understanding the behavior of the 
Brazilian economy in the 1930s is the government coffee-support 
policy, which held the income of the export sector at a high level and 
hence enabled the manufacturing sector to expand.' We show that 
stockpiling expanded income, and that, when stockpiling was 
financed by export duties, the trade balance improved, but when it 
was financed by credit, a trade deficit arose. The fact that the currency 
was devalued during periods in which the coffee sector was supported 
through domestic credit expansion but appreciated when coffee pur¬ 
chases were financed via duties strengthens the hypothesis that de¬ 
valuations were a respon.se to the external disequilibrium created by 
the cof fee policy. The fact that industrial output expanded in the face 
of both real depreciation and appreciation points to the importance of 
expenditures effects, relative to depreciation and price effects, in¬ 
duced by fiscal and monetary policies during the 1930s. 


I. The Model 

The economy has two sectors: a coffee-producing sector oriented to 
the export market and a manufacturing sector that produces goods 
tor domestic consumption and investment. Imported goods compete 
with domestic luanufacuires on the internal market. 

fhere are markets for three types of goods; coffee (C), domestic 
manufactures (Q), and imported manufactures (M). The prices of 
these goods in home currency are, respectively, Pr, P, and P„. The 
nominal wage is denoted W. The world supply of imported man- 
ulaciures is perfectly elastic at the international price P%, so thatP„, = 
ePf„, where e stands for the exchange rate. 

When impoited manufactures are used as the numeraire, the real 

' I Ills hyputhesis i.s advanced by Kurtado (1963). Kurlado is crituued by Pelac/ 
(1972), who contends that a large share ol the income generated by the multiplier 
eliccts of the cot tee-support cxpendituies was canceled by the olfselting negative 
multiplier etfecls ot the taxes levied to finance ccittee purcha.scs. Fishlow (1972) argues 
that the col lee-support prograin generated demancl in a somewhat more complex 
manner than Furtado liclieves. Basing his argument on a partial equilibrium mcidel, 
Fishlow maintains that the greater part of the export duties were pas.sed on to the 
foreign consumer. I'he transfen of the duties was guaranteetd by the inelasticity of 
demand and the high elasticity ot .supply. Our more complete model incorporates the 
mechanism described by Fishlow, wtiilc at the same time analyzing the cciflee policy 
Irom the standpoint of its impact on real income in accord with the approach advised by 
Pelaec and Sticigan (1976). A monetary explanauon for the industrial upsurges during 
the 1930s is found m Pelaez (1972) and Neuhaus (1973). The monetary factors ex¬ 
plaining the liehavior of economic policy in the Brazilian experience during the De¬ 
pression are also incorporated in our model. 
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prices are: p, s P,IPm,p = PIP^, and w = WIP„. We assume unem¬ 
ployment and a given nominal wage rate. 

A. The CoJJee Market 

In the coHee sector, the fixed lactoi (suitable land planted in coffee 
and served by adequate Iransjxnt) can be combined with varying 
quantities of labor to obtaitt the crop. I he grower maximi/cs piofits 
by equalizing the value of the marginal product and a given nominal 
wage. When the price of coffee lises, so do employment and output. 
The coffee supply is, therefore, a lising function of the price of coffee 
in wage units; 

a ^Cipjw). ( 1 ) 

Coffee is exported, foreign demaml is a (unction of the real income 
of the coffee-importing countries,y*, and of the dollar price of coffee 
deflated by the world price of imported manufactures: 

C"-C"(7'^.y*). (2) 

wherepo = (P,.leP^) and T = \ + I, with l being the duty rate on coffee 
exports and P^ and Pf„ the home price of coffee and foreign cun ency 
price of manufactures. 

Pan of the coffee is bought by the governinenl rather ihan sold 
abroad. These puichases are financed thiough foreign loans, domes¬ 
tic credit, and export duties on coffee. The real coffee purchases on 
the part of the government are designated G. 

Equilibriutn on the coffee market requires ihat^ 

C\pM = C‘>{p,-T,y*) + G. (.'1) 

Equation (S) deteimines the real protluceis’ price of coffee, p,., as a 
function of the exogenous variables;' p^ = p,-(>'*. (G T. i' )' dp,Jdy* 
> 0; dpJdG > 0; dp,/dT < 0; and dp,like > 0. 

We also define real export earnings. A', and real dis|)osable income 
of the coffee sector, Vr- measured in terms of impoitcd in.uiufacturcs, 
respectively, as:'' 


“ It IS a.s,siiiiicd itial doincslii lofici- (onsiimplinn is nigligihlc 
“ Krom (3). 

^— (f, w' I Tju-y*' c «■ -- 7j,/'), H) 

+ rj. 

where a priiiie beside die varialile iiidii.iles iis InRai iihinie dei is.iiivi.e . \ - u — 

dG/C'‘\ = price elaslu ily of it'e col fee .supply. T, - .ihsohite c.iliieol die pi ice el.istKity 

of the demand for colfee, and - imoiiie el.i-sticiiy ol die demand loi collo; 
Kejuatiun (4) rs an approximation. It rs exactly correct only il initiallv t- “ t) oi G — (A 
' The effect of ch.inges in the cxcigeiions variables on veal export eatnings can be 
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X = Tp,CXrpc,y*) = X{y*,G, T, w). (7) 

y,. = prC^iprlw) = G, T, w). (8) 

Real export earnings and real disposable income of ihe coffee 
sector are increasing functions of foreign income, government pur¬ 
chases, and real wages. A rise in export taxes raises export earnings 
but reduces disposable iticoine. 

An increase in the real income of the coffec-iinporfing countries, 
V*, enhances external coffee demand and laises the real price of 
cof fee, real export earnings, and real disposable income of the coffee 
sector. 

An incicase in gervermnent coffee purchases boosts the price, and, 
since tlie dc'inand tor coffee is inelastic, export earnings and real 
income of the coffee sector lise. 

An increase in export duties, T, raises the dollar price and leduces 
internatJonal demand for coffee, bringing on a decrease of the real 
price received by the grower. Since the demand for coffee is inelastic 
relatice to its ptice, export earnings lise. But disposable income of the 
coflee sector falls, Ix'caiise lx>th the cinantity produced and the price 
leccived tty the grower diminish. 

An increase in the real wtige rate lessens coffee supply and pushes 
tlie teal prite of coffee up. Both expoit earnings and real income rise 
inasmuch as demand is incl.istic. 

li. Tilt' Mriikct for Domi'sitc Mfitiu/actinrs 

f ile price of domestic inamifaciures is cost detei mined. I he re;tl 
[nice of domestic nianuf.ictures, in terms of imported manufactures, 
is 


p = Pw, (9) 

where fi stands lor tlie outpul/laboi ratio. 

Manufacturing output is cleleiinined Ity demand, which depends 
on re.il cloinestic income, v, on its pi ice in telation to that of impcjt ted 
similats, p, and on rc’al Italances, h. Real domestic manuiaciuring 
output, y,,, thus is: 

y„ = pQ_(p,y,h)- (10) 

.suidjcti using iho JoDnwmg ctjiidtion: 

1 he etfect changes in ihe exogenous \ariahtes on the income ol ihe coffee scctoi' is 
given by; 

y! ’ -Vvu’y*’ <r - Vif") + {——)(> ~ VcW. (t>) 

\ fr ^ Vi f \ + Vi > 
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Real ineotne is flelined as: 


v^vv + v,. (11) 

Equations (9), (10), atid (1 1) allow us lodetciminc internal eqinlil)- 
liuin and to express real income as ,i liinciion of the income of the 
coffee sector, real hahintes, and the teal wtige.^ (nven the money 
stock, the fiscal policy, atid the monev wage, we c;in depict intetntil 
ecptilihrium hy an inverse relationship between real incottie ;ind the 
exchange rate, as illustrateil in figure 1 by schedule yv '* Assume an 
initial comhintuion of the real income and the exchange rate lor 
which there is ititertnil equilibrium. An nutease m the exchange rate 
reduces the real wage, real balances, ;md income of the coffee set lor 
Both llie redut lion in real balances antf the ledutlion m the income of 
ifie col fee sector conii ihute to leducing the demtmd lor domestic 
goods. However, the reduction in teal wages, and thus m the pi ice ol 
domestic goods in relation to imports, mcieases demand lor domeslic 
manufactuies in suhsiitution for the impoiied goods. As long as the 
two first effects domimite, a depreciation ol the ext haiige rate reduces 
demand lor domestic maiud.ictiires. Conseqiienllv. mdusirial output 


■’ Assuming Ih.u, miM.ill),G = I - 0 ,m<l .V = .M.isluu-A; si.imlslm im|)orts,ssc li.isi- 
V, = X - M .uul, irom t*<} (II). 

,h.M - v; + (v.,/A/)v,;. (i‘') 

Kioin cq. (10), 

y;=(i 17.,)/''vt'ldi,) + M'- d.'ii 

where: t)„ ^ .ahsululc v.iluc ul ihc piKC eUslu ilv <.l ihedein.uul lui dumesiu m.iniil.u - 
lures, q = m,irgin.il nropeiisii) lo .oiisiime d.iiiiesCK m.imil.u Hues, .mil X„ e isiuils 

ol demand lor domestir mamil.uimes iii ... u> leal balance Siilimimmg (13) into 

(12) and obseiMiig. Iioiii the Imdgel ioiisli.mil. di.il 1 ,v.,'.MMI vJ Vm- >' 

the pine elaslKily ol die demand loi im|>oiled maiml.mmes, cse ol.Mm 


c/v/M - ll/ll c/dh; ~ Vmt>' 1 HX.,/1'), 


( 1 - 1 ) 


whcie « yJM. , , i . j. , 

» Note that h' - - le - e', for guen nominal mones si.x k .md ' • W i 

given policy, we .liso li.ise: v,' = c*. ii' . whete cA^ (I ri,)\f,l{€,. i), 1 < • 

rewrite ( 14 ) asr7(c/v/M) ^ -b r in)/«A, + HK, - r,„). wt.eve c = piopensity 

to save. I he schedule vv is downward sloping as long .is <.i>, + 
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falls, and real income is reduced. It is implied that a depreciation is 
contractionary when the nominal money supply is held constant. 

C. The Balance of Payments 

Imports of manufactures, M, are a function of their price relative to 
that of domestic similar products, of real income, and of real cash 
balances; 

M=M{p,y,h). (15) 

The balance of payments measured in terms of imported inanufac- 
lures is 


B=X-M+F, (16) 

where F stands for independent movements of foreign capital ex- 
pressefl in terms of imported goods. 

External equilibrium, B = 0, can be represented by a positive 
relationship between the exchange rale and real income, as illustrated 
by schedule BB in figure 1.^ If real income increases, the demand for 
impor ts rises, generating a trade deficit. To restore erprilibriurn, the 
exchange rale has to depreciate. For the depreciation to improve the 
balance of payments, the reduction in cof fee-export revenues has to be 
smaller than the reduction in import spending, obtained through the 
substitution effect, and the reduction in cash balances, induced by the 
depreciation. 

As illustrated iti figure 1, internal and external equilibrium prevails 
at E, where yy and BB ci-oss. We shall now illustrate the model by 
presenting some historically relevant comparative static exerci.ses. 

D. Cojjee Purchases Financed by Export Duties 

Assuming that G ~ / = 0, at the outset, the result of the government’s 
increasing expenditures while maintaining a balanced budget (rfG = 
d/C'^) is as follows. 

Since the demand for coffee is inelastic, the initial impact of gov¬ 
ernment outlays hnanced by export duties is to raise cof fee prices and 
export earnings," creating a trade surplus and shifting BB to the 
right,® as in figure 2. On the other hand, as the income of the coffee 


’ To obtain the slope ot BB, wc assume B — 0,X = M, and t' - 0, at the outset. We 
thus ditlerenliate (16) totally to obtain e'KdylM) = m/{r)a + m = the marginal 

propensity to spend in imported mamilactures; and = the elasticity ot demand 
for import-s in relation to real balances. 

"Observe that dX = dy,. = «l + €r)l(l — r?rV(*r + 

"The schedule BB shifts to the right in the following proportion- dy ~ (Um)dX. 
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sector rises, the demand for manufactures is promoted, and industrial 
production expands (yy shifts to the right).'® At the higher income 
level, there still is a trade surplus." The higher the marginal propen¬ 
sity to .save, the larger is the halance-of-paymcnts surplus, because less 
expenditures on foreign manufactures will be induced by the higher 
domestic income. I he exchange rate appreciates to correct for the 
external surplus. 

We thus conclude that coffee purchases financed by export duties 
are expansionary and induced an appreciation of the exchange rate. 


E. Coffee Purchaser Financed by Foreign Loam 

Next, we consider coffee purchases financed by foreign loans. As in 
the preceding case, increases occur in the price of toffee, in export 
earnings, and in income of the coffee sector. Likewise, industrial 
output expands, and domestic income grows. With the increase in 
imports, a trade deficit lower than the government deficit arises. Since 
the government deficit is offset by foreign loans, the balance ol 
payments improves. Coffee purchases financed by loieign loans ate, 
thus, expatisionary and induce an appreciation ol the exchange late, 
ceteris paribus.'^ 


F. Coffee Purchaies. Financed try Pnn/ing Money 

Consider, now, coffee purchases financed by priming money {prdC = 
dh). As already seen, the increase in government expenditures on 


'» The schedule vy shifts to the light m the tolU.wtng pu.poiuci. -= [Ills + m)\dy,. 

" At the going exchange rale, the trade soipliis can he inea.siirtd by 

"^'“^Ohserve that yvhen coftee purchases are hnanced hv toicign Uwns, dX = 1(1 - 

uoserve inai, wnen m + r,.)]d(; In leinis cl our liKure.s, yy-slnlts m the 

r)f)/(ef + TjrllifG and dy^ l( .) 1 f >, j gg shifts to the right in the 

right in the lollotving proportion, dy - I!/(' + ^ 

folloyvtng proportion, dy = {(l/''i)[(l + O^le.- + ''idl} 
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coffee will laise the price of the prochict, export eaniiiigs, and (he 
income of the cof fee sector. Observe that the increase in the income of 
the coffee sector, for an increase in government expenditures on 
coffee, is larger than the increase in export revenues since it also 
grows hy the government deficit. 

I'he increase in money balances induces more spending on imports. 
If this effect exceerls the increase in export revenues, even at the 
initial income level, a tiatledeficit appears, shiftingB/f to the left,as 
shown in figure 3. On the other hand, at the going exchange rate, the 
increases in real money balances and in the income of the coffee 
sector contribute to creating excess demand for domestic goods. Ac¬ 
cordingly, die equilibrium output level of the manufacturing sector is 
enlianced, and real income grows." 

At the going exchange rate, a balance-of-payments deficit appears. 
The external disequilibrium is corrected as the exchange rate depre¬ 
ciates; Domestic goods substitute for imports, and the recfuction in 
real balances reduces demand for both domestic and impoi ted goods. 

We thus conclude that financing cof fee purchases through credit 01 
duties has far different effects foi the balance of payments. The 
government can also finance its cof fee purchases through a combina¬ 
tion of duties, credit, and foreign loans. The final elfect will certainly 
be an expansion of income, but the effect on (he exchange rate then 
depends on the shares of credit, export duties, and foreign loans in 
the total government expenditures on coffee, and on the savings 
propensity. 

II. The 1930s: An Application 

Figured exhibits the Brazilian indtistrialcautput expansion dttiingthe 
1930s in the face of alternating real depreciation and appreciation. 
I'he modc'l developed in (he previous section as.serts that the coffee- 

ITif shill mfifl tan lie incasuiccl by dy = {(l/m)((l - T|r)(€,. + t)„) - 
'’ T he shill In yy is mcasiiied by rfy = {!!/(« + »i)l[(l C e,)(Cr + Vr) + i.y„lh)k^])dh 
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Fur. 4.—Real uuinsirial (uupvit and vhc leal exihangc laic, Hia/il. 1929-34. Sfjuitt- 
table 2. 


I ABl.K I 

Raiks ()t DteRttiAiioN IN Bm/ii. 1928-39 


Pct'inel I 


68 

II 

37 

07 

19 


S(M'K< f — I alili* 3 


1928-31 

1931-33 

1933-35 

193.5-37 

I937-.39 


support policy piir.suetl during the 1930s in Brazil atct^unis lor ihc 
iiuiiislnal output expansion ob.scrved in the peiiod. Jt also follows 
from the tnodel that the observed appreciation rates, show n in table 1, 
can be atlributcd to the (act that, in the periods 1931-33 and 
1935-37, sltK-kpiling was mainly bnanced by export duties, and that 
the observed depreciations, in the years 1933-34 and 1937-39. can be 
attributed to the tact that, during those periods, cotlee purchases 
were inainlv (inanced by domestic credit. VVe turiher explore these 
facts ill this section. 

In the course of the 1920s, the col lee-support program proved 
quite effective. By purchasing unsalable stocks, the government ie c 
the international coflee price at high levels. Because of i y'''''' 
stimulus, coffee production neailv doubled Irom I925 

same lime that expoi t tent.. .ipproxirnalelv constant. In J- *. J, 

it was possible to export only two-thirds of the harvest. 

When the crisis erupted in 1929. both the federal goveimnent and 
the stale of Sao Paulo abandoned then coHee-support progiams 
because llie possibility of external financing had vanished. 

Instituto Patilista do Cafe was in financial straits."’ 1.7, 

and 1930, the coffee price index in eru/eiios fell from 100 to tx ./ 


'’See Furtadci 1963. p. 198. 

'"See SiIIht 1977, p. 189 
See Malan and Bonelli 1977, table 1.2. 
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These facts can be fitted into the model presented in Section 1. With 
the relinquishment of the support policy and the drop in the coffee 
price, both export earnings and domestic income decrease. This, in 
turn, lowers industrial output (the industrial production index fell 
from 58 in 1928 to 52 in 1930 (base = 1939])‘* as well as causing 
expenditures on manufactured imports to decrease more than export 
earnings. Even so, the trade surplus’® is outweighed, and the govern¬ 
ment is forced to devalue the exchange rate in order to deal with the 
balance-of-paymenls deficit. 

In mid-1930, (he Sao Paulo State government managed to arrange 
a foreign loan for the purpose of guaranteeing coffee prices,®® though 
at this point the mechani.sin must be qualified because the year 1930 
was marked by serious political disturbances and a withdrawal of 
capital that contributed to the deterioration of the balance of pay¬ 
ments as well as to the devaluation of the exchange rate. By assuring 
minimum purchase prices that were profitable for the majority of the 
colfee growers, the program actually maintained the employment 
level of the export economy and, indirectly, of the production sectors 
linked to the domestic market. Thanks to the coffee policy, industrial 
production began to recover from its 1930 bottom. 

In May 1931, the Conselho Nacional do Cafe was established to 
support the coffee sector by purchasing and destroying stocks, f'hc 
purchases were financed through duties levied on coffee exports and 
funds obtained from the Banco do Brasil and the National Treasury. 
In 1931 and 1932, the coffee bought by the government accounted 
for roughly 30 percent of the export revenues. Between May 1931 
and February 1933, 65 percent of these expenditures were financed 
by duties and the remaining 35 percent by credit from the Banco do 
Brasil and the National Treasury.®’ 

As the income of the colfee sector recovered Irom its lowest levels 
thanks to the price-support policy, the increasing demand for man¬ 
ufactures led to a ri.se in industrial output. Since government expen¬ 
ditures were in part financed by duties, higher income levels were 
compatible with the improvement in the balance of trade and the 
appreciation of the exchange rate. 

Between February 1933 and December 1934, the shares of the 
financial sources changed considerably, with domestic credit coining 
to assume a more important role. To better understand this policy, 
one might glance at figure 3, where the manufacturing sector ex¬ 
pands in the presence of exchange depreciation. 


'"See Fi.shlow 1972, table A-1. 

Between 1928 and 1930, export earnings fell Irom US$473.4 million to US$319 4 
million, and imports (ell from U.S$383.3 million to US$225.1 million. 

“See Silber 1977, p. 190. 

” Ibid., p. 192. 
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Fio. 5.—Real inctJinc and the real money stock, Braril, 1929-39. Source: table 2. 


7 ABl.E 2 

REAt, Out PUT, RKAt. Money, AND Rkai Ext nANt;E Rair Indices i-OR Braeii.: 1928-39 






Index of 


Index of 


Index of 

Import Pnees 


Real Industrial 

Index of 

Money Balances 

DeHaled by 


Output 

Real Output 

DeHateii by the 

Industrial Prices 


Q 

V 

GDP Deflator 

ePyP 

Years 

(1) 

(2) 

(3) 

(41 

1928 

58 

65.2 

51 5 

67 2 

1929 

56 

65 9 

,52 9 

69 9 

1930 

52 

64.5 

54 1 

91.7 

1931 

55 

62.4 

64 8 

99.7 

1932 

56 

65 1 

72 1 

86 2 

1933 

61 

70 9 

81 2 

83.0 

1934 

68 

77,4 

80 2 

84.0 

1935 

77 

79 7 

81 8 

110.6 

1936 

91 

89 3 

88 0 

1 10 3 

1937 

93 

93 4 

89 2 

110 0 

1938 

96 

97 6 

101 9 

105 5 

1939 

too 

100.0 

100 0 

100 0 


SocRcf.s —C.<il I, tisWow OMTtl. lahle A-l Col 2. Haddad (I'-l’TI. pp 11.191 Ca.Ia 3 and 4. table S 
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Money Siock and Price Indues kir Bra/d ’ 1928-29 


Money Snxk 
Annual 
Average 
H 

Years (1) 


Dollar 

Extliange lin|X>iT Iinlusiiial 

Rate Puces Pi lies 

r c/': ^ 

(2) (3) (7) 


1928 52.9 41 3 

1929 52.2 44 3 

1930 46.8 48.4 

1931 49.9 74 5 

1932 56.5 73 4 

1933 62 2 66,1 

1934 65.4 73,4 

1635 70.0 90.6 

1936 76 5 90 I 

1937 84.8 83.9 

1938 100 1 91 7 

1939 100,0 100 0 


.56.7 
54 6 
59.4 
69 2 
60 I 
59-4 
63.0 
85 7 
91 3 
94 8 
99.0 
100 0 


81.4 
78 1 
64 8 
69 4 
69 7 
71.6 
75 0 
77 5 
82 8 
86 2 
93 8 
100 0 


.Soiim ts -tk.1. I, Prlaer and Suntan (IRTlit. laUe A ’ Gd 'd- ''a' "' 
table A V 12 this is a larin.imlii.ite crtireiio pnte ol in.pt.ns la.l» 


and Bniiflii 
.i.hI 3 . il)id tsibli 


('.DP DeHdU)! 
(5) 


102.7 
98.7 
86.5 
77.0 
78 t 
76 6 
81 5 

85 6 

86 9 
95 1 
98 2 

100 0 


table 1 3 I .4 3. ibitl . 
.\ V H 
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Between 1935 and 1937, the Departaniento Nacional do Cale con¬ 
trolled the export supply through a quota system whereby collee 
growers were obliged to turn part of their production over to the 
government I'or the formation of stocks. This was similar to financing 
coffee purchases through duties. During this phase there were less 
financial resources, and the manufacturitig sector expanded in the 
presence of a slight exchange appreciation. 

Figures 4 and .5 and tables 1, 2, and 3 summarize the main facts. 
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Confirmations and Contradictions 


Estimating Property Tax Capitalization; 
A Further Comment 

Raymond M. Reinhard 


In recent years, an increasingly popular area of empirical investiga¬ 
tion has concerned attempts to test Tiebout's (1956) hypothesis that 
allocative efhciency in the provision of public set vices may be attained 
through a system of local governments.' In his seminal contribution to 
the analysis of the Tiebout hypothesis and the capitalization of prop¬ 
erty taxes, Oates (1969) reasoned that, if consumeis are sensitive to 
the public service benefits and property tax costs associated with home 
ownership, these factors should be reflected in housing prices. Until 
recently, most studies of the capitalization phenomenon have fol¬ 
lowed Oates’s speciheation in which housing values are regressed on a 
vector of housing charactei istics, public service measures, and a tax 
price term.'' 

As .several analysts have noted," however, attempts to "test ” the 
Tiebout hypothe.sis by using an Oates-tvpe etjuaiion m ist certainly 
lead to failure. This is so because, in a perfect I iebout equilibrium, a 
marginal increase in housing value brought about by increased piovi- 
sion of public services must necessarily be offset by a marginal de¬ 
crease in housing value caused by the adtfed tax cost (assuming that 


1 am Kiatelul u, Lee S Kncdm.in, Paul R Pcimev. PluH.p F,. V.ncem I honias 
King, and an anonymous iele.ee lor ihen helptuUomments W,dUe Oaies, Jon 

Sonstelie, and Pant R. Ponnes kmdis .n,ute u imilion 

■See eir Oates 1969, 1973, Pollakowski 19/,t; Fdel and Xi l.ir 19/4 n.imillou 

1976«,’l976i; Meadows 1976, King 1977: Rosen and FulU-rlon 
and Visscher 1978, Linneman 1978, and 

' A recent exception is Fpple, Zelenilz, and \i.ssdier (1. / ) 

’Sec Edel and Sclar 1974; Hamilton 1976t>: and linneman - 
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the public services are fully financed by property taxes). Thus, as 
Linneman (1978) recently observed, “[ejconometrically, an attempt to 
estimate [an Oates-type equation] will be rendered impossible due to 
data matrix singularity in taxes and some linear combination of the 
public goods” (p. 536). 


The Contribution of King 

In a recent issue of this Journal, A. Thomas King (1977) offered two 
significant contributions to the measurement of capitalization effects. 
First, he observed that most prior studies relied on an improjjer 
specification of the tax price term. As King noted, “[wjhereas the 
hypothesis suggests capitalization of the lax burden, this [Oates's] equa¬ 
tion suggests capitalization based upon the tax rate” (p. 426). Second, 
and more iinporianily. King suggested that the lax price term be 
included as part of the dependent variable. His slated concern was 
with avoiding a possible bias in the coefficient of the tax price term; a 
more cogent rationale for the alternative specification is that it avoids 
the data matrix singularity problem mentioned earlier.^ 

Unfortunately, King’s suggested estimation technique is flawed by 
an error of ecotiometrics. A reanalysis of his results, summarized 
below, shows that King’s technique, as originally presented, will lead 
to biased estimates of the true extent of capitalization. The economet¬ 
ric error, however, is easily remedied; as corrected, the King tech¬ 
nique appears to offer a significant improvement over the standard 
Oates-type equation. 


The Model 

King suggests that an equation of the following form be used to 
estimate the extent of properly tax capitalization: 

U - =/3„ + + X/SjPj. (1) 

In equation (1), K represents the market value of the property, t is the 
property tax rate, and X, and P j represent physical attributes of the 
property and measured public services, respectively. The variable b 
measures the extent to which the property tax bill, tV, is capitalized 
into the property value, “. . . and is constrained to a value selected 
from the interval (0.1-1.0)” (King, p. 429). King suggests that “a 
maximum likelihood estimate of the extent of capitalization can be 

* In addition, by incorporating the tax price term on the left-hand side of the 
estimating equation, the dependent variable then reflects the actual user cost of the 
residence, while the right-hand-side (independent) variables measure the services which 
flow from the house in a standard hedonic price equation framework. 
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obtained by varying b over the interval and reestimating, observing 
changes in R^” (ibid.)- Applying this methodology to Oates's data, 
King concludes that the result of the misspecification in the original 
Oates equations is to overestimate the iruc extent of property tax 
capitalization. 

Before discussing the econometric error, two points deserve brief 
mention. First, given the range of possible values suggested by King, 
the sign of b should properly be positive, not negative. Recall that, in a 
conventional specification of the capitalization equation (i.e., with blV 
on the right-hand side of the equation), the expected sign of b is 
negative, since an increase in tV should lead to a decrease in F, ceteris 
paribus. Because {—btV) has essentially been subtracted from both 
sides of the equation, the sign of 6 should therefore be positive. Thus, 
the depiendent variable, as redefined, represents the value of the 
property which would have obtained in the absence of any propierty 
tax, but receiving the same level of public services as before. Obvi¬ 
ously, this value should be greater than V. 

Second, King’s equation suggests the capitalization of 1 year’s tax 
bill, whereas the hypothesis wouUl suggest the capitalization of the 
present discounted value of the stream of future lax bills. Thus, the 
term blV must be adjusted to reflect the appropriate discount rate, r. 
These two corrections are incorporated into equation (2), below, 
which assumes an infinite time horizon;* 

p + + 2/3iX. + (2) 

r 

As noted. King suggests using an iterative technique based on 
observing changes in the level of R^. However, this technique cannot 
be legitimately applied to either his equation (1) or the alternative 
equation (2) without one further adjustment. The difficulty arises 
because the variance m the dependent variable (and, rcaisequently, 
the amount of variation to be explained by the independent variables) 
is increasing as b increases,* thus imparting a bias toward hndings of 
low levels of capitalization when the level of R'^ is used as the 
maximum likelihood criterion. 


» In private correspondence with the author. King agreed with the validity of these 
two ma^lheinatiial collections. However, he described their omission in 'h'’ ^ 

article as a flaw of exposition, noting that his empirical analysis look these 

‘"Mran“analoROUS case of translorinalion of the dependent va.iable (involving a 
corri^don for h«eroscedas.icity), Pmdyck and Rubinfeld (197b) observe: Note that 
theR- measure associated with the weighted leasl-s<iiiarcs procedure is lower ihan the 
aLeiated with the unweighted procedure 77w dreUne 

tnduoHo,, that th, heUroscfdasHnty correctam '‘^^"'"^'hasrad 

procedure .nvolves th, me of a transformed dependent variable (p U12. emphasis added). 
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To fariliiate a valid comparison of the level of R'^ (or, equivalently, 
of the T-statistic) across equations, the estimating equation may be 
reexpressed as follows; 

/3o + + SftPj 

[1 + ibt/r)] 

With this simple correction, a maximum likelihood estimate of the 
extent of capitalization may legitimately be obtained by varying (h/r) 
and successively reestimatirig, observing changes in the T-statistic.' 


Estimating Property Tax Capitalization 

Because Wallace Oates also made his data available to me, it is possible 
to compare the results of several alternative specifications of the 
property tax capitalization equation, including the corrected King 
specification just proposed. I'o review, Oates writes his model of 
property tax capitalization as follows: 

V = f(T, E. Z, M, R, N, y, P), (4) 

where V = median home value by town, 7’ = the efiectivc percentage 
tax rate, E = annual etlucafional expenditures per pupil, Z — per 
capita municipal spending on all functions other than local public 
schools and debt service, M = the linear distance in miles of the 
community from midtown Manhattan, 77 = median number ol rooms 
per owtiet -occu|>ied house, N = percent ot houses built since 1950, Y 
= median family income, and P = percentol families in the community 
with an annual income of less than $3,000. To these nine variables. 
King adds a tenth: Tax, the median property tax bill by town, defined 
as the product ol T and V. As Oates and King note, in the single¬ 
equation framework just ilescribed, the variables 7i, Z, and /’ or Tax 
will be correlated, thereby necessitating the u.se of two-stage least 
squares ( fSLS) to obtain unbiased estimates ot their respective 
coefficients. Accordingly, all ot the equations were estimated using 
TSLS; the results are presented in table 1. 

Table 1 follows the format of King (1977) in presenting the results 
of the following specifications of the capitalization equation: (la) and 
(Ha) correspond to Oates’s (1969) and (1973) estimates, respectively, 
using the property tax rate as the independent variable measuring the 
extent of capitalization; (lb) and (lib) correspond to King’s first alter¬ 
native specification, using the property tax bill as an independent 

’ In practice, most siandarcl statistical packages do not print the level ol /t* precisely 
enough to enable the le.searcher to identify the inaximutn likelihood estimate. For this 
reason, the use of the F'-statisiic, which conveys the same information as ft* but in .i 
more precise form, is suggested. 
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variable; while (Ic) and (lie) are the estimates from King’s second, 
erroneous specification. Finally, (Id) and (lid) are the results using 
my corrected version of King’s specification. 

Again following King, computed values for the coefficients of Tax 
are presented for my equations (Id) and (lid), to facilitate compari¬ 
sons across equations. These values correspond to capitalization of 
100 percent and 145 percent of the property tax bill, respectively.* 
Oates’s estimates put capitalization at 66 percent and 92 percent, 
respectively, while King’s estimates from (lb) and (Ilb) indicate 40 
percent and 70 percent capitalization and his (erroneous) estimates 
from (Ic) and (He) indicate 63 percent and 67 percent capitalization, 
respectively. Thus, while King finds that Oates’s misspecification of 
the properly tax term results in an overestimation of the extent of 
property tax capitalization, my results indicate that this misspecifica- 
tion results in underestimating the extent of capitalization. 

One might reasonably object to the finding of capitalization in 
excess of 100 percent as inconsistent with capitalization theory. How¬ 
ever, the extent of capitalization depends directly on both the dis¬ 
count rate (r) and the time horizon assumed. As figure 1 illustrates, 
the corrected King technique is only able to identify the ratio of 6 to t 
which maximizes the value of the/■'-statistic. Thus, the finding of 145 
percent capitalization at a 5 percent discount rate and 40-year time 
horizon is also consistent with, for example, 100 percent capitalization 
at a 2.6 percent discount rate and the same time horizon. And, viewed 
as a homeowner’s opportunity cost, r would be defined as the interest 
rate on mortgages minus the rate of appreciation of the house. Given 
mortgage interest rates prevailing in 1960 (when Oates’s data were 
gathered), therefore, a finding of 100 percent capitalization at a 2.6 
percent discount rate may be quite reasonable and not at all inconsis¬ 
tent with the predictions of theory. As f urther evidence on this point, 
it is interesting to note that, for the equations in both groups (1) and 
(II), the highest values of/(* and of the t-statistics consistently obtain 
under the corrected King specifications, (Id) and (Hd). 

Finally, it is quite correct to observe, as did King, that “[i]t can ... be 
objected that this equation is inadequate to represent the very com¬ 
plex bundle known as ‘housing’” (p. 426). Perhaps, as King specu¬ 
lates, ‘‘[t]he instability in the estimated coefficients under different 


" The estimation technique provides a direct estiniale of b, the extent ot capitaliza¬ 
tion; this estimate is then used to compute a coefficient of Tax comparable in inter¬ 
pretation to the coefficiencs ol Oates's original spedficauons. Eqq. (Id) and (Ild) indicate 
capitalization of 86 percent and 124 percent, respectively, at 5 jjercent with an infinite 
time horizon; this is equivalent to capitalization of 100.2 percent and 144.5 percent with 
a 40-year horizon. The present value of a unit stream for 40 years, discounted at 5 
percent, is 17.16. Therefore, 17.16 x 1.002 = 17.19 and 17.16 x 1.445 = 24.80. 
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Fig. I.— Relationship between the F-stattstir, discount rate, and the extent of capitali¬ 
zation in the corrected King technique (ray a'a' is a projection of ray an onto ihe h-r 
plane). 


specification.s noted in the exchange between Pollakows*;i (1973) and 
Oates (1973) is very likely attributable to the fact that each included 
variable acts as a proxy for a great many om.tted but correlated 
variables” (ibid.). To those who would be tempted to cast these results 
in a similar light, however, 1 should note that the alternative specifica¬ 
tions described herein were tested on a much larger and more de¬ 
tailed data set, with much the same results. 

Specifically, the various forms of the capitalization equation were 
applied to a sample of 1,453 single-family houses sold in San Mateo 
County, Calitornia between June 1, 1969 and June 1, 1970;* because 
observations on suitable insirumental variables were unavailable, all 
equations were estimated using ordinary least squaies. Ihe sample, 
however, is noteworthy lor at least two reasons: First, it consists ol 
observations of actual sales of individual houses, in contrast to Oates s 
relatively aggregated data. Second, eath house is described by a de¬ 
tailed set of physical characteristics, public services received, and lax 
bills paid (see the Appendix lor a Kill description ot the variables 
included). 


“ The San Maico tajumy data wen- originally gatlieicd loi a Mudi oi (apiiali/aiion 
and the Tiebout hypothesis conduiicd by Jon C. Sonsirlic ol the L'nneisiiv of Cahroi- 
nia, Santa Barbara, and Paul R Poiiney ol Rexouices lor ihe Funire (Sonstelic and 
Ponney 1980). I he author wishes lo ihank Sonstelie and Poitnev lor knidlv making 
their data available. The resulis lepoiied hcie arc duiwii lioin a lai ger .study (ondiu ted 
by the author lor the Education a>niinission ol the Stales (Reinhard 19iH) 
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An initial attempt to estimate the extent of capitalization using 
King’s suggested form of the dependent variable (eq. [2], with the 
appropriate corrections for b and r) found that the value of the 
F-statistic fell ash was increased; that is, the results implied no capitali¬ 
zation of the property tax. Inspection of the difference in the residual 
sum ol squared errors between the equations in which b took on 
values of 0.2 and 1.0 showed a large difference—an indication ol 
changing variance in the dependent variable.'® 

Estimating the extent of capitalization with equatioti (S) yielded a 
maximum value of the F-statistic as.sotiatcd with 100 percent capitali¬ 
zation at a 3.6 percent discount rate and an infinite time horizon." 
Detailed results of the San Mateo (loiinty regre.ssion are pre.sentecl in 
table Al. Finally, the extent ol capitalization was also estimated using 
a conventional Oates-type equation, with the property tax rate in¬ 
cluded as an independent variable. The results (not shown here) 
indicated 31 percent capitalization of properly taxes, using the same 
discount rate and time horizon—significantly lower than the estimate 
obtained with the cot retted King specification. 


Conclusion 

The results reported here do nothing to tleiract from King’s observa¬ 
tion that "[mlany studies of property values have used a cleat ly incor¬ 
rect specification of the tax effect, one likely to inirotluce a bias of 
uncertain direction and magnitude" (p. 430). However, while King’s 
results indicate that the misspecification leads to an upwaril bias in the 
rax coellicient for his sample, I find that the use of an Oates-type 
e<|uatiou results in a downward bias in this coellicient, both when 
applied to King’s data and to data from San Mateo County, Califor¬ 
nia. Further, on the basis of the usual statistical criteria relating to 
gsiodness ol fit (R‘^) and the robustne.ss ol individual parameter esti¬ 
mates (/-statistics), my correctetl version of King’s specification ap¬ 
pears to offer a significant improvement over the competing aliei na¬ 
tive specifications compared heiein. Whether this is a general result is, 
of course, a question for fulure research. I'his comment, however, 
clearly underscores the importance of the choice ol estimating 
equations in influencing the particular results obtained. 


This imprc.ssjon was substantiated by the icsuiis ot the (ioldteld-Qiiandt lost lor 
hettToscedaslicity. l est statistic ” 1.84, critical value {F with 1,431, 1,431 <11 * 

d() - 1.0. 

“ During the 1-year period in which the houses in the sample were sold, the rate on 
conventional home mortgages was 9 percent. A value of r equal to 3 6 j>erceiu implies 
that the hoii.ses were appreciating at a rate of 5.4 percent [jer year, not an iinrea.sonable 
assumption for the houses in the sample. 
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Appendix 


Definitions of Variables,* San Mateo County Data 


ROOMS 
BA I H 
QUAl.l 
QUA 1.2 

pool. 

RMS1/.K 

AUK 

LOl 

UULSAC 

ALLEY 

NULS 


PLAN 

FIRE 


CRIME 


No. of rooms in the rlwelling, excluding bathrcxnns 
No. of balliiooins 

Oumiiiy variable indicaling average tjualily of consli iiclion 

Dummy variable indicaling good <|uaiity of conMiuclioii 

Dummy variable indiraiing ilie presence of a swimming pool 

Average loom si/e in scpiare feel 

Age ol house 

Si/e of lol m scpiaie feet 

Duiiimv variable indicaling lotaiioii on a cul-de-sac oi dead-end 
St I eel 

Dumnn vaiiable mdicalmg alley behind house 
Dummy variable mdicalmg presence ol a nuisance near properly 
(e g., bus slop, parking lol. church overilow parking, etc.) 
Dummv variable indicaling well-planned neighboi hood 
Insurance Services Oflice of California laliiig ol the cpialilv of 
liK.il lire proteclion, h.iseci on local cleparlmeni. adecjuacy of 
water supply, etc. (low rating incticales better protection) 

FBI Umfviim Crime total per I0l> lesidciiis 


* Varialiles aclapicd troiii Sonstelio and Poitnev (1980) 


TABLE At 

Rn.KrssioN Rrsi i is ior S.vs Mvirot'oisn 


Vai lable 


Coeihe iciil 


t-slalislic 


CDNS FAN 1 

- 19,608 65 

ROOMS 

4.698 11 

BA TH 

3,088 4 1 

QUAl.l 

977 16 

QUAL2 

34.598 91 

POOL 

7.978 61 

R.MSI/.E 

137 11 

AGE 

-213 36 

1.0 1 

1 142 

CLIISAG 

3.010 12 

ALLEY 

2.099 20 

NUfS 

-2.625 10 

PLAN 

I.HI 64 

FIRE 

-923 08 

GRIME 

- 1.945 80 

REG 

- 430 68 

ROAD 

1 .,580 84 

DIS T 

2.794.35 

IMS 12 

108 ,59 

AIR 

- 774 01 

READ 

1.468.13 

EXWADA 

29 <■)(» 


.lie ilehiieil .1' HI >l>< Vpix-Hili' tMH.iiii 

except RH'.irc 

= 97, 

ul flic pieiliurtJ Me.tn 'lej>eiuiei‘t 


9 0 
I:) 0 
d.O 

1 I 
12 3 

3 2 
14 fi 

5 7 
14 6 

2 (i 
1 I 

.3 

1 1 

1 .3 

2 7 

3 1 

4 H 
4 I 
4 ,3 
3 h 

6 2 
li.li 


|i ISl Slju.ufs til 
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RFC 

ROAD 

Disr 

DiST2 

AIR 

RKAD 

EXWADA 


Recreational expenditures per capita 

Exfjenditures per capita on streets and highways 

Linear distance to San Francisco 

Linear distance to San Francisco, squared 

Number of days per year with at least 0.10 ppm high hour 

oxidant concentration 

Improvement in reading level between first and third grade 
(measured in months) 

F.xpenditures on elementary and high school education per 
pupil in average daily attendance (weighted sum) 
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Tax Changes and Cigarette Prices 

Michael T. Sumner and Robert Ward 

University of Saljord 


The recurring question ot the difference between specific and ad 
valorem taxes has recently been revived by Barzel (1976), who argued 
(p. 1185) that . . when quality is not fully controlled, the effects of 
the two types of taxes differ radically. The excise tax induces a 
substitution of quality for quantity, resulting in a greater increase in 
the price per unit than predicted by the constant-cpiality model. The 
ad valorem tax induces a reduction in quality and lower after-tax 
price than predicted.” Evidence consistent with these hypotheses is 
found in the response of cigarette prices to tax changes. In the 
further tests reported by Johnson (1978), prices rise significantly less 
than the tax in New Hampshire, where the state tax was ad valorem, 
and significantly more in the remaining states, which impose specific 
taxes. 

The purpose ot this note is to question not Barzel s analysis but 
rather the interpretation of the evidence, on the pedestrian grounds 
of implausibility. As Barzel himself observes (p. 1194, n. 32), "the size 
of New Hampshire makes it unlikely that cigarettes would be tailor- 
made for it. The scope for removing characteristics is quite limited, 
then.” The scope for adding characteristics in other stales seems 
equally limited. To avoid misunderstanding it should be emphasized 
that the hypothesized variations in quality are a supply-side phenom¬ 
enon, not a demand-side response to the relative cheapening of 
superior brands when a specific tax is increased. Johnson’s estimates 
imply that the 9<t increase in the New Hampshit e tax over the period 
1954-76 should have reduced the net price by 3.6«!, ceteris paribus; 
the sHkIhIv larger mean tax increase in the remaining 48 continental 
states should have raised net price by 0.90. What changes m the 
quality of the services sold with cigarettes could, have justified such a 
differential? What are the observable counterparts of the tax 
changes? In the absence of a convincing answer to those questions, the 
estimated response of cigarette prices to taxes represents a fact in 
search of a theory. 


lyoumol ly PMaual Kamomi. 19SI. vol. ,^,,j,*l^0(„SjOl 50 

© 1910 by t he llnu-rtMly of Chicago OOSZ 
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TABLE I 



Johnscjn's Cicarei 

-IE Re(.kcssion Reslicts 


Variable 

(Coefficient 

Stanciard Error 

T'-Value 

Consianl 

13.651 

.439 

31.078 

To/n, 

l.lOl 

.013 

87.028 


No IK — H* - ‘Mi5, nuinbri <it ob>ierv.itioi)s = l.KW. 2‘2 year duniiiiics. 50 diiminics, and a New Hampslurc 
lax diiininv were also included ir\ ihc regression e4|uatutn 


T'he explanation suggested here is that price changes are discon¬ 
tinuous because of the cost of change itself , the uncertainty of com¬ 
petitors’ reactions, and the indivisibility of the unit of account. When 
there is a major change in costs, the retail margins eroded by continu¬ 
ous changes in minor cost components will be restored, so that retail 
price rises by more than the initiating factor; the latter therefore 
appears to exert a disproportionate influence on prices, unless ap¬ 
propriate allowance is made for the backlog of small increases whicfi 
were not worth implementing in isolation. If inflation is not antici¬ 
pated, the at ttial price will lag behind the eqtiilibi iuin price except at 
the moment of aeijustment; if it is anticipated, the actual and equilib¬ 
rium price levels will coincide on average in the period between 
adjustinents.' The tests reported below cannot disci iminate between 
these alternatives but presume that one or other mode of behavior 
prevailed throughout the period. 

To illustrate the relevance of this pro.saic suggestion to cigarette 
prices, consider Johnson’s equation: 

P T ■ 

- = «„ + O’, + dummy variables, (1) 

lb lb 

whereis cigarette price in cents per pack, T is tax in cents per pack, 
and n is the consumer price index on a unit base; subscripts i and;, 
respectively, denote time and state effects for which the dummy 
variables are entered as controls. His estimates of the critical parame¬ 
ters are reproduced in table 1. The inteicepi, which registers the 
effect on nominal cigarette prices of general inllation, implies that 
during the first half of tlie .sample period, when the consumer price 
index was rising very gently, a 10 price increase would have occurred 
at average intervals of more than ,5 years in the absence of other 
disturbances. Tax changes took place more frequently, so it would not 
be surprising if price changes were concentrated in years when this 
major component of retail price increased. In the .second half of the 

' We are indebted tci Barzel tor drawing our attention to this distinction 
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(aoefhcieni 
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/•-Value 

(aonstani 
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050 

670 

Alb 

8 228 


1 270 

6.480 

M'u 

928 


036 

25.792 

^U^k=r7l+I AFIfc 

2 409 


669 

3.691 

^w^ 

1.020 


til 

9,151 

\<iir —- 744. timnljei 

(if iilisri v.itioiis 7H4 





period general intiation accelerated and tax changes became less 
frequent, so tfiat the modeling of discontinuous price changes be¬ 
comes less critical. 

One simple method of distinguishing between the direct effect of 
tax changes and their indirect ef fect of facilitating the restoration of 
margins is to add an additional variable to a hrsi-differenced nominal 
version of Johnson’s basic lormulation. The estimating equation pro¬ 
posed is 

I 

AF,ja^n, + |3^T„ +yt,j V All*. (2) 

= m +1 

where/(j = 1 if AT",; 7^0, = 0 otherwise. Subscript w indicates the date 
of the previous tax change in siatej. This foi mulation differs tiom a 
transformed version of Johnson’s equation (1) in not permitting the 
effect of general inflation to differ acro.ss states and time through 
the dummy variables; instead, effects of time specific to the cigarette 
industry were captured by adding the change in wholesale cigarette 
prices (AfV) to equation (2). and the residuals were scrutini/cd for 
state effects. 

Thus augmented, equation (2) was estimated for 4r; states which 
impose a specific ta.x* with tfie results shown in table 2. 1 be mteicept 
is appropriately insignificant and, as anticipated, wholesale price 
changes have a coefficient which does not differ signihcantly from 
unity. The inclusion of the latter variable reduces the estimated effect 
of general iiiHation but does not affect the tnain conciusions. I hese 
are that the diffused sources of price changes represetited by getieral 
inflation enter both directlv and in conjunction with tax i hanges; and 
that otice allowance is made foi the ind.iecl itiHuence of tax changes 
in effecting backlogged price increases, the coeflicietit oti tax change 


»Wc- arc gr.uclul ,o I.itii.si.n tor provi.ling l.is claia. Hi.sl ditterciu.ng • » ■ 
impririantly the conanicion of the ba. klog s.inahle .e< ..red 

784 observation., .ind eliiniiuic.l three su.e, .n whieh onlv a singU lax. hange ..c.uri 
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TABLE 3 

ClCARETTK RCCRESSION RESULTS; 11 STATES 


Variable 

Coefficient 

Standard Error 

T-Value 

Constant 

.046 

.081 

.569 

m, 

5.158 

2.252 

2.291 

ATu 

.698 

.061 

11.442 


5.910 

1.230 

4.805 

AIT, 

1.2.36 

.196 

6.300 


Noi> —R • = 727, imrtiber cif observations ■ 231 


itself becomes significantly less than unity. Finally, inspection of the 
residuals does not indicate any need to incorporate dummy variables 
for state effects; The largest mean state error is less than half the 
standard deviation of the residuals. 

It was suggested above that the association between tax changes and 
the restoration of margins eroded by mild inflation would be stronger 
the lower the general inflation rate. That suggestion was verified by 
fitting the estimating equation to the sample of 11 states for which 
data begin in 1956, the earliest possible date. The major differences in 
results are immediately apparent in table 3. The most obvious change 
is the lower point estimate of the tax-change coefficient. Second, the 
sum of the coefficients on the catch-up variable and contemporaneous 
price change remains about the same, but their relative size changes 
dramatically: Fax changes are much more important as an occasion 
for effecting backlogged price increases in this sample, in which the 
earlier, more quiescent inflation experience is more heavily weighted. 

The major question remaining is how well the model fits New 
Hampshire. For the period 1955-76® the mean forecast change ex¬ 
ceeds the actual increase by 13 percent; Theil’s {/-statistic is 0.362. 
These figures are difficult to appraise in isolation but do not seem out 
of line with normal forecasting standards. That impression is 
strengthened by the observation that in 10 states the mean residual 
exceeds the average forecast error in New Hampshire. Finally, the 
largest individual error occurs in 1975, when the predicted price 
increa.se exceeded the actual rise by more than 1.50 to yield a propor¬ 
tionate error of 50 percent. This is particularly inconvenient for 


’ T<i avoid losing early observations additional data were collected from the Tobacco 
Tax Council (1978) The tax series for New Hampshire differs from that provided by 
Johnson in two lespeccs; 'fhere is an automatic change in 1957 which docs not appear 
in Johnson's data until I960; and a single legislated change in 1971 is split between 
1971 and 1972 in Johnson’s data. The revised data are used here. The analysis of the 
forecasts reveals no difference between the impact of legislated and automatic tax 
changes. 
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Barzel and Johnson because the form, as well as the rate, ol tax was 
changed in that year from ad valorem to sjiecific. 

For present purposes, the most important result is the negative one 
that the tax-change coefficient is not significantly greater than unity; 
but, from a broader perspective, the conclusion that it is significantly 
less than unity is at least equally interesting. A possible explanation 
for the difference between this coefficient and that on wholesale price 
changes is interstate competition. In this context it is particularly 
noteworthy that the state which lies furthest below the regression 
plane is geographically isolated Maine, where the levels of taxes and 
prices have consistently exceeded those in neighboring New Hamp¬ 
shire. Of the two price increases not associated with intrastate tax 
changes and exceeding 1^ in magnitude, one coincided with and the 
other followed a year after tax changes in New Hampshire. Mas.sa- 
chusetts and Vermont also lie below the regression plane and have 
consistently experienced higher tax and price levels than New Hamp¬ 
shire; in both cases, three of four large price changes unconnected 
with domestic tax changes coincided with tax changes in New 
Hampshire. These selective observations arc suggestive but hardly 
constitute evidence. It is, however, beyond the scope ot this note to 
construct a scalar measure of state openness and to test its interaction 
with tax changes. 

There remain unexplored possibilities of modeling discontinuous 
price changes more faithfully, but tor present purposes it is not 
necessary to putsue them. It has been shown that there is no dif¬ 
ference between the effects of specific and ad valorem taxes on ciga¬ 
rette prices and that the impact of taxes per se on pi ices is signifi¬ 
cantly less than one for one. 
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Miscellany 


Twas a Night in the Sixties 

’ Twas a nighi in the sixties 
And all through ihc land 
Unemployment vvas tailing 
Inflation in hand. 


I'he sttx'k. market was rising, 
Without any care, 

In hopes a Dow thousand 
Soon would be there. 


The Keynesians were stiuggled 
Secure itt their (Chairs, 

While visions of multiplieis 
Allayed all their cares. 


Paul with his textbook 
And Art with his gap 
Had settled their brains 
For a long postwar nap. 


When out in the land 
I'here atose such a clatter, 

A voice that was crying 
Fhat money could matter. 

This study extends earlier work hy C C. Moore, to whotii tny dehl is obvious. It was 
presented at the 1980 Harvard Esonomics Dcparlment Suident-Faeulty (ihrislmas 
party. Altlutugh I am always grateful to the National Science Koundalion and National 
Bureau ol Economic Research, neither organtzation provtded any financial sitppoit 
for this project. 
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Away froi7i their desks 
They hew in a Hash 
To see who was claiming 
Such power lor cash. 


1 hey looked at their models 
With etjualioiis piecise. 

That gave semblance ot prool 
To ccjnclusions so nice. 

When what l<t their wondering 
Eyes should appear 
But a miniature sleigh 
With eight tiny reindeer 

With a little old driver 
Who was having such fun 
They knew in a moment 
It must Ise .Vliltt'ii. 

More numerous than eagles 
llis supporters they came 
And he whtstled and shoittotl 
And tailed them bv name. 

First ]ohn Say and then Hutne, 

I hen Maishall and .Mill, 

Now Brunner and Melt/ei 
Atid .Anna and Phil. 

Fiom the U. ol Chicago 
lo Minneairohs-St. Paul 
Then tiash awav! Dash awas! 
D.tsh aw.iy .ill! 

■As economic theories 
With which economists pla\ 

When iliev meet with an obstacle 
Assume it awav, 

So ott to tlie jottt nals, 
rheir pairers iliev Ucw. 

With nioiietarist theotetns, 
Ration.tl exjteci.titttns uk). 
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And even in Cambridge 
Was heard the new truth, 

The theorems and lemmas 
Of each little proof. 

The Keynesian thinkers 
Were spinning around 
When onto the scene, 

Milton came with a bound. 

He was dressed ail in gold 
From his head to his foot 
And his ideas were polished 
And ready to put. 

“Velocity’s stable, 

M 1 and M2, 

Which shows what the Fed 
Shouldn’t be trying to do.” 

“That curve by Phillips 
It really is straight 
And the co.st of funds 
Is the real interest rate. ” 

He wrote many a word. 

And with evidence too. 

At the NBER 

His volumes they grew. 

His ideas how simple. 

He puls them so well. 

It would be no wonder 
When he got his Nobel. 

A wink of his eye 
And a nod of his head 
Soon gave Keynesians to know 
I'hey had something to dread. 

Fhen he sprang to his sleigh 
To his team gave a whistle 
And away they all flew 
Like the down of a thistle. 
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But I heard him exclaim 
As he drove out oi sight, 

“Keep freedom for all. 

And keep money tight.” 


Mar tin S. Ffi os tein 


National Bureau of Economic Research 
Harvard University 



Policy Research 


The title of this discourse is “Policy Research," a contradiction in 
terms, [ts purpose is to explain that the demise of this once proud 
office was not an accident of circuinstance hut an inexorable conse¬ 
quence of a tendency which has been at the heart of political systems 
since the Middle Ages. 

The elements which come together to explain thi.s phenomenon 
were collected in various places. 

The first element appeared at the Imerttational Trade Commis¬ 
sion, where for a year 1 was privileged to observe both the tlelibera- 
tions of the commission to reach decisions and the diligent efforts of 
tlieir staff to produce a staff report. But there was no relationship of 
otie to the other. Indeed, the place where the commission's tiecision 
and the staff report tame together was always the piintshop. 1 notice 
that though its purpose was obscure, the stall report did serve a 
function—to misdirect and to obfuscate. Having been exposed to the 
insfimtionalisi tiadition, I was confident that this funciit)n servetl 
some purpose, but 1 was unable to deterntinc what that purpose was. 

Years passed, and I observed that in nearly every town and city of 
Kurope, there was a “Jews quarter” or Ruelle des Juifs in the old 
commercial part of town. This treasure, too, I stored in the backyard 
of my mind. 

Finally, a book by Lester Thurow reminded us that /ero-sum deci¬ 
sions are ven difficult for a democratic political process to make. Then 
things began to fall info place. The purpose of misdirection and 
obfuscation became obvious. By making the deci.sion proce.ss difficult 
to understand, they di.scourage those with a diffuse or marginal 
interest from participating in the decision; that is, they tent! to disen¬ 
franchise the opponents to a petition for protection and hence trans¬ 
form a zero-sum situation into a positive-sum decision —a decision f rom 
which ail the effective participants will gain. 

This, then, is the essence of the decision process. Unless the great 
clamor of procedure confuses one side about what is going on, there 
can be no decision, that is, no policy. 

Presented .September 4, I9tt(>, at the U.S. Trea.sury I>cp.iriiiieiit, on the txta.sion i)l 
the deparluie of Vingcr and the tlissolutn>n o( the Office of I'lade Research, of which 
Kingcr was the director. 

\Joumal of Political F.roiiomy, 1 9H I , vol H9, no. 6 ) 

© I9HI by Ihc UniverMty of Chicago 0022-5»08/HI/B906-<K)18$0l 50 
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Of course, if procedure confuses both sides, the government is in 
the ideal situation of not having to make a decision at all. This occurs 
with some frequency. 

But if one side sees through the obfuscation and the other does not 
(the result which the system .seems designed to produce), then the 
aware side’s interests will be disproportionately weighted in the deci¬ 
sion process, and e(|uilibrium will be “loo far” in their direction. In 
summary, the policy prcKess in an open political system 

(1) does not do a lot of things it should do, 

(2) appears to do a lot of things it does not do, and 

(If) overdoes everything it does do. 

Good research whicli accuratelv itlentifies the costs and benefits of a 
course of action is, in any of these avenues, more likely to be embar¬ 
rassing than welcome. It returns attention to the direct conllict which 
had to be nvaided before a coiiise of action—a policy—could be set. 

I bis means th;it a person who delivers a novel piece of analytical work 
to his Political Master will be made to feel like a woman wbit biings a 
new child to her husband—who had a va.sectoiny 5 years beloie. I he 
Master is less likely to share her jov in the birth ol a new ihing than to 
decry this evidence of her having conspired with mean spirits. Politi¬ 
cal responsibility is, you see, the ultimate intellectual 'asectomy. 

I'he history of research in government agencies is like the history of 
jews in pre-lndttsti ial Revolution Europe 1 be Gentiles ol a lown 
would decide that they wanied to establish their town as a center ol 
commerce. 1 hey wouid then invite the praclilioneis ol this science 
into then lown and set aside a place where ihev might live and 
practice. But, the jews would practice their nade hv i tiles so basic that 
they would not allow then gentlemen compc'iiiors, lo whom com¬ 
merce was a diversion and a livelihood but not a hfe-style, to turn a 
profit. So, slowly the number ol streets open to the Jews would be 
restricted tiiicl the number ol products thev were allowed to iiade 
reduced. Or, thev would be limited to the wholesale tr.ide, m the hope 
that gentlemen could turn a profit retailing the merchandise the Jews 
gathered. 

But their numbers wcnild dwindle, not so imuh becanse they had 
been sent away but because other Cieiitiles in other towns bad decided 
that they should establish centers of commerce and had lured them 


away, to begin the process again. 

As we see across Em ope the signs announc ing the lormer presence 
of lews, we see acro.ss m gani/aiion c harts of government agencies the 
street signs of research. But the original citi/ens of these quarters 
have usually gone, and their places have been taken bv retoim con¬ 
gregations, or even Gentiles. I hus. we. the researchers, like the Jews 
of the Middle Ages, are cursed to wander Horn agenev to agenev. 
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deprived of home and property by our inability to lose track of what 
we do.' 

J. Michael Finger 

World Bank 

* I'he mortal remains (former employees) of the dissolved Office of 1 rade Research 
were sent away with a memento inscribed "There are no Statesmen in this business. 
Trade theory is about whose hand is in whose pocket and trade policy is about who 
should lake it out." It was noted gratefully that as they departed, their temple would 
be dismantled and the signs in their quarter taken down. Thus they would be spared 
the indignity of seeing a reform congregation move in and eat pork and write briefing 
papers where tince they followed the old law. The spirit of the office was also dis¬ 
part lied. It was represented by a full frontal picture of a charging infantry brigade, 
inscribed. “We owe our success to a simple underlying philosophy." 
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